Forget hand-crafted prompts: Ctx2Skill lets language models bootstrap their own skills from context, learning to reason better without any human labels.
Multi-frame monocular scene flow estimation gets a serious boost with RAFT-MSF++, which uses Geometry-Motion Feature fusion to achieve state-of-the-art results and improved robustness to occlusions.
Synthesizing realistic anomaly images for industrial assembly is now possible thanks to a diffusion model that respects component pose and assembly relationships.
Video-LLMs can be sped up by nearly 3x without sacrificing performance, simply by loosening the strict matching requirements of speculative decoding and focusing on visual-semantic relevance.
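The idea of loosening speculative decoding's acceptance rule can be sketched in a few lines. This toy (not the paper's actual criterion — the threshold `tau` and the ratio test are illustrative assumptions) accepts a drafted token whenever the target model assigns it at least a fraction of the draft model's probability, instead of strict rejection sampling:

```python
def relaxed_accept(p_draft, p_target, tau=0.3):
    """Illustrative relaxed criterion: accept the draft token when the
    target model gives it at least tau * the draft model's probability.
    Strict speculative decoding corresponds to a much harsher test."""
    return p_target >= tau * p_draft

def speculative_step(draft_tokens, draft_dists, target_dists, tau=0.3):
    """Keep the longest prefix of drafted tokens that passes the
    relaxed acceptance test; stop at the first rejection."""
    accepted = []
    for tok, q, p in zip(draft_tokens, draft_dists, target_dists):
        if relaxed_accept(q[tok], p[tok], tau):
            accepted.append(tok)
        else:
            break
    return accepted
```

A smaller `tau` accepts more drafted tokens per step (more speedup, looser match); `tau=1.0` approximates a strict accept-only-if-target-agrees regime.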
Visual token dominance is the hidden culprit behind LVLM inference inefficiency, and this paper dissects the problem to reveal how to navigate the fidelity-efficiency tradeoff.
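One common way to act on visual token dominance — kept deliberately generic here, since the teaser doesn't specify the paper's method — is to prune low-attention visual tokens, with a keep ratio that directly exposes the fidelity-efficiency tradeoff:

```python
def prune_visual_tokens(attn_scores, keep_ratio=0.5):
    """Keep the top-k visual tokens by attention mass (indices returned
    in original order). Smaller keep_ratio = fewer tokens = faster
    inference but lower fidelity. Purely illustrative pruning rule."""
    k = max(1, int(len(attn_scores) * keep_ratio))
    top = sorted(range(len(attn_scores)),
                 key=lambda i: attn_scores[i], reverse=True)[:k]
    return sorted(top)
```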
RLVR models exhibit "Early Correctness Coherence" under noisy supervision, suggesting a surprising opportunity for self-correction via dynamic label refinement.
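Dynamic label refinement of the kind hinted at here can be sketched as: if the model's recent verdicts on a sample consistently contradict its (possibly noisy) reward label, flip the label. The window size, threshold, and class name below are assumptions for illustration, not the paper's algorithm:

```python
from collections import deque

class LabelRefiner:
    """Toy dynamic label refinement for binary (0/1) reward labels.
    Tracks the model's recent verdicts per sample; once a full window
    of verdicts overwhelmingly disagrees with the stored label, the
    label is flipped. Hypothetical sketch, not the paper's method."""

    def __init__(self, window=5, flip_threshold=0.8):
        self.window = window
        self.flip_threshold = flip_threshold
        self.history = {}  # sample_id -> recent model verdicts

    def observe(self, sample_id, model_verdict, label):
        h = self.history.setdefault(sample_id, deque(maxlen=self.window))
        h.append(model_verdict)
        if len(h) == self.window:
            disagree = sum(1 for v in h if v != label) / self.window
            if disagree >= self.flip_threshold:
                return 1 - label  # flip the binary label
        return label
```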
Robots can now follow language navigation instructions more reliably even when objects block their view, thanks to a new method that reasons about the environment in a bird's-eye view rather than relying on visible pixels.
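The core intuition — that a top-down map registers objects even when they are occluded in the camera view — can be shown with a minimal occupancy-grid projection (grid layout and cell size here are made-up parameters, not the paper's representation):

```python
def to_bev(points, grid_size=4, cell=1.0):
    """Project 3D points (x, y, z) onto a top-down occupancy grid,
    discarding height. An object hidden behind another in the camera
    image still lands in its own BEV cell. Illustrative sketch only."""
    grid = [[0] * grid_size for _ in range(grid_size)]
    for x, y, _z in points:
        i, j = int(x // cell), int(y // cell)
        if 0 <= i < grid_size and 0 <= j < grid_size:
            grid[i][j] = 1
    return grid
```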
Achieve 95% recovery success in robotic manufacturing by giving vision-language models a persistent, queryable memory of the world.
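A "persistent, queryable memory of the world" reduces, at its simplest, to an append-only log of timestamped observations that a planner can query for the latest known state. The schema and names below are invented for illustration and say nothing about the paper's actual memory design:

```python
import time

class SceneMemory:
    """Minimal persistent scene memory: append timestamped object
    observations; queries return the most recent known state, letting
    a recovery policy reason about objects it can no longer see.
    Hypothetical sketch, not the paper's implementation."""

    def __init__(self):
        self._log = []  # list of (timestamp, object, state)

    def record(self, obj, state, t=None):
        self._log.append((t if t is not None else time.time(), obj, state))

    def query(self, obj):
        # Scan newest-first for the latest observation of this object.
        for _t, o, s in reversed(self._log):
            if o == obj:
                return s
        return None
```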