Jianshu Zhang

Northwestern University

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Multimodal Models (3), Robotics & Embodied AI (3), World Models & Planning (2), Natural Language Processing (1)

Frequent co-authors

Haoran Lu (3), Han Liu (3), Mutian Shen (2), Yu Xiao (2)

Papers (5)

Jun 16, 2026

Also: Tsinghua AI, PKU

AnnotateAnything: Automatic Annotation of 3D Assets for Robot Manipulation

AnnotateAnything automates the annotation of 3D assets for robot manipulation and reports gains in annotation efficiency and task success rates.

Haoran Lu, Mutian Shen, Yu Xiao +8

Multimodal Models, Robotics & Embodied AI

Also: Tsinghua AI, PKU, ShanghaiTech, University of California

MagicSim: A Unified Infrastructure for Executable Embodied Interaction

MagicSim merges diverse world construction and execution into a single framework for robot learning, supporting both evaluation and interaction.

Haoran Lu, Songling Liu, Yue Chen +14

Robotics & Embodied AI, World Models & Planning

May 28, 2026

Also: Northwestern

PEARL: Training Socratic Tutors with Pedagogically Aligned Reinforcement Learning

Socratic tutors can be effectively trained via RL by decoupling student cognitive states, using generative pedagogical rewards, and stabilizing multi-objective optimization.

Qikai Chang, Zhenrong Zhang, Linbo Chen +4

Natural Language Processing, RLHF & Preference Learning, Tool Use & Agents

May 22, 2026

SPACENUM: Revisiting Spatial Numerical Understanding in VLMs

VLMs struggle to meaningfully ground numerical outputs in spatial contexts, often performing at chance levels in critical tasks.

Jianshu Zhang, Huifeixin Chen, Haoran Lu +3

Multimodal Models, Robotics & Embodied AI

Mar 3, 2026

Also: Dolby Laboratories

Phys4D: Fine-Grained Physics-Consistent 4D Modeling from Video Diffusion

Video diffusion models can now generate physically plausible 4D worlds thanks to a new pipeline that combines pretraining, supervised fine-tuning, and reinforcement learning.

Jianshu Zhang, Maojiang Su, Chenwei Xu +2

Computer Vision, Multimodal Models, World Models & Planning
