Kaichen Zhang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Eval Frameworks & Benchmarks (1)Robotics & Embodied AI (1)

Frequent co-authors

Bo Li (2)Xiang An (1)Yin Xie (1)Feilong Tang (1)

Papers (2)

May 25, 2026

Xiang An +283w ago·also ERNIE Team, Monash, S-Lab, SenseTime +1

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

LLaVA-OV-2's codec-stream tokenization lets it crush existing video-language models, especially in tasks requiring fine-grained temporal understanding of high-frequency motion.

Xiang An, Yin Xie, Feilong Tang +26

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

May 19, 2026

evolvinglmms-lab.github.io/ParaVTMay 19, 2026·also HKUST

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

RL fine-tuning LMMs for tool use can collapse structural formats due to strong pretrained tool priors, but a surprisingly simple fix of targeted format rewards and frame-budget randomization can restore stability and boost performance.

Zuhao Yang, Kaichen Zhang, Sudong Wang +7

Multimodal Models Robotics & Embodied AI Tool Use & Agents

Search

Kaichen Zhang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)