Pengxiang Ding

Research focus

Multimodal Models (2) · Robotics & Embodied AI (2) · Computer Vision (1) · Eval Frameworks & Benchmarks (1)

Frequent co-authors

Wenxuan Song (2) · Yang Liu (1) · Teng-Long Jiang (1) · Xudong Wang (1)

Papers (2)

Tsinghua AI · Mar 26, 2026 · also ZJU

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Ditch the clunky architectures: a single diffusion model can now handle vision, language, and robot control to achieve SOTA manipulation performance.

Yang Liu, Pengxiang Ding, Teng-Long Jiang +10

Computer Vision · Multimodal Models · Robotics & Embodied AI

Feb 26, 2026 · also Galbot, TU Munich, Xidian

Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline

A practical VLA model, LLaVA-VLA, achieves strong generalization and versatility on a new benchmark, CEBench, while running on consumer-grade GPUs, eliminating the need for costly pre-training.

Wenxuan Song, Jiayi Chen, Xiaoquan Sun +11

Eval Frameworks & Benchmarks · Multimodal Models · Robotics & Embodied AI
