Ye Wang

Papers on Lattice


Publication activity (papers/week, last 8 weeks)

Research focus

Robotics & Embodied AI (5), Multimodal Models (5), Tool Use & Agents (2), Scaling Laws & Emergent Abilities (1)

Frequent co-authors

Anzhe Chen (4), Yiyang Huang (4), Jiazhao Zhang (4), Gengze Zhou (4)

Papers (7)

Jun 16, 2026

Haoqi Yuan +22 · 3d ago · also ZJU

Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models

Qwen-RobotManip achieves unprecedented generalization in robotic manipulation by effectively aligning diverse data sources, outperforming existing models across multiple challenging benchmarks.

Haoqi Yuan, Zhixuan Liang, Anzhe Chen +20

Robotics & Embodied AI, Scaling Laws & Emergent Abilities

Jiazhao Zhang +32 · 3d ago · also SJTU

Qwen-RobotNav Technical Report: A Scalable Navigation Model Designed for an Agentic Navigation System

Qwen-RobotNav achieves unprecedented flexibility in navigation tasks by allowing real-time reconfiguration of its observation strategy, setting new benchmarks in the field.

Jiazhao Zhang, Gengze Zhou, Hale Yin +30

Multimodal Models, Robotics & Embodied AI, Tool Use & Agents

Jun 15, 2026

NUS · 4d ago

Joycent: Diffusion-based Accent TTS without Accented Phone Prediction

Joycent synthesizes accented speech directly from standard phone sequences, eliminating the need for error-prone accented phone predictions.

Xintong Wang, Ye Wang

Speech & Audio

Jie Zhang +37 · 4d ago · also Tsinghua AI

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Language-driven video generation in Qwen-RobotWorld achieves unprecedented accuracy in predicting robotic actions, outperforming existing models across key benchmarks.

Jie Zhang, Xiaoyue Chen, Anzhe Chen +35

Multimodal Models, Robotics & Embodied AI, World Models & Planning

Jun 8, 2026

K Challenge · 1w ago · also CAS, Mitsubishi Electric Research Laboratories (MERL)

ReCoVLA: VLM-Guided Reward Compilation for Failure Recovery in Vision-Language-Action Policies

A novel reward compilation approach boosts VLA policy success rates by over 30% in both simulated and real-world manipulation tasks.

Haodi Hu, Chung-Ta Huang, Jing Liu +4

Multimodal Models, RLHF & Preference Learning, Robotics & Embodied AI

May 28, 2026

Qiuyue Wang +43 · 3w ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

One model to control them all: Qwen-VLA achieves impressive zero-shot generalization across diverse robotic tasks and embodiments by unifying vision-language-action modeling.

Qiuyue Wang, Mingsheng Li, Jian Guan +41

Multimodal Models, Robotics & Embodied AI, Tool Use & Agents

Apr 30, 2026

Apr 30, 2026 · also China Mobile, Hamburg

TripVVT: A Large-Scale Triplet Dataset and a Coarse-Mask Baseline for In-the-Wild Video Virtual Try-On

Ditch the garment masks: a simple human mask is all you need to nail video virtual try-on in the wild.

Dingbao Shao, Di Shao, Songhan Wu +12

Computer Vision, Data Curation & Synthetic Data, Multimodal Models
