Yi Wang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (8)Robotics & Embodied AI (3)Training Efficiency & Optimization (3)Reasoning & Chain-of-Thought (3)

Frequent co-authors

Xiaoyue Chen (3)Xiao Xu (3)Yanran Zhang (3)Zekai Zhang (3)

Papers (12)

Jun 15, 2026

Jie Zhang +372d ago·also ZJU

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Language-driven video generation in Qwen-RobotWorld achieves unprecedented accuracy in predicting robotic actions, outperforming existing models across key benchmarks.

Jie Zhang, Xiaoyue Chen, Anzhe Chen +35

Multimodal Models Robotics & Embodied AI World Models & Planning

Jun 11, 2026

ETH6d ago·also BAIR, Tsinghua AI, PKU, Shanghai Innovation +3

FTP-1: A Generalist Foundation Tactile Policy Across Tactile Sensors for Contact-Rich Manipulation

FTP-1 not only excels on familiar tactile sensors but also achieves unprecedented success on unseen setups, redefining the potential for cross-sensor generalization in robotic manipulation.

Wendi Chen, Yi Wang, Zhuoyang Liu +5

Multimodal Models Robotics & Embodied AI

Jun 10, 2026

College of Computer Science and Technology1w ago·also Chongqing, Chongqing Key Laboratory of Computational, construction Key Laboratory of Digital, Key Laboratory of Cyberspace Big Data +2

Efficient Time Series Clustering from Multiscale Reservoir Dynamics with Granular-Ball Anchoring Graph Optimization

MSRGC-Net achieves state-of-the-art clustering performance with drastically reduced computational overhead by eliminating the need for iterative training.

Lifeng Shen, Shuyin Xia, Yi Wang

Training Efficiency & Optimization

1w ago·also CAS, NJU, Shanghai Innovation

InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning

Efficient context handling in video tasks can elevate multimodal models to new heights of agency and reasoning capability.

Ziang Yan, Sheng Xia, Jiashuo Yu +8

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

Jun 4, 2026

1w ago·also Fudan, NJU, PKU, Shanghai Innovation

ViCuR: Visual Cues as Recoverable Privilege for Multimodal On-Policy Distillation

Teacher privilege in multimodal reasoning is redefined, showing that visually grounded cues can lead to superior performance in on-policy distillation.

Kanghui Tian, Siyuan Liu, Ziang Yan +3

Multimodal Models Reasoning & Chain-of-Thought

1w ago·also Fudan, HKU, NJU, Shanghai Innovation +2

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

Future-L1 shows that preserving visual semantics in latent space can dramatically enhance video event prediction accuracy, outperforming previous models by substantial margins.

Tianxiang Jiang, Linquan Wu, Sheng Xia +4

Multimodal Models Reasoning & Chain-of-Thought World Models & Planning

Jun 2, 2026

Tianhe Wu +232w ago·also Trento

Qwen-Image-Flash: Beyond Objective Design

Rethinking few-step distillation reveals that the training pipeline's organization is as crucial as the distillation objectives themselves.

Tianhe Wu, Kun Yan, Zikai Zhou +21

Inference & Quantization Multimodal Models Training Efficiency & Optimization

May 27, 2026

Niantong Li +313w ago

Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation

Existing text-to-image benchmarks miss the mark on real-world artistic creation, but Qwen-Image-Bench finally provides a creator-centric evaluation that reliably distinguishes state-of-the-art models.

Niantong Li, Guangzheng Hu, Wei Qiao +29

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

May 25, 2026

Wenbin Zou +63w ago·also Shenzhen Loop Area Institute, SYSU

SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution

Ditch the rigid grid: SP-MoMamba uses superpixels to let Mamba-based super-resolution models "see" images like humans do, boosting performance and efficiency.

Wenbin Zou, Yawen Cui, Yi Wang +4

Architecture Design (Transformers, SSMs, MoE)Computer Vision Training Efficiency & Optimization

May 4, 2026

May 4, 2026·also Central South University, Fullive Innovation (Beijing) AI Technology Co., HFUT, HKU +3

AcademiClaw: When Students Set Challenges for AI Agents

Today's best AI agents can only solve 55% of real-world academic tasks that university students find challenging, revealing a significant gap between current AI capabilities and the demands of academic workflows.

Junjie Yu, Pengrui Lu, Weiye Si +69

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

May 1, 2026

Yi Wang +16May 1, 2026

Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies

Generalist robot policies can achieve 95% success rates on real-world manipulation tasks by continually learning from a fleet of robots, even in the face of distribution shifts and long-tail failures.

Yi Wang, Xincheng Li, Pengwei Xie +14

Multimodal Models RLHF & Preference Learning Robotics & Embodied AI

Apr 27, 2026

Apr 27, 2026·also BUPT

Listen to the Voices of Everyday Users: Democratizing Privacy Ratings for Sensitive Data Access in Mobile Apps

User-driven privacy ratings of mobile apps reveal significant discrepancies with expert assessments, suggesting a need for more inclusive and user-centric privacy evaluation mechanisms.

Liuan Wang, Liu Wang, Tianshu Zhou +2

Constitutional AI & AI Ethics Natural Language Processing

Search

Yi Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (12)