Tao Wang

ByteDance

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (3)Multimodal Models (3)Recommendation & Information Retrieval (2)Reasoning & Chain-of-Thought (2)

Frequent co-authors

Haoxiang Sun (2)Jian Zhao (2)Qijia He (1)Jiayi Cheng (1)

Papers (8)

Jul 21, 2026

5d ago·also UW, ByteDance, NYU, Ohio State

CodeRescue: Budget-Calibrated Recovery Routing for Coding Agents

Recovery routing can outperform escalation strategies by leveraging execution feedback, achieving a higher solve rate at only 35% of the typical recovery cost.

Qijia He, Jiayi Cheng, Chenqian Le +6

Code Generation & Program Synthesis Tool Use & Agents

Jun 29, 2026

3w ago·also ByteDance, Waterloo

Legible Shared Autonomy: Implicit Communication of Robot Belief through Motion

Users can now intuitively grasp a robot's inferred goals through its motion, reducing control effort and enhancing collaboration.

Jinwei Liu, Pengfei Li, Shaofeng Chen +2

Robotics & Embodied AI

Jun 25, 2026

Changxin Lao +43Jun 25, 2026·also ByteDance, Hunan, Kuaishou, Lingnan University +2

AgentX: Towards Agent-Driven Self-Iteration of Industrial Recommender Systems

AgentX can autonomously iterate on recommendation algorithms, outpacing human-driven processes and fundamentally changing how we approach system development.

Changxin Lao, Fei Pan, Guozhuang Ma +41

Recommendation & Information Retrieval Tool Use & Agents

Jun 24, 2026

Jun 24, 2026·also ByteDance, Northwestern, PKU, Waterloo

From Structure to Synergy: A Survey of Vision-Language Perception Paradigm Evolution in Multimodal Large Language Models

Unified vision-language perception in MLLMs is not just an evolution; it’s a critical leap toward achieving artificial general intelligence.

Haoxiang Sun, Tao Wang, Jian Zhao

Multimodal Models

Jun 24, 2026·also ByteDance, PKU, Waterloo, XJTU

V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for Fine-Grained Visual Reasoning

V-Zero achieves fine-grained visual reasoning without any annotated answer labels, outperforming traditional methods in both speed and accuracy.

Haoxiang Sun, Zhihang Yi, Langxuan Deng +3

Multimodal Models Reasoning & Chain-of-Thought RLHF & Preference Learning

Jun 18, 2026

Yalun Dai +6Jun 18, 2026·also B-Instruct VLM + DiT-L MMDiT action, ByteDance, NTU

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Spatial reasoning can be transformed from isolated frame predictions to dynamic scene understanding, significantly boosting performance in multi-view and video tasks.

Yalun Dai, Hao Li, Yuhao Dong +4

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

Jun 16, 2026

Flux-Guard: Facial Identity Protection using diffusion models

Flux-Guard achieves a breakthrough by enabling effective face editing that simultaneously thwarts face recognition systems without compromising image quality.

Tao Wang, Jianyi Liu

Computer Vision Constitutional AI & AI Ethics

Jun 4, 2026

Yuejie Li +7Jun 4, 2026·also ByteDance, Centaur AI, Intelligent Game and Decision Lab

Answer Presence Drives RAG Rewriting Gains

Removing gold answer strings from rewritten contexts can cause F1 scores to plummet by up to 64 points, underscoring their critical role in retrieval-augmented QA performance.

Yuejie Li, Yueying Hua, Ke Yang +5

Natural Language Processing Recommendation & Information Retrieval

Search

Tao Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)