Xue Yang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (4)Eval Frameworks & Benchmarks (4)Computer Vision (3)Tool Use & Agents (3)

Frequent co-authors

Zhihang Zhong (3)Ziyang Gong (3)Yan Li (2)Yifan Yang (2)

Papers (8)

Jun 1, 2026

2w ago·also Tsinghua AI, AI Laboratory, Northwestern, SEU +4

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Current video MLLMs struggle to grasp fleeting visual events, with top models barely surpassing 39% accuracy on critical momentary tasks.

Xiaolin Liu, Yilun Zhu, Xuehui Wang +8

Computer Vision Multimodal Models

May 28, 2026

Wanghan Xu +382w ago·also Tsinghua AI, CUHK, SCUT, Shanghai AI Lab +3

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Current AI agents struggle to reliably rediscover scientific knowledge, with top performers averaging only 21.5 out of a possible score, revealing critical gaps in their research capabilities.

Wanghan Xu, Shuo Li, Tianlin Ye +36

Eval Frameworks & Benchmarks Scientific Discovery & Drug Design

May 26, 2026

Xudong Lu +113w ago·also Huawei

OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants

Current omnimodal LLMs that ace offline benchmarks still fumble basic real-time interactions, highlighting a critical gap in their ability to handle streaming audio-visual data.

Xudong Lu, Xueying Li, Annan Wang +9

Eval Frameworks & Benchmarks Multimodal Models Speech & Audio

May 22, 2026

Microsoft Research3w ago·also Fudan, M QA pairs over more than, SJTU, Tongji

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

SkillOpt transforms agent skill development into a reproducible optimization process, achieving state-of-the-art results by treating skills as trainable parameters.

Yifan Yang, Ziyang Gong, Weiquan Huang +12

Natural Language Processing Tool Use & Agents Training Efficiency & Optimization

3w ago·also AI Laboratory, Cornell, Northeastern, PhotoFlow +3

PhotoFlow: Agentic 3D Virtual Photography Missions

LLM-powered agents can now produce surprisingly strong photographs in complex 3D environments, suggesting a path towards embodied AI with aesthetic awareness.

Jiarui Guo, Haojia Wei, Yifei Liu +4

Computer Vision Multimodal Models Tool Use & Agents

3w ago·also Microsoft Research, M QA pairs over more than, SJTU

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Model-generated skills can actually hurt agent performance, and bigger models don't necessarily make for better skill extractors or consumers.

Zisu Huang, Jingwen Xu, Yifan Yang +13

Eval Frameworks & Benchmarks Tool Use & Agents

May 21, 2026

D visual recognition and3w ago·also AI Laboratory, Beihang, Chongqing, D scene information. First +4

SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation

Visual degradations can cripple the spatial reasoning abilities of even state-of-the-art MLLMs, but targeted finetuning can restore—and even surpass—human-level performance.

Xiaolong Zhou, Yifei Liu, Ziyang Gong +6

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Apr 30, 2026

Tsinghua AIApr 30, 2026·also NJU

DPN-LE: Dual Personality Neuron Localization and Editing for Large Language Models

LLMs can have their personalities surgically altered by tweaking just 0.5% of their neurons, preserving general capabilities while achieving competitive control.

Lifan Zheng, Xue Yang, Jiawei Chen +5

Interpretability & Mechanistic Interp Natural Language Processing

Search

Xue Yang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)