Search papers, labs, and topics across Lattice.
RAG models struggle to use retrieved knowledge even when it's relevant, but GuarantRAG's two-stage generation and joint decoding boost accuracy by 12% and cut hallucinations by 16%.
LLMs exhibit a "Utopian bias" when simulating human behavior, converging towards an unrealistic "positive average person" and failing to capture individual differences and long-tail behaviors.
Today's best AI agents can complete only 33% of common online tasks like booking appointments or filling out job applications, revealing a significant gap between current capabilities and real-world utility.
LLMs can cut code editing costs by up to 50% simply by knowing when *not* to guess.