Kaiqiang Song

Research focus

Eval Frameworks & Benchmarks (1), Tool Use & Agents (1)

Frequent co-authors

Ming Yin (1), Dinghan Shen (1), Silei Xu (1), Jian-Jun Han (1)

Papers (1)

Aug 21, 2025 · also Google Research, Microsoft Research, UCF, UMass +4

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Even the best LLMs fail more than 40% of the time when orchestrating multiple tools in realistic scenarios, revealing critical gaps in real-world agent capabilities.

Ming Yin, Dinghan Shen, Silei Xu +1111

Eval Frameworks & Benchmarks, Tool Use & Agents
