Simin Ma

Research focus

Eval Frameworks & Benchmarks (2), Tool Use & Agents (2), RLHF & Preference Learning (1)

Frequent co-authors

Xun Wang (2), Yebowen Hu (2), Sathish Indurthi (2), Shujian Liu (2)

Papers (2)

Google Research · Feb 12, 2026 · also TCD, UMass, Wayfair, Yanshan +1

CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

Forget hand-crafted reward functions: CM2 uses checklists to train tool-using agents, outperforming SFT baselines by up to 12 points on key benchmarks.

Xun Wang, Yebowen Hu, Chenyang Zhao +5

Eval Frameworks & Benchmarks, RLHF & Preference Learning, Tool Use & Agents

Aug 21, 2025 · also Google Research, Microsoft Research, UCF, UMass +4

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Even the best LLMs fail more than 40% of the time when orchestrating multiple tools in realistic scenarios, revealing critical gaps in real-world agent capabilities.

Ming Yin, Dinghan Shen, Silei Xu +11

Eval Frameworks & Benchmarks, Tool Use & Agents
