Yiyuan Li

Research focus

Eval Frameworks & Benchmarks (1), Natural Language Processing (1), Tool Use & Agents (1)

Frequent co-authors

Xiangyi Li (1), K. Choe (1), Yiming Liu (1), Xiaokun Chen (1)

Papers (1)

Apple ML, Apr 6, 2026

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

LLM agents automating productivity tasks achieve only moderate success (39-64%) while exhibiting surprisingly high rates of unsafe actions (7-33%) in realistic, multi-service workflows.

Xiangyi Li, K. Choe, Yiming Liu +12

Eval Frameworks & Benchmarks, Natural Language Processing, Tool Use & Agents
