Avery Nie

University of Toronto

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (2)Tool Use & Agents (2)Robotics & Embodied AI (1)

Frequent co-authors

K. Shi (2)Ziao Zhang (2)Shiting Huang (2)Zhen Fang (2)

Papers (2)

May 27, 2026

May 27, 2026·also UofT

AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios

LLM agents struggle to juggle multiple tasks when tool use involves realistic delays, revealing critical weaknesses in temporal reasoning and coordination.

K. Shi, Ziao Zhang, Shiting Huang +10

Eval Frameworks & Benchmarks Tool Use & Agents

Apr 19, 2026

Apr 19, 2026·also CUHK, UofT

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Even the best LLMs struggle to effectively discover, refine, and reuse skills over a lifetime of experience, suggesting current benchmarks significantly overestimate real-world agentic capabilities.

Ziao Zhang, K. Shi, Shiting Huang +14

Eval Frameworks & Benchmarks Robotics & Embodied AI Tool Use & Agents

Search

Avery Nie

Research focus

Frequent co-authors

Papers (2)