Bo Chen

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1), Natural Language Processing (1), Tool Use & Agents (1)

Frequent co-authors

Jiachen Zhu (1), Menghui Zhu (1), Renting Rui (1), Rong Shan (1)

Papers (1)

Jun 6, 2025 · also HIT, Huawei

Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey

Current LLM evaluation benchmarks often conflate chatbots with true AI agents, which misaligns research efforts. This survey provides a framework for targeted evaluation based on environmental complexity and agent capabilities.

Jiachen Zhu, Menghui Zhu, Renting Rui +97

Eval Frameworks & Benchmarks, Natural Language Processing, Tool Use & Agents
