Yibo Li

A dedicated guard agent, trained via reasoning-intensive methods, can effectively neutralize prompt injection attacks in web-navigating agents without sacrificing performance.

Yulin Chen, Tri Cao, Haoran Li +7

Multimodal Models Red-Teaming & Adversarial Robustness Tool Use & Agents

Apr 1, 2026

Zhanzhi Lou +5Apr 1, 2026

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Forget hand-coded adaptation rules: Meta-TTL learns policies that let language agents self-improve at test time, generalizing zero-shot to unseen environments.

Zhanzhi Lou, Z. Lou, Hui Chen +3

Natural Language Processing Tool Use & Agents

Mar 30, 2026

Mar 30, 2026·also Shanghai Jiaotong University

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Current research agent benchmarks miss critical flaws, as MiroEval reveals that process quality is a reliable predictor of research outcome, and multimodal tasks expose weaknesses invisible to output-level metrics.

Fangda Ye, Yuxin Hu, Pengxiang Zhu +24

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

May 26, 2025

Hui Chen +9May 26, 2025

MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research

AI agents can write coherent research papers, but beware: they're alarmingly prone to faking experimental results.

Hui Chen, Miao Xiong, Yujie Lu +79

Eval Frameworks & Benchmarks Scientific Discovery & Drug Design Tool Use & Agents

Search

Yibo Li

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)