Yi Lu

Nanjing University

Research focus

Tool Use & Agents (5), Eval Frameworks & Benchmarks (4), Robotics & Embodied AI (2), Natural Language Processing (2)

Frequent co-authors

Ping Nie (3), Dongfu Jiang (3), Zhuofeng Li (2), Haoxiang Zhang (2)

Papers (6)

Jun 25, 2026 · also Tsinghua AI, Northwestern

PressMimic: Pressure-Guided Motion Capture and Control for Humanoid Robot Imitation

Pressure integration in humanoid motion imitation significantly enhances accuracy and stability, revealing the limitations of traditional vision-based methods.

Yi Lu, Shenghao Ren, Tianyu Xiong +4

Multimodal Models, Robotics & Embodied AI

Jun 12, 2026 · also SJTU, Texas A&M, UCSD, UofT +1

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

DR-DCI reaches 73.3% accuracy on agentic search tasks while scaling from 100K to 10M documents, outperforming traditional methods.

Yi Lu, Zhuofeng Li, Ping Nie +6

Recommendation & Information Retrieval, Tool Use & Agents

CMU ML · Apr 22, 2026 · also NJU

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

Continual learning for LLM agents hits a wall: scaling models doesn't reliably improve skill generation, and self-feedback can lead to recursive drift.

Shan Zhong, Shanshan Zhong, Yi Lu +17

Eval Frameworks & Benchmarks, Robotics & Embodied AI, Tool Use & Agents

Apr 15, 2026 · also JHU, NJU, SJTU, UCSD +1

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

A clever two-stage agent using smaller models can produce better, more substantive peer reviews than brute-force application of the largest LLMs.

Zhuofeng Li, Yi Lu, Dongfu Jiang +5

Eval Frameworks & Benchmarks, Natural Language Processing, Tool Use & Agents

CMU ML · Apr 9, 2026 · also Tsinghua AI, NJU, Waterloo

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Today's best AI agents can only complete 33% of common online tasks like booking appointments or filling out job applications, revealing a significant gap between current capabilities and real-world utility.

Yuxuan Zhang, Yubo Wang, Yipeng Zhu +19

Eval Frameworks & Benchmarks, Natural Language Processing, Tool Use & Agents

Mar 17, 2026 · also Corresponding Author, NJU, Waterloo

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

A Qwen3-8B model, trained with a new SFT+RLAIF recipe on a challenging new benchmark, SWE-QA-Pro, beats GPT-4o in repository-level code understanding.

Songcheng Cai, Z. Lyu, Yuansheng Ni +14

Code Generation & Program Synthesis, Eval Frameworks & Benchmarks, Tool Use & Agents
