Yusen Zhang

Columbia University ♠, New York University ♢, Barnard College

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (2)Recommendation & Information Retrieval (1)Tool Use & Agents (1)

Frequent co-authors

Haonan Wang (1)Jiaxiang Liu (1)Yurong Liu (1)Austin Senna Wijaya (1)

Papers (2)

Jun 9, 2026

6d ago·also Barnard College, NYU

LakeQA: An Exploratory QA Benchmark over a Million-Scale Data Lake

Even state-of-the-art LLMs like GPT-5.2 falter in LakeQA, scoring just 18.37% on a benchmark that demands both searching and multi-hop reasoning.

Haonan Wang, Jiaxiang Liu, Yurong Liu +11

Eval Frameworks & Benchmarks Recommendation & Information Retrieval

6d ago·also Barnard College, NYU

VISTA: A Versatile Interactive User Simulation Toolkit for Agent Evaluation

VISTA reveals that integrating UI and API interactions can drastically enhance the realism and comprehensiveness of agent evaluations, outperforming existing benchmarks.

Yunan Lu, Ryan Shea, Yusen Zhang

Eval Frameworks & Benchmarks Tool Use & Agents

Search

Yusen Zhang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)