Existing QA benchmarks are too easy for LLMs, so iAgentBench offers a more realistic challenge by requiring agents to synthesize information from multiple sources on high-traffic topics.
LLMs evaluating job candidates exhibit significant bias against hedging language, docking candidates' scores by 25.6% on average even when the substantive content is equivalent.