Sayash Kapoor

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (3) · Red-Teaming & Adversarial Robustness (2) · Constitutional AI & AI Ethics (1) · Tool Use & Agents (1) · Open-Source Models & Weights (1)

Frequent co-authors

Sayash Kapoor (2) · Arvind Narayanan (2) · Y. Bengio (1) · Yoshua Bengio (1)

Papers (3)

Mila · Feb 24, 2026 · also BAIR, CMU ML, Meta AI, Tsinghua AI +13

International AI Safety Report 2026

A global consensus on AI safety risks and capabilities has emerged from a panel of 100+ independent experts, representing a landmark effort in international collaboration.

Y. Bengio, Yoshua Bengio, Stephen Clare +1656

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Feb 18, 2026

Towards a Science of AI Agent Reliability

Despite progress in AI agent capabilities, reliability across crucial dimensions like consistency and robustness remains stubbornly low, revealing a critical gap in current evaluation practices.

Stephan Rabanser, Sayash Kapoor +9

Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness · Tool Use & Agents

Apr 29, 2025 · also Mila

The Leaderboard Illusion

Chatbot Arena, the go-to LLM leaderboard, is systematically gamed by undisclosed private testing and data access advantages, leading to biased rankings and overfitting.

Shivalika Singh, Yiyang Nan, Alex Wang +1034

Eval Frameworks & Benchmarks · Open-Source Models & Weights
