Annay Xie

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)Natural Language Processing (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Shaojie Shi (1)Zhengyu Shi (1)Lin Zheng (1)Lingran Zheng (1)

Papers (1)

Mar 16, 2026

Mar 16, 2026·also USTC

InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems

LLMs still fall short when it comes to reasoning about real-world policy interventions and causal study design, as revealed by the new InterveneBench benchmark.

Shaojie Shi, Zhengyu Shi, Lin Zheng +17

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Search

Annay Xie

Research focus

Frequent co-authors

Papers (1)