Michal Shmueli-Scheuer

IBM Research

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Eval Frameworks & Benchmarks (2), Training Efficiency & Optimization (1), Tool Use & Agents (1)

Frequent co-authors

Elron Bandel (2), Asaf Yehudai (1), Yotam Perlitz (1), Leshem Choshen (1)

Papers (2)

Apr 14, 2026

2w ago·also AI2, HUJI, Technion

Growing Pains: Extensible and Efficient LLM Benchmarking Via Fixed Parameter Calibration

Stop re-running full benchmarks: Calibrate new LLM datasets against existing suites with just 100 "anchor" questions and still get highly accurate performance predictions.

Asaf Yehudai, Yotam Perlitz, Elron Bandel +2

Eval Frameworks & Benchmarks, Training Efficiency & Optimization

Feb 26, 2026

Feb 26, 2026·also HUJI

General Agent Evaluation

General-purpose agents can match the performance of specialized agents across diverse environments without any environment-specific tuning, challenging the need for task-specific engineering.

Elron Bandel, Asaf Yehudai +23

Eval Frameworks & Benchmarks, Tool Use & Agents
