Saadia Gabriel

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Data Curation & Synthetic Data (2)Reasoning & Chain-of-Thought (2)RLHF & Preference Learning (2)Tool Use & Agents (1)

Frequent co-authors

Hritik Bansal (2)Ashima Suvarna (2)Negin Raoof (1)Richard Zhuang (1)

Papers (3)

Jun 23, 2026

BAIR2w ago·also Stanford HAI, UW, Cornell, Harvard +9

OpenThoughts-Agent: Data Recipes for Agentic Models

Training data diversity is the secret sauce that boosts agentic model performance, with OpenThoughts-Agent achieving a notable accuracy leap over existing benchmarks.

Negin Raoof, Richard Zhuang, Etash Guha +43

Data Curation & Synthetic Data Tool Use & Agents

Apr 20, 2026

Google ResearchApr 20, 2026

When Can LLMs Learn to Reason with Weak Supervision?

Generalization in LLMs hinges on training reward saturation dynamics, with reasoning faithfulness emerging as a critical predictor of success under weak supervision.

Salman Rahman, Jingyan Shen, Anna Mordvina +2

Data Curation & Synthetic Data Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 9, 2026

Ashima Suvarna +9Apr 9, 2026

SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions

Forget brute-force scaling: targeted data curation for RLVR can unlock surprisingly large gains in LLM reasoning.

Ashima Suvarna, Ashima Suvarna, Kendrick Phan +7

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Saadia Gabriel

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)