Greg Durrett

LLMs struggle to generate diverse and specific connections between concepts, even with high token budgets and "thinking" prompts, revealing a gap in creative associative reasoning.

Manya Wadhwa, Tiasa Singha Roy, Harvey Lederman +3

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Feb 18, 2026

Wenxuan Ding +4Feb 18, 2026·also UMass

Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

LLMs can learn to make better decisions in complex environments by explicitly reasoning about the cost of exploration, leading to more efficient information gathering and problem-solving.

Wenxuan Ding, Wenxuan Ding, Nicholas Tomlin +2

Code Generation & Program Synthesis Reasoning & Chain-of-Thought Tool Use & Agents

Sep 26, 2025

Adaptive Margin RLHF via Preference over Preferences

Forget fixed margins in RLHF: modeling the *strength* of human preferences with "preference-over-preference" learning boosts both discriminative accuracy and generative quality.

Yaswanth Chittepu, Prasann Singhal, Greg Durrett +1

RLHF & Preference Learning

Search

Greg Durrett

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)