A. Geramifard

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Inference & Quantization (1)Training Efficiency & Optimization (1)

Frequent co-authors

Yuanda Xu (2)Hejian Sang (2)Ran He (2)Alborz Geramifard (2)

Papers (2)

Jul 6, 2026

Yuanda Xu +132w ago·also LinkedIn Corporation

TREK: Distill to Explore, Reinforce to Refine

TREK transforms the way models tackle challenging prompts by expanding their exploration support, leading to substantial performance gains even in the hardest task scenarios.

Yuanda Xu, Zhengze Zhou, Kayhan Behdin +11

Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 15, 2026

Yuanda Xu +4Apr 15, 2026·also LinkedIn Corporation

TIP: Token Importance in On-Policy Distillation

Overconfident tokens, often missed by entropy-based methods, carry surprisingly dense corrective signals in on-policy distillation, allowing for near-baseline performance with <10% of tokens.

Yuanda Xu, Hejian Sang, Ran He +2

Inference & Quantization Training Efficiency & Optimization

Search

A. Geramifard

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)