Mehryar Mohri

Google Research & CIMS New York

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Natural Language Processing (5)Training Efficiency & Optimization (4)Architecture Design (Transformers, SSMs, MoE) (2)Computer Vision (1)

Frequent co-authors

Yutao Zhong (3)M. Mohri (2)Yutao Zhong (1)Corinna Cortes (1)

Papers (6)

May 27, 2026

Google ResearchMay 27, 2026

Principled Algorithms for Optimizing Generalized Metrics in Multi-Label Learning

Achieving provable, non-asymptotic guarantees for optimizing complex multi-label metrics like F-measure is now possible with a new family of algorithms that decompose exactly for $O(l)$ time complexity.

Mehryar Mohri, Yutao Zhong

Natural Language Processing Training Efficiency & Optimization

Apr 30, 2026

Corinna Cortes +4Apr 30, 2026·also Google Research

Optimized Deferral for Imbalanced Settings

Expert imbalance can cripple learning-to-defer systems, but a novel cost-sensitive margin-based loss function can restore performance.

Corinna Cortes, Anqi Mao, Mehryar Mohri +2

Computer Vision Natural Language Processing Training Efficiency & Optimization

Google ResearchApr 30, 2026

Mind the Gap: Structure-Aware Consistency in Preference Learning

Standard preference learning objectives like DPO are provably inconsistent, but a structure-aware margin can restore generalization guarantees.

Mehryar Mohri, Yutao Zhong

RLHF & Preference Learning Scalable Oversight & Alignment Theory

M. Mohri +2Apr 30, 2026·also Google Research

Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction

Get the best of both worlds: Linear-Core Surrogates offer the fast optimization of smooth losses and the statistical efficiency of margin-based losses, without sacrificing differentiability.

M. Mohri, Mehryar Mohri, Yutao Zhong

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing Training Efficiency & Optimization

Mar 30, 2026

Google ResearchMar 30, 2026·also Courant Institute of Mathematical, Harvard, NYU, School of Engineering and Applied

Next-Token Prediction and Regret Minimization

Bounded context windows in next-token prediction models can be fundamentally incompatible with low adversarial regret, even with long context lengths.

Mehryar Mohri, Clayton Sanford, Jon Schneider +1

Natural Language Processing Red-Teaming & Adversarial Robustness

Feb 19, 2026

Google ResearchFeb 19, 2026

A Theoretical Framework for Modular Learning of Robust Generative Models

Modular generative models can theoretically and empirically outperform monolithic models, offering a robust alternative to resource-intensive retraining on aggregate data.

Mehryar Mohri

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing Training Efficiency & Optimization

Search

Mehryar Mohri

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)