Marjan Ghazvininejad

Research focus

Eval Frameworks & Benchmarks (1) · Reasoning & Chain-of-Thought (1) · RLHF & Preference Learning (1) · Architecture Design (Transformers, SSMs, MoE) (1) · Computer Vision (1)

Frequent co-authors

Pranjal Aggarwal (1) · Seungone Kim (1) · Ilia Kulikov (1) · Jack Lanchantin (1)

Papers (2)

Mar 19, 2026

Meta AI · also CMU ML, CAS, UNC

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

On-policy reward modeling with LLM judges not only unlocks significant performance gains on complex mathematical reasoning tasks, but also generalizes to improve performance on simpler numerical and multiple-choice benchmarks.

Pranjal Aggarwal, Marjan Ghazvininejad, Seungone Kim +20

Eval Frameworks & Benchmarks · Reasoning & Chain-of-Thought · RLHF & Preference Learning

Mar 3, 2026

Meta AI · also NYU

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Vision models are far more data-hungry than language models, but Mixture-of-Experts can harmonize this asymmetry for truly unified multimodal models.

Shengbang Tong, David Fan, John Nguyen +18

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models
