Boyi Liu

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (2)RLHF & Preference Learning (2)Eval Frameworks & Benchmarks (1)Tool Use & Agents (1)

Frequent co-authors

Pranjal Aggarwal (1)Marjan Ghazvininejad (1)Seungone Kim (1)Ilia Kulikov (1)

Papers (2)

Mar 19, 2026

Meta AIMar 19, 2026·also CMU ML, CAS, UESTC, UNC +1

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

On-policy reward modeling with LLM judges not only unlocks significant performance gains on complex mathematical reasoning tasks, but also generalizes to improve performance on simpler numerical and multiple-choice benchmarks.

Pranjal Aggarwal, Marjan Ghazvininejad, Seungone Kim +20

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought RLHF & Preference Learning

Xiaoyin Chen +8Mar 19, 2026·also Snowflake AI Research

Learning to Self-Evolve

Forget prompt engineering – LSE trains LLMs to self-edit their own contexts at test time, outperforming even GPT-5 and Claude Sonnet 4.5 in Text-to-SQL and question answering.

Xiaoyin Chen, Xiaoyin Chen, Canwen Xu +6

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Boyi Liu

Research focus

Frequent co-authors

Papers (2)