Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

S. Srivastava | Lattice

S. Srivastava

Papers on Lattice

1

Total citations

12

Topics

2

h-index

3

Research focus

Natural Language Processing (1)RLHF & Preference Learning (1)

Frequent co-authors

Vaneet Aggarwal (1)

Papers (1)

Jul 5, 2025

Jul 5, 2025

A Technical Survey of Reinforcement Learning Techniques for Large Language Models

Despite the dominance of RLHF for LLM alignment, outcome-based RL methods are proving surprisingly effective at improving stepwise reasoning.

S. Srivastava, Vaneet Aggarwal12

Natural Language Processing RLHF & Preference Learning