Lattice
Lattice

Search

Search papers, labs, and topics across Lattice.

Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation | Lattice