Search papers, labs, and topics across Lattice.
1
0
2
N-GRPO achieves superior performance in mathematical reasoning by mixing embeddings, leading to diverse yet semantically consistent solution paths.