Search papers, labs, and topics across Lattice.
4
1
6
11
Don't let valuable steps in failed trajectories go unnoticed: GraphGPO leverages state-transition graphs for fine-grained credit assignment in agentic RL, boosting performance and efficiency.
Generate minute-long, high-fidelity animations without visual degradation or character drift using a surprisingly simple latent flow restoration technique.
Visuomotor control can now generalize to unseen environments and instructions by grounding world models in a vision-language latent space, outperforming standard vision-language approaches by a large margin.
Context inconsistency in stepwise group-based RL can severely bias advantage estimation, but a hierarchical grouping strategy can fix it without extra compute.