Search papers, labs, and topics across Lattice.
Chinese Academy of Sciences, University of Chinese Academy of Sciences
4
0
6
DocFormFlow boosts document formatting accuracy while slashing token consumption, revealing that target localization is the key to success.
PRISM enables T2I models to generate high-fidelity images from lengthy prompts, outperforming traditional methods by a notable margin.
Ditch discrete waypoints: VLA models can now generate smooth, physically plausible robot trajectories by directly regressing continuous action functions.
By learning to project actions onto a low-dimensional manifold, ABot-M0 achieves faster and more stable robotic control policies compared to directly predicting actions in the full high-dimensional space.