Search papers, labs, and topics across Lattice.
2
0
4
2
Sapiens2's leap in human-centric vision proves that scaling up high-resolution transformers with a unified pretraining objective unlocks unprecedented fidelity and generalization.
Training 3D avatar diffusion models on millions of in-the-wild videos is now possible, thanks to a clever 3D tokenization and visibility-aware training strategy that overcomes partial observability.