Search papers, labs, and topics across Lattice.
5
0
8
UniSVQ achieves state-of-the-art performance in 2-bit quantization, outperforming traditional methods while enhancing inference speed.
MemoryCard transforms long videos into coherent, topic-focused segments, boosting long-video QA accuracy by over 21% while maintaining visual-token efficiency.
Crafter revolutionizes scientific figure generation by enabling multi-type outputs and local editability, outperforming existing systems across diverse benchmarks.
Multi-source visual reasoning can actually *hurt* performance when modalities conflict, but MARS solves this by adaptively emphasizing mutual promotion and suppressing noise, leading to significant gains.
Forget separate structure and fidelity models – Khala shows you can generate high-quality music with text-vocal alignment using a single acoustic-token hierarchy.