Search papers, labs, and topics across Lattice.
2
0
5
2
Multimodal trackers can achieve state-of-the-art results without ballooning parameter counts by adaptively aligning cross-modal attention maps and using hierarchical mixture of experts for efficient global reasoning.
Ditch the byte-level baggage: HiVG's hierarchical tokenization for SVGs slashes token redundancy and coordinate hallucinations, paving the way for more efficient and geometrically sound vector graphics generation.