Shuffle-tokenization lets you train a single next-app predictor that generalizes across app ecosystems, even when you don't know the app names.
Forget mixed-precision: tunable INT8 emulation can simultaneously boost accuracy and performance in FP64 HPC workloads on GPUs.
Existing multimodal models struggle with multi-image reasoning, but a new benchmark and an inference-time attention fix expose and alleviate these shortcomings.