Search papers, labs, and topics across Lattice.
2
0
6
2
Multimodal models can now achieve state-of-the-art performance in real-world tasks like document understanding and audio-video comprehension with significantly reduced inference latency thanks to novel token-reduction techniques.
LLMs, like humans, exhibit a "frequency bias," performing better when prompted and fine-tuned with more common textual expressions.