YouZhi-LLM achieves unprecedented concurrency and accuracy in financial LLMs by dramatically reducing KV-cache overhead, setting a new standard for deployment efficiency.
A 1.7B-parameter model can now rival much larger audio language models, thanks to a novel architecture and data synthesis pipeline.