Search papers, labs, and topics across Lattice.
University of California San Diego
1
0
2
PALUTE achieves 1,264 TPS at only 0.16 W, revolutionizing edge LLM inference with unprecedented energy efficiency.