Search papers, labs, and topics across Lattice.
1
0
3
By exploiting the low entropy of BF16 exponents with Huffman coding, LEXI slashes inter-chiplet communication latency in LLMs by up to 45% without sacrificing accuracy.