Search papers, labs, and topics across Lattice.
The authors thank Prof. Cong Wang of the City University of Hong Kong for his valuable suggestions on revising this paper
1
0
2
0
Achieve near-plaintext LLM inference speeds with strong privacy guarantees and minimal accuracy loss by jointly obfuscating data and model parameters – a first for models at the 671B scale.