Search papers, labs, and topics across Lattice.
University of Southern California
2
0
5
A novel differentially private RAG algorithm, DP-KSA, achieves a strong privacy-utility tradeoff by extracting and augmenting prompts with differentially private keywords derived from an ensemble of LLM responses.
Decomposing linear layers in MPC inference can actually speed things up, but only if you carefully manage the extra communication rounds and truncation steps.