Search papers, labs, and topics across Lattice.
This paper introduces $p^2$RAG, a privacy-preserving RAG service designed to support arbitrary top-$k$ retrieval without relying on secure sorting. The system employs an interactive bisection method to efficiently identify the top-$k$ documents while using secret sharing across two non-colluding servers to protect both the data owner's database and the user's prompt. Experimental results demonstrate that $p^2$RAG achieves significant speedups (3-300x) compared to existing privacy-preserving RAG systems, particularly for larger values of $k$.
Privacy-preserving RAG gets a massive speed boost (up to 300x) by ditching secure sorting for an interactive bisection method, finally making large-$k$ retrieval practical.
Retrieval-Augmented Generation (RAG) enables large language models to use external knowledge, but outsourcing the RAG service raises privacy concerns for both data owners and users. Privacy-preserving RAG systems address these concerns by performing secure top-$k$ retrieval, which typically is secure sorting to identify relevant documents. However, existing systems face challenges supporting arbitrary $k$ due to their inability to change $k$, new security issues, or efficiency degradation with large $k$. This is a significant limitation because modern long-context models generally achieve higher accuracy with larger retrieval sets. We propose $p^2$RAG, a privacy-preserving RAG service that supports arbitrary top-$k$ retrieval. Unlike existing systems, $p^2$RAG avoids sorting candidate documents. Instead, it uses an interactive bisection method to determine the set of top-$k$ documents. For security, $p^2$RAG uses secret sharing on two semi-honest non-colluding servers to protect the data owner's database and the user's prompt. It enforces restrictions and verification to defend against malicious users and tightly bound the information leakage of the database. The experiments show that $p^2$RAG is 3--300$\times$ faster than the state-of-the-art PRAG for $k = 16$--$1024$.