Search papers, labs, and topics across Lattice.
STAMP is introduced as a task-aware text privatization framework that selectively allocates privacy budgets to tokens based on their importance to the downstream task and privacy sensitivity. It employs a polar mechanism that perturbs only the direction of token embeddings on the unit sphere, preserving their magnitude, to privatize individual token embeddings. Experiments on SQuAD, Yelp, and AG News datasets show that STAMP achieves superior privacy-utility trade-offs compared to isotropic noise mechanisms.
By perturbing only the direction of token embeddings, STAMP's polar mechanism maintains semantic relationships and achieves a better privacy-utility trade-off than adding isotropic noise.
We present STAMP (Selective Task-Aware Mechanism for Text Privacy), a new framework for task-aware text privatization that achieves an improved privacy-utility trade-off. STAMP selectively allocates privacy budgets across tokens by jointly considering (i) each token's importance to the downstream task (as measured via a task- or query-specific representation), and (ii) its privacy sensitivity (e.g., names, dates, identifiers). This token-level partitioning enables fine-grained, group-wise control over the level of noise applied to different parts of the input, balancing privacy protection with task relevance. To privatize individual token embeddings, we introduce the polar mechanism, which perturbs only the direction of embeddings on the unit sphere while preserving their magnitude. Decoding is performed via cosine nearest-neighbor search, aligning the perturbation geometry with the decoding geometry. Unlike isotropic noise mechanisms, the polar mechanism maintains semantic neighborhoods in the embedding space and better preserves downstream utility. Experimental evaluations on SQuAD, Yelp, and AG News datasets demonstrate that STAMP, when combined with the normalized polar mechanism, consistently achieves superior privacy-utility trade-offs across varying per-token privacy budgets.