IMDEA Software Institute
CacheSolidarity secures multi-tenant LLM serving without sacrificing performance: it selectively isolates prefixes, raising cache reuse by up to 70% and cutting inference latency by 30% compared with blunt-force defenses.