Search papers, labs, and topics across Lattice.
IBM Research
2
0
6
LLMs can slash memory use by 4x during reasoning without sacrificing accuracy, simply by "zooming in" on relevant cached information instead of attending to everything.
Agentic systems leak sensitive data in 80% of workflows, even when the final output seems safe, because current privacy evaluations miss intermediate steps.