Search papers, labs, and topics across Lattice.
3
0
7
Watermarking agent memories is now possible without performance degradation or reliance on logs, enabling snapshot-only attribution even after memory migration or leakage.
LLMs can now be rigorously benchmarked on realistic sales skills, revealing a wide performance gap where some models rival humans while others fall short.
Forget brittle, off-distribution steering: ROAST leverages on-distribution rollouts and normalization to achieve significant gains (+9.7% on GSM8K, +12.1% on TruthfulQA) by carefully balancing activation contributions.