Search papers, labs, and topics across Lattice.
IBM Research
2
0
6
LLMs can slash memory use by 4x during reasoning without sacrificing accuracy, simply by "zooming in" on relevant cached information instead of attending to everything.
Steer LLMs like never before with AI Steerability 360, an open-source toolkit that unifies input, structural, state, and output steering methods under a common pipeline.