Thoughtworks, Martian
STRIDE shows that the influence of training data in LLMs can be traced efficiently using sparse recovery, achieving attribution 13 times faster than traditional methods.
LLM activation spaces aren't linear, and exploiting their true geometry with "Curveball steering" unlocks more effective control than standard linear interventions.