Search papers, labs, and topics across Lattice.
Harbin Institute of Technology
2
0
4
RL agents can learn far more efficiently by incorporating group-level natural language feedback, achieving 2.2x sample efficiency gains in sparse-reward environments.
LLM personalities can be steered with fine-tuning-level precision, compositionality, and context-awareness, all without training, by directly manipulating activation vectors in representation space.