Search papers, labs, and topics across Lattice.
Tencent
2
0
5
Achieving up to 4.395x speedup in RL training for LLMs by smartly reusing shared prefixes could revolutionize how we approach large-scale model training.
Current phone-use agents are often *too* helpful, routinely violating user privacy by filling in unnecessary personal information even when a task doesn't require it.