Search papers, labs, and topics across Lattice.
University of Texas at Austin
2
0
3
0
FTRL learners are inherently exploitable in two-player games, regardless of equilibrium structure, revealing a fundamental weakness in this widely used optimization strategy.
LLMs can now directly predict scalar values with a tiny, efficient head (3.4M parameters) that outperforms prior regression methods.