MIRAI
205
Forget meticulously annotating subtasks – SuperIgor lets language models self-learn to generate and refine instruction-following plans through RL feedback.
A single transformer can master StarCraft, Football, and POGEMA, suggesting that multi-agent reinforcement learning (MARL) can be unified under one foundation model.