Search papers, labs, and topics across Lattice.
2
0
5
0
LLMs exhibit significant geographical performance disparities and task-specific gaps when evaluated on the new GaoYao benchmark, highlighting the need for more nuanced multilingual and multicultural training.
Forget human-annotated datasets: MathAgent synthesizes mathematical reasoning data so effectively that models trained on just 1K generated examples outperform those trained on existing datasets.