Search papers, labs, and topics across Lattice.
Humboldt-Universit盲t zu Berlin
1
0
3
6
Forget scaling up data volume: repeating a smaller, high-quality German dataset yields superior language models compared to single-pass training on a larger, less filtered corpus.