University of Copenhagen
Exposes systematic gaps in AI evaluation reporting, revealing inconsistencies that hinder reliable comparisons across thousands of models and benchmarks.
ECI_{sem} ranks LLM negatives highest among non-hybrid sources, revealing that effective hard-negative selection can be achieved without fine-tuning.