Search papers, labs, and topics across Lattice.
3
0
7
5
Long-form video generation struggles with transitions, scoring only 0.256 on transition quality even when prompt fulfillment is high (0.71), revealing a critical bottleneck exposed by the new DirectorBench diagnostic benchmark.
Saturated LLM benchmarks can be revived without creating new datasets: a self-improving LLM judge in an elimination tournament recovers ranking signal and breaks ties.
Ditch the deep thought: this new agentic search framework slashes reasoning steps by 70% while boosting accuracy by prioritizing parallel evidence gathering.