LLMs can get up to 6x more logically consistent without human feedback, simply by fusing NLI scores into the DPO training loop.
Large audio language models (LALMs) still struggle to get the joke: a new benchmark shows they can't reliably recognize, locate, or understand audio puns.