Search papers, labs, and topics across Lattice.
The University of Texas at Austin
2
0
3
GENIE reveals that traditional metrics fail to capture the nuanced dimensions of novelty, offering a sharper lens for evaluating LLM creativity.
LLMs struggle with procedural rule induction, revealing a significant performance gap that challenges current AI capabilities.