McGill University
Even GPT-5.4 can't handle investment banking tasks, failing nearly half the criteria and producing zero client-ready outputs on a new benchmark designed with 500+ bankers.
LLMs can't reliably predict scientific experiment outcomes, and more worryingly, they have no idea when they're wrong, unlike human experts whose accuracy skyrockets when they feel confident.