Even state-of-the-art LLMs like GPT-3.5 struggle to maintain consistently high performance across key trustworthiness dimensions when applied to mental health scenarios.