Search papers, labs, and topics across Lattice.
1
0
2
ClinEnv reveals that LLMs struggle significantly with management decisions in clinical scenarios, achieving only 0.17 F1 for these critical actions despite better performance in diagnosis.