Search papers, labs, and topics across Lattice.
Khulna University of Engineering and Technology
1
0
3
LLMs ace semantic similarity in medical QA, but VB-Score reveals they're failing to extract key medical entities, especially when answering questions about chronic conditions affecting older and minority populations.