Sogang University
Multimodal LLMs aren't just for generation: they can dramatically improve audio-text retrieval robustness, especially when handling complex, real-world queries and acoustically similar distractors.