Despite recent advances, sign language translation models still struggle to leverage the full range of linguistic cues, especially non-manual signals like facial expressions.
Even when deprived of explicit hypernym examples during training, vision-language models can still generalize taxonomic relationships from images, revealing the surprising power of linguistic priors in cross-modal learning.