Search papers, labs, and topics across Lattice.
MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China
1
0
1
2
Masking compositional concepts in one modality while leveraging contextual cues from another can dramatically enhance the compositionality of vision-language models.