Micro-expressions that look identical can convey opposite emotions, and MEDN teases apart motion and emotion cues to spot the difference.
By explicitly disentangling language into global context, spatial relations, and object attributes, ProVG achieves state-of-the-art remote sensing visual grounding, suggesting that fine-grained linguistic cues are key to unlocking performance in this domain.