Search papers, labs, and topics across Lattice.
1
0
3
Achieve state-of-the-art emotion recognition by fusing visual and audio cues with a bi-directional cross-attention mechanism, outperforming unimodal approaches.