LIVIA, Dept. of Software and IT Engineering, ETS Montreal, Canada
Despite advances in multimodal deep learning, recognizing subtle emotional states like ambivalence and hesitancy from video remains a significant challenge, even for state-of-the-art models.
Forget noisy LLM-generated prompts: this method uses interpretable Action Units to guide CLIP for personalized, fine-grained video emotion recognition, achieving state-of-the-art results.