Search papers, labs, and topics across Lattice.
The paper introduces FineMuSe, a new multimodal dataset in Spanish for fine-grained sexism detection in social media videos, annotated with a hierarchical taxonomy encompassing sexism types, non-sexism, and rhetorical devices. They evaluated a range of LLMs on both binary and fine-grained sexism detection tasks using this dataset. Results show that multimodal LLMs perform competitively with humans in identifying nuanced sexism but struggle with co-occurring sexist types conveyed visually.
Multimodal LLMs can spot subtle sexism as well as humans, but still miss the visual cues when multiple types of sexism occur together.
Online sexism appears in various forms, which makes its detection challenging. Although automated tools can enhance the identification of sexist content, they are often restricted to binary classification. Consequently, more subtle manifestations of sexism may remain undetected due to the lack of fine-grained, context-sensitive labels. To address this issue, we make the following contributions: (1) we present FineMuSe, a new multimodal sexism detection dataset in Spanish that includes both binary and fine-grained annotations; (2) we introduce a comprehensive hierarchical taxonomy that encompasses forms of sexism, non-sexism, and rhetorical devices of irony and humor; and (3) we evaluate a wide range of LLMs for both binary and fine-grained sexism detection. Our findings indicate that multimodal LLMs perform competitively with human annotators in identifying nuanced forms of sexism; however, they struggle to capture co-occurring sexist types when these are conveyed through visual cues.