Mar 2, 2026 | arXiv:2603.02047

NICO-RAG: Multimodal Hypergraph Retrieval-Augmented Generation for Understanding the Nicotine Public Health Crisis

Manuel Serna-Aguilera, Raegan Anderes, Page Dobbs, Khoa Luu

AI Summary

The paper introduces the Nicotine Innovation Counter-Offensive (NICO) dataset, a multimodal collection of over 200,000 samples related to tobacco and nicotine products, to aid public health researchers. To support factual question answering over this dataset, the authors propose NICO-RAG, a retrieval-augmented generation framework that uses hypergraphs to represent entities and relations extracted from both images and text. Experiments show that NICO-RAG performs comparably to a state-of-the-art RAG method adapted for images, without incurring the computational cost of processing image tokens.

Key Contribution

You can now retrieve images for RAG using semantic similarity of image descriptions, sidestepping the high cost of processing image tokens.

Abstract

The nicotine addiction public health crisis continues to be pervasive. In this century alone, the tobacco industry has released and marketed new products in an aggressive effort to lure new and young customers for life. Such innovations and product development, namely flavored nicotine or tobacco products such as nicotine pouches, have undone years of anti-tobacco campaign work. Past work is limited both in scope and in its ability to connect large-scale data points. Thus, we introduce the Nicotine Innovation Counter-Offensive (NICO) Dataset to provide public health researchers with over 200,000 multimodal samples, including images and text descriptions, on 55 tobacco and nicotine product brands. In addition, to provide public health researchers with factual connections across a large-scale dataset, we propose NICO-RAG, a retrieval-augmented generation (RAG) framework that can retrieve image features without incurring the high cost of language models, as well as the added cost of processing image tokens with large-scale datasets such as NICO. At construction time, NICO-RAG organizes image- and text-extracted entities and relations into hypergraphs to produce responses that are as factual as possible. This joint multimodal knowledge representation enables NICO-RAG to retrieve images for query answering not only by visual similarity but also by the semantic similarity of image descriptions. Experiments on over 100 questions show that, without needing to process additional tokens from images, NICO-RAG performs comparably to the state-of-the-art RAG method adapted for images.
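To make the retrieval idea concrete, the following is a minimal sketch of a hypergraph whose hyperedges join several entities and optionally point to an image, with images retrieved by the similarity of their text descriptions to the query. It is not the authors' implementation: the class and method names, the sample brands (`BrandA`, `BrandB`), the image IDs, and the bag-of-words cosine score (a toy stand-in for a learned text embedding) are all assumptions made for illustration.

```python
import math
import re
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Optional


def bag_of_words(text):
    """Toy stand-in for a text embedding: lowercase token counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


@dataclass(frozen=True)
class Hyperedge:
    description: str                 # relation text (caption or image description)
    entities: frozenset              # a hyperedge may join any number of entities
    image_id: Optional[str] = None   # set only when the relation came from an image


class Hypergraph:
    def __init__(self):
        self.edges = []
        self.by_entity = defaultdict(list)  # entity -> hyperedges containing it

    def add(self, description, entities, image_id=None):
        edge = Hyperedge(description, frozenset(entities), image_id)
        self.edges.append(edge)
        for entity in edge.entities:
            self.by_entity[entity].append(edge)
        return edge

    def retrieve_images(self, query, k=1):
        """Rank image-backed hyperedges by description similarity to the query.

        Only text is compared, so no image tokens are processed at query time.
        """
        q = bag_of_words(query)
        scored = [
            (cosine(q, bag_of_words(edge.description)), edge)
            for edge in self.edges
            if edge.image_id is not None
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [edge.image_id for score, edge in scored[:k] if score > 0]


# Hypothetical sample data, not drawn from the NICO dataset.
graph = Hypergraph()
graph.add("mint flavored nicotine pouch can from BrandA",
          {"BrandA", "nicotine pouch", "mint"}, image_id="img_001")
graph.add("BrandB disposable vape with mango flavor",
          {"BrandB", "vape", "mango"}, image_id="img_002")
graph.add("BrandA markets nicotine pouches to young customers",
          {"BrandA", "nicotine pouch", "young customers"})  # text-only relation

print(graph.retrieve_images("mint nicotine pouch"))
```

In this sketch, image-derived and text-derived relations share one entity index (`by_entity`), so a query that matches an entity can reach both kinds of evidence, while image ranking relies only on description text.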

Multimodal Models | Natural Language Processing | Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
