Amazon Science, BAAI, Glasgow
May 26, 2026
arXiv:2605.26941

The 2nd EReL@MIR Workshop on Efficient Representation Learning for Multimodal Information Retrieval

Junchen Fu, Xuri Ge, Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Xi Wang, Qijiong Liu, Joemon M. Jose

AI Summary

This paper proposes the EReL@MIR workshop to address the efficiency bottlenecks of large, pretrained multimodal foundation models in multimodal information retrieval (MIR) tasks. The workshop aims to foster discussion and collaboration on solutions for adapting these models for IR tasks during training, deployment, and inference. The goal is to overcome limitations hindering the practical use of foundation models for representation learning in information retrieval.

Key Contribution

Large multimodal models such as Qwen and CLIP perform strongly on information retrieval tasks, but their parameter counts make training, deployment, and inference costly. This workshop targets that efficiency gap.

Abstract

Multimodal representation learning has attracted increasing attention in AI, driven by the strong performance of large, pretrained multimodal foundation models such as Qwen, LLaVA, and CLIP. These models deliver impressive performance on a range of multimodal information retrieval (MIR) tasks, including web search, cross-modal retrieval, and recommender systems. Yet their massive parameter counts create major efficiency bottlenecks when adapting their representations for IR tasks during training, deployment, and inference. These limitations hinder the practical use of foundation models for representation learning in information retrieval. To address these issues, we propose organizing the EReL@MIR workshop at MM 2026, bringing together researchers from academia and industry to discuss emerging solutions, open challenges, and new efficiency metrics and benchmarks for multimodal IR representation learning in the foundation-model era. The workshop's official website is available at https://erel-mir.github.io/.

Multimodal Models, Recommendation & Information Retrieval, Training Efficiency & Optimization

Citation Metrics

Citations: 0
Influential citations: 0
References: 25
Year: 2026
Venue: N/A
