100 papers published across 12 labs.
LLM-generated text alone can be a surprisingly effective and cost-efficient source of feedback for pseudo-relevance feedback, rivaling corpus-derived feedback in low-resource information retrieval tasks.
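The mechanism is simple to sketch: instead of expanding the query with terms from top-ranked corpus documents, expand it with terms from text an LLM generates for the query. In this minimal illustration, `generate_pseudo_feedback` and its canned passage are hypothetical stand-ins for a real LLM call:

```python
def generate_pseudo_feedback(query):
    """Stand-in for an LLM call that writes a short hypothetical answer
    passage for the query (hypothetical helper; a real system calls a model)."""
    canned = {
        "treating malaria": "artemisinin combination therapy chloroquine mosquito nets",
    }
    return canned.get(query, "")

def expand_query(query, num_terms=5):
    """Pseudo-relevance feedback with LLM text in place of top-ranked
    corpus documents: append the first few novel generated terms."""
    feedback = generate_pseudo_feedback(query)
    extra = [t for t in feedback.split() if t not in query.split()][:num_terms]
    return query + " " + " ".join(extra)

print(expand_query("treating malaria"))
# → treating malaria artemisinin combination therapy chloroquine mosquito
```

The expanded query then goes to any standard retriever, which is what makes the approach attractive in low-resource settings: no corpus-side feedback pass is needed.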
Reasoning rerankers don't magically fix fairness issues in search, preserving the biases of their input rankings despite boosting relevance.
Agentic search gets a meta-RL boost: MR-Search learns to self-reflect and adapt search strategies across episodes, significantly outperforming standard RL baselines.
By modeling contextual relationships between DNS queries, DNS-GT significantly improves domain name embedding quality, leading to better performance in botnet detection and domain classification.
By combining differentiable indexing with isotropic geometric optimization, DGI achieves state-of-the-art generative retrieval, especially for long-tail items that are often missed by other methods.
LLMGreenRec shows how LLMs can bridge the gap between users' green intentions and their actual purchases, while simultaneously reducing the recommender system's carbon footprint.
Forget brittle KG traversals: MDER-DR's entity-centric summaries and decomposed queries boost multi-hop QA accuracy by up to 66% over standard RAG.
Hypergraphs and sampling can speed up exploratory business intelligence queries by over 16x compared to Neo4j, while maintaining high accuracy.
A massive, bilingual, authority-grounded dataset could finally make AI-assisted cataloging a reality.
Spot rug-pulls before they happen: a new framework combines blockchain data with social media buzz to predict crypto scams with improved accuracy.
Unlock millions of natural history specimens with a conversational AI that understands complex queries and dynamically retrieves data from live museum APIs.
News recommendations get a boost by modeling user interests as a stage-wise evolution, capturing both long-term preferences and rapidly shifting short-term interests.
Forget contrastive learning: LLM2Vec-Gen learns text embeddings by representing the *response* an LLM would generate, unlocking safety and reasoning abilities for embedding tasks.
Pinpointing performance bottlenecks in RAG pipelines just got easier: RAGPerf offers a modular benchmarking framework to dissect and optimize each component.
Ditching flat text for structured linked data in RAG systems can boost accuracy by nearly 30%, but only if you go beyond basic JSON-LD and add agent-friendly instructions and neural search.
Ditch the interleaved item-action token mess: new architectures slash sequence complexity by 50% in generative recommenders, boosting performance and cutting training time.
Item agents that self-promote can simultaneously boost recommendation accuracy and fairness, overturning the assumption that these goals are inherently at odds.
Research on coded caching, crucial for modern content delivery, often treats security as an afterthought, resulting in fragmented solutions that this review seeks to unify and improve.
LLMs can now autonomously retrieve relevant memories from a database using specialized tools, significantly improving performance on long-term conversational question answering.
ZipPIR delivers SimplePIR-level throughput without the massive client-side storage, finally making high-performance private information retrieval practical for resource-constrained devices.
Stop treating concept drift as one thing: DynaME's hybrid approach, separating recurring and emergent drifts, unlocks better online time series forecasting.
Achieve fine-grained access control in searchable encryption without re-encryption or excessive interaction, enabling practical multi-client deployments in dynamic clouds.
Forget retraining: Ego personalizes VLMs on the fly by extracting and leveraging visual tokens that represent specific concepts using the model's internal attention.
Now you can test if your AI system is ready for the EU AI Act, thanks to a new benchmark that combines legal expertise and LLM-generated scenarios.
Reverse image search, a key tool for visual fact-checking, often amplifies misinformation and irrelevant content, burying debunking information.
LLM agents can now achieve a +41pp boost in first-try success and 100% accuracy in 2-way logistics compositions by using PRECEPT's novel combination of retrieval, memory, and prompt evolution.
Forget relying on just ingredients: this method shows how fusing semantic, lexical, and nutritional aspects significantly improves recipe similarity estimation, aligning more closely with expert judgment.
Forget brittle multi-hop reasoning: TaSR-RAG's taxonomy-guided triple matching boosts RAG performance by 14% without costly graph construction.
Forget expensive fine-tuning: FoodOntoRAG links food entities with near SOTA accuracy while adapting to evolving ontologies using a clever RAG architecture with retrieval, selection, scoring, and synonym generation agents.
LLM-powered recommendation agents can now autonomously investigate and bridge information gaps, leading to better recommendations, thanks to a new tool-augmented reasoning framework.
Recommendation welfare can provably exceed what any learner-measurable treatment policy achieves when downstream actors possess private information, forcing a critical re-evaluation of learning objectives in bandit settings with noncompliance.
Achieve RAG efficiency without sacrificing accuracy: LooComp prunes context by identifying and retaining only the most critical sentences for answering a query.
Forget fine-tuning: this training-free method boosts retrieval accuracy for tricky negation queries by up to 10% using clever embedding optimization.
Ditch global embeddings for text-motion retrieval: this method uses joint-angle motion images and token-patch late interaction to achieve state-of-the-art accuracy and interpretability.
Retrieval-augmented agents get a serious reasoning boost by explicitly evaluating their own retrieval quality at each step, leading to state-of-the-art performance on multi-hop question answering.
LLMs can now retrieve memories like humans, using a fast familiarity check or a deliberate recollection process, leading to better personalization without overwhelming the model with irrelevant context.
Spectrum regulators can now leverage AI to dynamically plan and allocate spectrum resources, thanks to a new data-driven approach that accurately forecasts demand with high reliability across diverse urban environments.
Ditch the IPW variance headache: a new nonparametric weighting method slashes variance in off-policy evaluation without sacrificing bias.
$P^2$GNN's plug-and-play prototype approach boosts GNN performance by injecting global context and denoising local neighborhoods, achieving state-of-the-art results across diverse datasets.
Tired of sifting through mountains of internal docs? This RAG system uses a clever two-tiered vector DB to surface the right physics analysis, not just keywords.
Forget tweaking knobs: this new Gram-matrix-based audio representation lets you *retrieve* the perfect, editable audio effect preset, outperforming standard methods.
Language models often disregard provided context, choosing instead to rely on potentially outdated or conflicting information learned during pre-training, revealing a critical flaw in their knowledge integration.
LLMs can drastically reduce manual effort for domain experts in accessing complex food and nutrition data via RAG, but still struggle with queries that exceed the representational scope of the metadata.
Stop blindly rewriting content: AgentGEO diagnoses *why* documents fail to be cited in AI responses, leading to a 40% boost in citations with minimal content changes.
Token pruning in dense retrieval gets a geometric upgrade: Voronoi cells offer a principled way to shrink your index without sacrificing search quality.
Can RAG systems handle complex, multi-sentence queries while maintaining factual grounding and transparency?
Meta Pixel's default settings lead to near-ubiquitous tracking of user activity and identity, even on health-related websites, while advertised tracking restrictions are easily bypassed.
Confidence-based abstention in ranked decision systems often fails due to overlooked contextual uncertainty, challenging the common practice of exception-based intervention.
A single graph foundation model can now achieve state-of-the-art anomaly detection across diverse graph domains, thanks to a new theory of "Anomaly Disassortativity" that tackles domain shift.
Slash embedded software testing time by up to 66% with an LLM-powered RAG pipeline that generates 270 syntactically correct unit tests per hour.
LLMs may secretly be better at information retrieval than embedding similarity suggests, but current datasets are too "short-sighted" to prove it.
Forget local semantic alignment: CAST unlocks temporally coherent video retrieval and generation by explicitly modeling visual state transitions.
LoopLens reveals a stark divide in how musicians with and without domain expertise approach creative search for music loops, highlighting the need for vocabulary-independent discovery tools.
Generative search rankings are far more unstable than you think: single-run citation metrics provide a misleadingly precise view of domain visibility.
Even the most advanced LLMs stumble when asked to reason over a large, heterogeneous document corpus, achieving only 34% accuracy on the new OfficeQA Pro benchmark despite direct access to the relevant documents.
Forget exhaustive enumeration: a Transformer-based reinforcement learning approach can efficiently optimize sequential service region design under uncertainty, outperforming standard DRL methods.
Spotting coordinated fake reviewers just got easier: a new graph learning method boosts detection accuracy by adaptively weighing network diversity and similarity.
Turns out, buying stars and downloads for open-source software doesn't actually trick developers into using it.
Online A/B testing's classic Difference-in-Means estimator is just off-policy Inverse Propensity Scoring in disguise.
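The identity behind this claim is easy to check numerically: Difference-in-Means coincides with IPS once the empirical assignment rates are plugged in as propensities. The simulation below is illustrative, not taken from the paper:

```python
import random

def difference_in_means(arms, rewards):
    """Classic A/B estimate: mean reward under treatment minus control."""
    treat = [r for a, r in zip(arms, rewards) if a == 1]
    ctrl = [r for a, r in zip(arms, rewards) if a == 0]
    return sum(treat) / len(treat) - sum(ctrl) / len(ctrl)

def ips_value(arms, rewards, target_arm, propensity):
    """Off-policy IPS estimate of the value of always playing target_arm."""
    n = len(arms)
    return sum(r / propensity for a, r in zip(arms, rewards) if a == target_arm) / n

random.seed(0)
arms = [random.randint(0, 1) for _ in range(10_000)]
rewards = [random.random() + 0.2 * a for a in arms]

dm = difference_in_means(arms, rewards)
# Use the *empirical* assignment rate as the propensity and the two
# estimators coincide (up to floating-point noise): DM is IPS with
# estimated propensities.
p1 = arms.count(1) / len(arms)
ips_diff = ips_value(arms, rewards, 1, p1) - ips_value(arms, rewards, 0, 1 - p1)
print(abs(dm - ips_diff) < 1e-6)
# → True
```

With the true 0.5 propensities instead of the empirical ones, the two estimates agree only approximately, which is exactly the bias/variance trade-off the off-policy literature studies.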
Swapping variables in mathematical formulas during graph contrastive learning surprisingly improves retrieval accuracy by preserving crucial algebraic relationships.
Retrieval augmentation lets head avatars handle novel expressions better by mixing in similar expressions from a large unlabeled dataset during training, boosting generalization without extra labels or architecture changes.
Stop blindly optimizing for retrieval relevance in RAG pipelines: coverage-based retrieval metrics are better early indicators of the final generated response's information coverage.
YouTube channels favored by users with extreme ideologies disproportionately produce content laced with anger and grievance, amplifying ideological shifts.
Unlock the hidden knowledge in millions of pathology reports: PathoScribe turns static archives into a reasoning-enabled "living library" accessible via natural language.
LLMs can achieve state-of-the-art results on complex reasoning tasks with far fewer parameters by iteratively excavating and reasoning over external knowledge.
Achieve state-of-the-art personalized gaze estimation by intelligently reweighting pre-trained features, rather than learning new ones from scratch.
By explicitly modeling speech, SAVE leapfrogs existing audio-visual methods for video-text retrieval, achieving substantial gains over the state-of-the-art.
A consensus-driven multi-LLM pipeline can improve information extraction for missing-person investigations, offering a practical approach to leveraging LLMs in high-stakes scenarios.
Ditch the extra embedding model: LLMs can retrieve information almost as well using just their internal representations, cutting complexity and latency.
By decomposing RAG along the document axis with specialized agents, SPD-RAG achieves state-of-the-art performance on multi-document QA while slashing API costs by over 60%.
Current machine unlearning methods for recommender systems struggle with robustness and sequential deletions, especially in attention-based and recurrent models, highlighting a critical gap ERASE helps to expose.
Get near-peak performance for your recommender system across GPUs and TPUs without tedious platform-specific tuning, thanks to a new cross-accelerator graph optimization framework.
Injecting retrieved anatomical priors into text-to-CT generation dramatically improves image fidelity and clinical consistency, offering a scalable path to more realistic medical image synthesis.
Current LLM agents stumble when vital information isn't indexed by search engines, but a new multi-agent framework, UIS-Digger, shows how proactive browsing and file parsing can overcome this limitation.
LLMs struggle to provide reliable answers to Islamic queries, but Fanar-Sadiq's multi-agent architecture, with specialized modules for scripture, jurisprudence, and calculations, delivers grounded and verifiable responses.
Unsupervised graph alignment gets a speed boost: GlobAlign-E slashes computation time by an order of magnitude while simultaneously boosting accuracy by up to 20%.
Text-rich networks get a hierarchical upgrade: TIER leverages LLMs and contrastive learning to build taxonomy-aware node embeddings, significantly outperforming existing methods.
By adaptively fusing low- and high-frequency graph signals based on local anomaly context, SAGAD achieves state-of-the-art graph anomaly detection while scaling linearly to large graphs.
Forget independent feature extraction: a new architecture uses LVLMs to explicitly model the relationships between drone and satellite imagery, substantially boosting geolocalization accuracy.
LLMs can generate better recommendations if they pause to verify their reasoning steps, rather than reasoning in one long chain.
Ditch those clunky MBRs: GP-Tree uses fine-grained grid cells in a prefix tree to speed up spatial queries by up to 10x.
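The general idea, grid-cell paths in a prefix tree instead of bounding rectangles, can be sketched with a quadtree-style encoding. The cell-path scheme below is an illustrative assumption, not GP-Tree's actual layout:

```python
def cell_path(x, y, depth=8):
    """Encode a point in [0,1)^2 as a grid-cell path: at each level,
    one digit 0-3 naming the quadrant (a cell prefix, not an MBR)."""
    path = []
    for _ in range(depth):
        qx, qy = int(x >= 0.5), int(y >= 0.5)
        path.append(qx + 2 * qy)
        x, y = x * 2 - qx, y * 2 - qy
    return tuple(path)

def build_prefix_tree(points, depth=8):
    """Trie keyed by cell paths; each leaf bucket holds its cell's points."""
    tree = {}
    for p in points:
        node = tree
        for digit in cell_path(*p, depth):
            node = node.setdefault(digit, {})
        node.setdefault("pts", []).append(p)
    return tree

def points_under(tree, prefix):
    """Region lookup: descend once along the prefix, then collect the
    whole subtree -- no rectangle-overlap tests needed."""
    node = tree
    for digit in prefix:
        if digit not in node:
            return []
        node = node[digit]
    out, stack = [], [node]
    while stack:
        n = stack.pop()
        out.extend(n.get("pts", []))
        stack.extend(v for k, v in n.items() if k != "pts")
    return out

pts = [(0.1, 0.1), (0.2, 0.3), (0.9, 0.9)]
tree = build_prefix_tree(pts)
# Prefix (0,) is the lower-left quadrant.
print(sorted(points_under(tree, (0,))))
# → [(0.1, 0.1), (0.2, 0.3)]
```

Because a region query is a single prefix descent plus a subtree scan, there is no refinement step against overlapping MBRs, which is where the claimed speedups come from.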
Tired of fragmented datasets? SeDa unifies 7.6M+ datasets from 200+ platforms with semantic annotation and provenance tracking, making cross-domain data discovery a breeze.
A hierarchical RAG framework with ensemble inference and LLM-powered query planning crushes the WattBot 2025 Challenge, showing that carefully structured retrieval and answer stabilization are key to high-precision question answering.
RAG can backfire spectacularly on strong LLMs in Quebec insurance QA, causing "context distraction" and performance regressions, even as it massively boosts weaker models.
Recommender systems can move beyond passive item lists: RecPilot's multi-agent framework autonomously explores item spaces and generates user-centric reports, significantly reducing user effort in item evaluation.
Naive retrieval hurts performance when predicting cellular responses to gene perturbations, but a differentiable, cell-type-aware retrieval mechanism like PT-RAG significantly boosts accuracy.
LLMs can now tap into the full power of R's statistical methods: a new retrieval method boosts package retrieval accuracy by 17% by understanding data distributions, not just function names.
Forget task-specific fine-tuning: TSEmbed unlocks SOTA multimodal embeddings by disentangling task objectives with a Mixture-of-Experts and a novel expert-aware negative sampling strategy.
E-commerce retrieval gets a visual boost: domain-specific fine-tuning and two-stage alignment unlock the power of product images, outperforming text-only approaches.
Automatically generating personas from VR app store reviews can efficiently foster empathy and uncover hidden accessibility needs in VR development.
Forget Claude and GPT: KARL, a reinforcement-learning-trained enterprise search agent, achieves Pareto-optimal performance on a diverse suite of search tasks, even outperforming closed models with sufficient compute.
Semantic filtering with LLMs doesn't have to be a slow, linear slog: this new clustering-sampling-voting approach slashes LLM calls by up to 355x without sacrificing accuracy.
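The clustering-sampling-voting pattern fits in a few lines. In this sketch, `llm_is_relevant` is a hypothetical keyword rule standing in for a real LLM call, and the clustering function is assumed given:

```python
import random
from collections import defaultdict

def llm_is_relevant(text):
    """Stand-in for an LLM relevance call (hypothetical: a keyword rule)."""
    return "sports" in text

def cluster_sample_vote(items, cluster_of, sample_size=3, seed=0):
    """Label every item with far fewer LLM calls than one per item:
    group items into clusters, query the LLM only on a small sample
    from each cluster, then propagate the majority vote cluster-wide."""
    rng = random.Random(seed)
    clusters = defaultdict(list)
    for item in items:
        clusters[cluster_of(item)].append(item)
    labels, llm_calls = {}, 0
    for members in clusters.values():
        sample = rng.sample(members, min(sample_size, len(members)))
        votes = [llm_is_relevant(x) for x in sample]
        llm_calls += len(sample)
        verdict = sum(votes) > len(votes) / 2  # majority vote
        for item in members:
            labels[item] = verdict
    return labels, llm_calls

items = [f"sports story {i}" for i in range(50)] + [f"finance story {i}" for i in range(50)]
labels, calls = cluster_sample_vote(items, cluster_of=lambda t: t.split()[0])
print(calls, labels["sports story 7"], labels["finance story 7"])
# → 6 True False
```

Here 100 items cost 6 LLM calls instead of 100; the savings scale with cluster size, provided clusters are semantically pure enough for the vote to be trustworthy.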
By dynamically weighting historical interactions, TIPS lets sequential recommenders see past the biases of what users *actually* clicked, revealing what they *would* have clicked.
Nail design retrieval gets a major upgrade: NaiLIA leverages dense intent descriptions and palette queries to outperform standard methods, opening the door to more nuanced and personalized image search.
Entity recognition models can effectively spot RAG-powered native ads, even when advertisers try to disguise them with different styles.
Human expertise, often overlooked in black-box bidding models, can be effectively injected into online advertising bid optimization via a dual-process control mechanism, leading to significant performance gains.
Ditch Leiden clustering for GraphRAG: k-core decomposition offers a deterministic, faster, and more effective way to build knowledge graph hierarchies for better LLM reasoning.
Forget personalized PageRank and Node2Vec: Jaccard-biased random walks plus rank aggregation yield surprisingly robust node affinities, outperforming alternatives on diverse graph types.
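A Jaccard-biased walk is a small tweak to a vanilla random walk: each step weights a neighbor by how much its neighborhood overlaps the current node's. The smoothing floor and toy graph below are illustrative assumptions:

```python
import random

def jaccard(a, b):
    """Jaccard similarity of two sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

def jaccard_biased_walk(adj, start, length, seed=0):
    """Random walk preferring neighbors whose own neighborhoods overlap
    the current node's (weights proportional to Jaccard similarity, with
    a tiny floor so zero-overlap neighbors remain reachable)."""
    rng = random.Random(seed)
    node, visits = start, []
    for _ in range(length):
        nbrs = list(adj[node])
        weights = [jaccard(adj[node], adj[n]) + 1e-6 for n in nbrs]
        node = rng.choices(nbrs, weights=weights, k=1)[0]
        visits.append(node)
    return visits

# Toy graph: a triangle {a, b, c} with d loosely attached to a.
adj = {
    "a": {"b", "c", "d"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": {"a"},
}
walk = jaccard_biased_walk(adj, "a", 2000)
# From "a", the walk should favor b and c (shared neighbors) over d.
print(walk.count("d") < walk.count("b"))
# → True
```

Visit counts from such walks give per-node affinity scores; the rank-aggregation step the blurb mentions would then combine rankings from walks started at different seeds.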
MOOSEnger achieves a 93% success rate in generating runnable multiphysics simulation inputs from natural language, while LLMs alone fail 92% of the time.
Achieve expert-level hepatology diagnosis by mimicking multidisciplinary consultation, using an AI system that combines knowledge graph reasoning, clinical guidelines, and a multi-agent system for traceable consensus.