47 papers published across 3 labs.
LLM agents can achieve near-perfect memory recall without prohibitive costs by strategically combining fast, lossy retrieval with slower, exhaustive deliberation.
Forget static model averaging: dynamically weighting ensembles based on empirical performance can significantly boost accuracy and interpretability in financial loan default prediction.
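As an illustration of the idea (a minimal sketch, not the paper's exact scheme; function names and the softmax weighting are assumptions), performance-based weighting can be as simple as turning each model's recent validation error into a softmax weight:

```python
import numpy as np

def dynamic_weights(errors, temperature=1.0):
    """Softmax over negative recent errors: lower error -> larger weight.
    Illustrative only; the paper's weighting rule may differ."""
    errors = np.asarray(errors, dtype=float)
    scores = -errors / temperature
    w = np.exp(scores - scores.max())  # subtract max for numerical stability
    return w / w.sum()

def ensemble_predict(per_model_probs, weights):
    """Weighted average of per-model default probabilities, one row per model."""
    return np.asarray(per_model_probs, dtype=float).T @ np.asarray(weights)

# Three models with recent validation errors 0.10, 0.20, 0.40:
w = dynamic_weights([0.10, 0.20, 0.40])
blended = ensemble_predict([[0.2, 0.8], [0.3, 0.7], [0.5, 0.5]], w)
```

The interpretability angle falls out for free: the weights themselves report how much each base model is trusted at any point in time.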
Orthogonal constraints can rescue sparse embeddings in recommender systems from representation collapse, unlocking significant performance gains in large-scale industrial deployments.
LLM-generated survey responses can be statistically accurate yet still miss the option most preferred by humans, highlighting a critical flaw in current evaluation methods.
Automating web data integration for expert querying is now possible: SODIUM-Agent achieves a 2x accuracy boost over existing systems on a new benchmark of 105 real-world tasks.
Hypergraph modeling of patient visits, coupled with contrastive pre-training, significantly boosts medication recommendation accuracy and safety by capturing complex relationships missed by traditional graph-based approaches.
Decentralized competitive allocation provably beats simpler baselines in modular systems with endogenous costs, finally justifying its use with rigorous regret bounds.
Semiparametric bandits can achieve $\tilde{O}(\sqrt{T})$ regret while retaining interpretability, thanks to a novel kernelized ε-greedy algorithm and Stein-based estimation.
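For readers unfamiliar with the ε-greedy backbone this builds on, here is a minimal sketch of a decaying-ε arm selector (an illustration of the generic strategy only; the paper's kernelized estimates and Stein-based estimation are not shown, and the decay constant is an assumption):

```python
import random

def epsilon_greedy(estimates, t, c=1.0):
    """At round t, explore a random arm with probability eps_t ~ c/sqrt(t)
    (exploration decays over time), otherwise exploit the arm with the
    highest current reward estimate. Illustrative sketch only."""
    eps = min(1.0, c / (t ** 0.5))
    if random.random() < eps:
        return random.randrange(len(estimates))
    return max(range(len(estimates)), key=lambda a: estimates[a])
```

In the paper's setting, the `estimates` would come from a kernel regression over contexts rather than a simple per-arm average.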
Citation-grounded supervised fine-tuning drives hallucination rates to zero in encoder-decoder models, showing that explicit citation mechanisms are a potent tool for factual accuracy in dialogue systems.
RAG systems can achieve state-of-the-art performance by explicitly preserving document topology, outperforming LLM-based chunking while simultaneously reducing token overhead.
Forget expensive deep-sea expeditions: GEAR finds structurally similar terrestrial environments with surprising accuracy, opening new avenues for biological research.
Turns out, users care more about late-session swipe delays than early ones when bingeing short videos.
Learning from ranked preferences alone can be surprisingly difficult: even with access to the full ranking of actions, standard online learning guarantees break down unless the environment is sufficiently stable.
Greedy off-policy learning, optimal in theory, can fail spectacularly when supplies are limited, but a simple fix—prioritizing items with high *relative* reward—can restore performance.
Legally mandated data deletion requests can be weaponized to stealthily cripple GNN performance, even if the model appears robust during initial training.
Escape the scripted feel of simulated conversations: Interplay trains independent user and recommender LLMs that interact in real-time, without pre-defined target items, for more realistic and diverse conversational recommendation data.
Current benchmarks fail to rigorously evaluate deep research agents, but a new framework leveraging structured knowledge bases and synthetic data offers a verifiable and scalable solution.
Stop prompt injections cold: PCFI's priority-aware runtime defense intercepts all attacks in testing with zero false positives and negligible overhead.
Stop retrieving background noise: HCQR refines RAG by generating targeted queries that seek evidence to directly support or refute candidate answers.
Imagine a single algorithm that dominates in both predictable and chaotic ranking scenarios: this paper delivers it for multi-dueling bandits.
Multilingual question answering is harder than you think: even state-of-the-art RAG systems stumble when dealing with questions and knowledge in multiple languages.
LLMs exhibit consistent and detectable geographic preferences for brands and cultures, revealing potential biases in market intermediation that persist across user personas.
Spotify's GLIDE model proves that generative LLMs can drive significant gains in podcast discovery and non-habitual listening in a real-world, production environment.
Ditch static embeddings: Generative retrieval, powered by reinforcement learning, lets models dynamically reason about relevance, outperforming larger contrastively-trained models on reasoning-intensive tasks.
Finding a hidden node in a graph just got a whole lot faster: a new algorithm slashes the average search cost with provable approximation guarantees, even with non-uniform query costs.
Naive fine-tuning of VLMs for multimodal sequential recommendation causes catastrophic modality collapse, but can be fixed with gradient rebalancing and cross-modal regularization.
Stop training LLMs to assign arbitrary scores to papers in isolation; comparison-based ranking unlocks significantly better generalization and accuracy in paper evaluation.
Existing citation recommendation benchmarks overestimate real-world performance because they fail to account for the temporal constraints of recommending citations for *new* papers.
Forget tool-augmented systems: NEO shows you can consolidate search, recommendation, and reasoning into a single language-steerable LLM by representing items as SIDs and interleaving them with natural language.
Federated recommendation systems can now better adapt to evolving user preferences without sacrificing privacy, thanks to a novel approach that retains historical knowledge and transfers insights between similar users.
Semantic sorting in LLMs can be twice as fast with no loss in accuracy by strategically combining listwise ranking algorithms.
LLMs forget up to 60% of facts when summarizing and erode over half of project constraints during iterative compaction, but a simple discrete memory system (KOs) fixes this while slashing costs by 252x.
Agentic LLMs are surprisingly vulnerable: a new framework finds successful attacks in 84% of attempts by escalating prompt injection techniques across multiple stages.
Seemingly sophisticated dense retrieval methods can catastrophically fail at contradiction detection due to "Semantic Collapse," highlighting the surprising effectiveness of a simple, decoupled lexical approach for reliable biomedical QA.
LLMs can be systematically shifted from stochastic pattern-matchers to verified truth-seekers using a carefully orchestrated, multi-stage retrieval and verification pipeline.
RAG systems can now achieve 8x better PII leakage protection without sacrificing utility or speed, thanks to a novel "Verify-then-Route" paradigm.
"Superspreader" networks on Twitter amplify contrarian scientific viewpoints, influencing news media coverage and potentially distorting public understanding of science.
LLM-powered recommendation agents, despite their reasoning prowess, are easily manipulated by contextual biases in high-stakes scenarios like paper review and job recruitment.
LLMs armed with RAG can reconstruct cyberattacks with high precision and recall, but the best model for the job depends on your budget: DeepSeek V3 matches Claude Sonnet 4's accuracy at 1/15th the cost.
Forget chasing leaderboard hype: this study reveals that larger embedding models and strategic concatenation are key to unlocking LLM-powered tabular prediction, regardless of public rankings.
No training needed: ARAM dynamically adjusts retrieved context guidance in masked diffusion models based on signal quality, resolving retrieval-prior conflicts on the fly.
Retrieval-augmented LLM agents can learn to learn from experience, achieving significantly better generalization on unseen tasks by combining the strengths of fine-tuning and in-context retrieval.
Discover emergent narratives in real-time without predefined labels, revealing how information evolves during crises.
Stop chasing leaderboard gains on generic benchmarks: PJB reveals that domain-specific weaknesses in person-job retrieval far outweigh the benefits of general model upgrades, and that query understanding modules can actually hurt performance.
LLMs can now recommend drugs with state-of-the-art accuracy by synthesizing individual patient context with the prescribing tendencies of similar cases, outperforming guideline-based and similar-patient retrieval methods.
Forget subjective scouting reports: this framework objectively identifies undervalued football players by blending market dynamics with news sentiment, offering a data-driven edge in talent acquisition.
Forget specialized tools: a standard Unix terminal and clever RL are all you need to beat much larger LLMs at code search.