Search papers, labs, and topics across Lattice.
This survey examines the transformative impact of large language models (LLMs), specifically retrieval-augmented generation (RAG), on web research and applications, moving beyond traditional pipeline architectures. It analyzes how LLMs are reshaping tasks like information retrieval, question answering, and web analytics, while also enabling new applications such as web-based summarization. The paper identifies key developments, open challenges, and future research directions for leveraging LLMs to enhance web-based solutions.
LLMs are not just augmenting web research; they're fundamentally reshaping it by replacing traditional pipelines with generative-retrieval architectures like RAG.
Web research and practices have evolved significantly over time, offering users diverse and accessible solutions across a wide range of tasks. While advanced concepts such as Web 4.0 have emerged from mature technologies, the introduction of large language models (LLMs) has profoundly influenced both the field and its applications. This wave of LLMs has permeated science and technology so deeply that no area remains untouched. Consequently, LLMs are reshaping web research and development, transforming traditional pipelines into generative solutions for tasks like information retrieval, question answering, recommendation systems, and web analytics. They have also enabled new applications such as web-based summarization and educational tools. This survey explores recent advances in the impact of LLMs-particularly through the use of retrieval-augmented generation (RAG)-on web research and industry. It discusses key developments, open challenges, and future directions for enhancing web solutions with LLMs.