Search papers, labs, and topics across Lattice.
The paper introduces CiteLLM, an agentic platform integrated into LaTeX editors for trustworthy scientific reference discovery. It addresses the challenge of LLM hallucinations by using them only for query generation and candidate ranking, while retrieving references from trusted academic repositories. The system achieves superior performance in returning valid and usable references through dynamic discipline-aware routing, paragraph-level semantic matching, and an integrated chatbot for explanation.
CiteLLM offers a LaTeX-integrated agent that grounds claims in trusted academic sources, bypassing LLM hallucinations by using them only for search and ranking, not content generation.
Large language models (LLMs) have created new opportunities to enhance the efficiency of scholarly activities; however, challenges persist in the ethical deployment of AI assistance, including (1) the trustworthiness of AI-generated content, (2) preservation of academic integrity and intellectual property, and (3) protection of information privacy. In this work, we present CiteLLM, a specialized agentic platform designed to enable trustworthy reference discovery for grounding author-drafted claims and statements. The system introduces a novel interaction paradigm by embedding LLM utilities directly within the LaTeX editor environment, ensuring a seamless user experience and no data transmission outside the local system. To guarantee hallucination-free references, we employ dynamic discipline-aware routing to retrieve candidates exclusively from trusted web-based academic repositories, while leveraging LLMs solely for generating context-aware search queries, ranking candidates by relevance, and validating and explaining support through paragraph-level semantic matching and an integrated chatbot. Evaluation results demonstrate the superior performance of the proposed system in returning valid and highly usable references.