Search papers, labs, and topics across Lattice.
This survey synthesizes recent advances in using LLMs to assist or automate stages of the peer review process, including review generation, rebuttal, meta-review, and revision. It categorizes techniques like fine-tuning, agent-based systems, and RL-based methods for review generation, and also covers methods for after-review tasks and evaluation. The survey provides a structured overview of datasets, modeling choices, limitations, and ethical concerns, offering practical guidance for researchers aiming to integrate LLMs into the peer review workflow.
LLMs are rapidly transforming peer review, but critical gaps remain in ensuring quality, fairness, and ethical considerations across the entire workflow.
Peer review is a multi-stage process involving reviews, rebuttals, meta-reviews, final decisions, and subsequent manuscript revisions. Recent advances in large language models (LLMs) have motivated methods that assist or automate different stages of this pipeline. In this survey, we synthesize techniques for (i) peer review generation, including fine-tuning strategies, agent-based systems, RL-based methods, and emerging paradigms to enhance generation; (ii) after-review tasks including rebuttals, meta-review and revision aligned to reviews; and (iii) evaluation methods spanning human-centered, reference-based, LLM-based and aspect-oriented. We catalog datasets, compare modeling choices, and discuss limitations, ethical concerns, and future directions. The survey aims to provide practical guidance for building, evaluating, and integrating LLM systems across the full peer review workflow.