Exploratory Oncology Research & Clinical Trial Center | National Cancer Center Hospital East | University of Yamanashi | UTokyo | Mar 12, 2026 | arXiv:2603.11597

Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese

Masataka Kawai, Singo Sakashita, Shumpei Ishikawa, Shogo Watanabe, Anna Matsuoka, M. Sakurai, Yasuto Fujimoto, Y. Takahara, A. Ohara, H. Miyake, Genichiro Ishii

AI Summary

This paper evaluates seven open-source LLMs on three tasks relevant to Japanese pathology report writing: structured report generation/extraction, typo correction, and explanatory text generation. Thinking and medical-specialized models excelled at structured reporting and typo correction, while subjective evaluations of explanatory text varied greatly. The study concludes that open-source LLMs can be useful in specific, clinically relevant scenarios for assisting Japanese pathology report writing.

Key Contribution

Open-source LLMs can assist Japanese pathology report writing in specific scenarios, but pathologists and clinicians differ substantially in which model's explanatory text they prefer.

Abstract

The performance of large language models (LLMs) for supporting pathology report writing in Japanese remains unexplored. We evaluated seven open-source LLMs from three perspectives: (A) generation and information extraction of pathology diagnosis text following predefined formats, (B) correction of typographical errors in Japanese pathology reports, and (C) subjective evaluation of model-generated explanatory text by pathologists and clinicians. Thinking models and medical-specialized models showed advantages in structured reporting tasks that required reasoning and in typo correction. In contrast, preferences for explanatory outputs varied substantially across raters. Although the utility of LLMs differed by task, our findings suggest that open-source LLMs can be useful for assisting Japanese pathology report writing in limited but clinically relevant scenarios.

Eval Frameworks & Benchmarks | Natural Language Processing | Open-Source Models & Weights

Citation Metrics

Citations: 0

Influential citations: 0

References: 9

Year: 2026

Venue: N/A
