This paper introduces Luwen, an open-source Chinese legal language model built upon the Baichuan foundation model. The model is trained using continual pre-training on a large legal corpus, supervised fine-tuning with legal instruction data, and retrieval-augmented generation backed by a legal knowledge base. Experiments across five legal tasks show that Luwen outperforms strong baselines, demonstrating effective adaptation of general LLMs to the legal domain.
Luwen, released as open source, shows that adapting general-purpose LLMs with legal-specific data and retrieval augmentation yields significant performance gains on complex Chinese legal tasks.
Large language models have demonstrated remarkable capabilities across a wide range of natural language processing tasks, yet their application in the legal domain remains challenging due to the specialized terminology, complex reasoning requirements, and rapidly evolving legal knowledge involved. In this paper, we present Luwen, an open-source Chinese legal language model built upon the Baichuan foundation model through three key techniques: continual pre-training on a large-scale legal corpus, supervised fine-tuning with carefully curated legal instruction data, and retrieval-augmented generation integrated with a comprehensive legal knowledge base. We evaluate Luwen on five representative legal tasks spanning both prediction and generation settings, including legal judgment prediction, judicial examination, legal text summarization, law article question answering, and judicial decision reasoning. Experimental results show that Luwen outperforms several strong baselines, demonstrating the effectiveness of our approach in adapting general-purpose language models to the legal domain.
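The retrieval-augmented generation step described above can be illustrated with a minimal sketch: given a user query, retrieve the most relevant articles from a legal knowledge base and prepend them to the model's prompt. The toy bag-of-words embedding, the example statute texts, and the prompt template are all illustrative assumptions, not details from the paper; a real system would use a dense encoder and the actual knowledge base.

```python
import math
import re
from collections import Counter

def embed(text):
    # Toy bag-of-words embedding (assumption for illustration);
    # production systems would use a dense neural encoder.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, knowledge_base, k=1):
    # Rank knowledge-base entries by similarity to the query.
    q = embed(query)
    ranked = sorted(knowledge_base,
                    key=lambda doc: cosine(q, embed(doc)),
                    reverse=True)
    return ranked[:k]

def build_prompt(query, knowledge_base):
    # Prepend retrieved articles so the LLM can ground its answer.
    context = "\n".join(retrieve(query, knowledge_base))
    return f"Reference articles:\n{context}\n\nQuestion: {query}\nAnswer:"

# Hypothetical knowledge-base entries, not the paper's actual corpus.
kb = [
    "Article 264: Theft of public or private property is punishable "
    "by fixed-term imprisonment.",
    "Article 133: Causing a traffic accident through negligence "
    "incurs criminal liability.",
]

print(build_prompt("What is the penalty for theft?", kb))
```

The resulting prompt would then be passed to the fine-tuned model, so its answer can cite the retrieved article rather than relying solely on parametric knowledge.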