Feb 15, 2026arXiv:2602.14081

CCiV: A Benchmark for Structure, Rhythm and Quality in LLM-Generated Chinese \textit{Ci} Poetry

Shangqing Zhao, Yupei Ren, Yuhao Zhou, Xiaopeng Bai, Man Lan

AI Summary

The paper introduces CCiV, a benchmark for evaluating LLM-generated Chinese *Ci* poetry based on structure, rhythm, and quality, using 30 *Cipai* to evaluate 17 LLMs. The benchmark reveals that LLMs often generate valid but unexpected historical variants and struggle more with tonal patterns than structural rules. Form-aware prompting improves stronger models' structural and tonal control but can degrade weaker models, exposing a misalignment between formal correctness and literary quality.

Key Contribution

LLMs writing Chinese poetry often nail the structure but botch the rhythm, and even when they get both right, the result can still be artistically bankrupt.

Abstract

The generation of classical Chinese \textit{Ci} poetry, a form demanding a sophisticated blend of structural rigidity, rhythmic harmony, and artistic quality, poses a significant challenge for large language models (LLMs). To systematically evaluate and advance this capability, we introduce \textbf{C}hinese \textbf{Ci}pai \textbf{V}ariants (\textbf{CCiV}), a benchmark designed to assess LLM-generated \textit{Ci} poetry across these three dimensions: structure, rhythm, and quality. Our evaluation of 17 LLMs on 30 \textit{Cipai} reveals two critical phenomena: models frequently generate valid but unexpected historical variants of a poetic form, and adherence to tonal patterns is substantially harder than structural rules. We further show that form-aware prompting can improve structural and tonal control for stronger models, while potentially degrading weaker ones. Finally, we observe weak and inconsistent alignment between formal correctness and literary quality in our sample. CCiV highlights the need for variant-aware evaluation and more holistic constrained creative generation methods.

Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

CCiV: A Benchmark for Structure, Rhythm and Quality in LLM-Generated Chinese \textit{Ci} Poetry

Related Papers