CMU MLApr 15, 2026arXiv:2604.14111

Interpretable Stylistic Variation in Human and LLM Writing Across Genres, Models, and Decoding Strategies

Swati Rallapalli, Swati Rallapalli, Shannon Gallagher, Shannon K. Gallagher, Ronald Yurko, Ronald Yurko, Tyler Brooks, Tyler Brooks, Chuck Loughin, Charles Loughin, Michele Sezgin, Michele Sezgin, Violet Turri, Violet Turri

AI Summary

This paper analyzes stylistic differences between human and LLM-generated text across 8 genres and 11 LLMs using Biber's linguistic features. The study finds that genre is a stronger determinant of style than the source (human vs. LLM), and that model choice impacts style more than decoding strategy. Surprisingly, prompting strategies aimed at mimicking human style had little impact on the key linguistic differentiators of LLM-generated text.

Key Contribution

LLMs can mimic human writing, but not as well as you think: genre matters more than the source (human vs. LLM), and model choice trumps decoding strategy when it comes to style.

Abstract

Large Language Models (LLMs) are now capable of generating highly fluent, human-like text. They enable many applications, but also raise concerns such as large scale spam, phishing, or academic misuse. While much work has focused on detecting LLM-generated text, only limited work has gone into understanding the stylistic differences between human-written and machine-generated text. In this work, we perform a large scale analysis of stylistic variation across human-written text and outputs from 11 LLMs spanning 8 different genres and 4 decoding strategies using Douglas Biber's set of lexicogrammatical and functional features. Our findings reveal insights that can guide intentional LLM usage. First, key linguistic differentiators of LLM-generated text seem robust to generation conditions (e.g., prompt settings to nudge them to generate human-like text, or availability of human-written text to continue the style); second, genre exerts a stronger influence on stylistic features than the source itself; third, chat variants of the models generally appear to be clustered together in stylistic space, and finally, model has a larger effect on the style than decoding strategy, with some exceptions. These results highlight the relative importance of model and genre over prompting and decoding strategies in shaping the stylistic behavior of machine-generated text.

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References29

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Interpretable Stylistic Variation in Human and LLM Writing Across Genres, Models, and Decoding Strategies

Related Papers