This paper introduces a parameter-efficient fine-tuning approach for text style transfer (TST) using large language models (LLMs). It addresses the lack of parallel style corpora by synthesizing data via round-trip translation to a 'neutralized' style, creating a shared input style for training and inference. Experiments across four domains demonstrate the method's superiority over zero-shot prompting and few-shot in-context learning (ICL), as measured by BLEU and style accuracy, with further improvements from retrieval-augmented generation (RAG).
Forget parallel corpora: this style transfer method uses round-trip translation to synthesize training data for parameter-efficient LLM fine-tuning, outperforming zero-shot prompting and few-shot ICL baselines.
This paper proposes a novel method for text style transfer (TST) based on parameter-efficient fine-tuning of large language models (LLMs). To address the scarcity of parallel corpora that map between styles, the study uses round-trip translation to synthesize such parallel datasets from monolingual corpora. Round-trip translation produces 'neutralized' text devoid of stylistic attributes, which serves as a shared input style at both training time and inference time. Across the four investigated domains, the method consistently outperforms zero-shot prompting and few-shot in-context learning (ICL), as measured by BLEU and style accuracy. Furthermore, integrating retrieval-augmented generation (RAG) for terminology and name knowledge enhances robustness and stylistic consistency.
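
The data synthesis step described above can be illustrated with a minimal sketch. It is an assumption-laden outline, not the authors' implementation: the names `StylePair`, `round_trip`, `build_pairs`, and the toy translators are hypothetical, and the summary does not say which machine translation system or pivot language the paper uses. The point is only the shape of the data: each styled sentence is translated to a pivot language and back, and the neutralized result becomes the model input while the original sentence becomes the training target.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# A translator is any function mapping text to text. In practice this would
# wrap a machine translation system; which one is an assumption left open here.
Translator = Callable[[str], str]


@dataclass(frozen=True)
class StylePair:
    source: str  # neutralized text (model input)
    target: str  # original styled text (training target)


def round_trip(text: str, to_pivot: Translator, from_pivot: Translator) -> str:
    """Translate into a pivot language and back to neutralize style."""
    return from_pivot(to_pivot(text))


def build_pairs(
    styled_sentences: Iterable[str],
    to_pivot: Translator,
    from_pivot: Translator,
) -> List[StylePair]:
    """Create pseudo-parallel (neutralized -> styled) pairs from monolingual text."""
    pairs = []
    for sentence in styled_sentences:
        neutral = round_trip(sentence, to_pivot, from_pivot)
        if neutral.strip():  # skip sentences the translator emptied out
            pairs.append(StylePair(source=neutral, target=sentence))
    return pairs


# Toy stand-ins for real translators (hypothetical, for illustration only):
# the forward step flattens case and punctuation, the backward step is identity.
def toy_to_pivot(text: str) -> str:
    return text.lower().replace("!", ".")


def toy_from_pivot(text: str) -> str:
    return text


pairs = build_pairs(["Hark! The Dawn Breaks!"], toy_to_pivot, toy_from_pivot)
for pair in pairs:
    print(f"{pair.source!r} -> {pair.target!r}")
```

At inference time the same `round_trip` call would be applied to the user's input before it reaches the fine-tuned model, so that training and inference inputs share the neutralized style.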