Search papers, labs, and topics across Lattice.
This paper explores the semantic connections between Herman Melville's reading and writing using computational methods. They compare passages from Melville's works with texts from his library, segmenting texts at the sentence and 5-gram level and computing similarity using BERTScore. The study demonstrates that this approach can identify expert-identified instances of similarity and highlight new passages for qualitative examination, suggesting a framework for source and influence studies.
Computational semantic similarity can successfully capture literary influences, identifying connections between an author's reading and writing that warrant further qualitative examination.
This study investigates the potential influence of Herman Melville reading on his own writings through computational semantic similarity analysis. Using documented records of books known to have been owned or read by Melville, we compare selected passages from his works with texts from his library. The methodology involves segmenting texts at both sentence level and non-overlapping 5-gram level, followed by similarity computation using BERTScore. Rather than applying fixed thresholds to determine reuse, we interpret precision, recall, and F1 scores as indicators of possible semantic alignment that may suggest literary influence. Experimental results demonstrate that the approach successfully captures expert-identified instances of similarity and highlights additional passages warranting further qualitative examination. The findings suggest that semantic similarity methods provide a useful computational framework for supporting source and influence studies in literary scholarship.