Search papers, labs, and topics across Lattice.
The paper introduces Perspectives, an interactive document clustering tool built as an extension to the Discourse Analysis Tool Suite, aimed at assisting Digital Humanities scholars in exploring large document collections. It employs a flexible, aspect-focused document clustering pipeline, incorporating document rewriting prompts and instruction-based embeddings to initially steer the clustering process. The system allows for human-in-the-loop refinement through cluster editing and embedding model fine-tuning, enabling users to uncover topics and sentiments.
Uncover hidden topics and sentiments in your document collections with a new interactive clustering tool designed for Digital Humanities.
This paper introduces Perspectives, an interactive extension of the Discourse Analysis Tool Suite designed to empower Digital Humanities (DH) scholars to explore and organize large, unstructured document collections. Perspectives implements a flexible, aspect-focused document clustering pipeline with human-in-the-loop refinement capabilities. We showcase how this process can be initially steered by defining analytical lenses through document rewriting prompts and instruction-based embeddings, and further aligned with user intent through tools for refining clusters and mechanisms for fine-tuning the embedding model. The demonstration highlights a typical workflow, illustrating how DH researchers can leverage Perspectives's interactive document map to uncover topics, sentiments, or other relevant categories, thereby gaining insights and preparing their data for subsequent in-depth analysis.