Search papers, labs, and topics across Lattice.
This paper introduces Delta-Consistent Scoring (DCS), a novel annotation-free framework that leverages frozen LLM representations to decode the hawkish-dovish stance of FOMC statements by jointly modeling absolute stance and relative inter-meeting shifts. DCS uses consecutive meetings as self-supervision, learning absolute stance scores and relative shift scores, and enforces delta-consistency between them, eliminating the need for manual labeling. Experiments across four LLM backbones demonstrate that DCS outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level hawkish-dovish classification and generating economically meaningful meeting-level scores.
Ditch the labels: this method uses LLMs to extract monetary policy signals from FOMC statements by tracking subtle shifts between meetings, outperforming supervised approaches without any manual annotation.
Federal Open Market Committee (FOMC) statements are a major source of monetary-policy information, and even subtle changes in their wording can move global financial markets. A central task is therefore to measure the hawkish--dovish stance conveyed in these texts. Existing approaches typically treat stance detection as a standard classification problem, labeling each statement in isolation. However, the interpretation of monetary-policy communication is inherently relative: market reactions depend not only on the tone of a statement, but also on how that tone shifts across meetings. We introduce Delta-Consistent Scoring (DCS), an annotation-free framework that maps frozen large language model (LLM) representations to continuous stance scores by jointly modeling absolute stance and relative inter-meeting shifts. Rather than relying on manual hawkish--dovish labels, DCS uses consecutive meetings as a source of self-supervision. It learns an absolute stance score for each statement and a relative shift score between consecutive statements. A delta-consistency objective encourages changes in absolute scores to align with the relative shifts. This allows DCS to recover a temporally coherent stance trajectory without manual labels. Across four LLM backbones, DCS consistently outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level hawkish--dovish classification. The resulting meeting-level scores are also economically meaningful: they correlate strongly with inflation indicators and are significantly associated with Treasury yield movements. Overall, the results suggest that LLM representations encode monetary-policy signals that can be recovered through relative temporal structure.