Search papers, labs, and topics across Lattice.
This paper introduces METRO, a method for inducing dialogue strategies from expert transcripts for non-collaborative agents using LLMs. METRO constructs a Strategy Forest, a hierarchical structure representing short-term responses and long-term strategic foresight, directly from raw transcripts. Experiments on two benchmarks demonstrate that METRO outperforms existing methods by 9%-10%, exhibiting strategic behavioral diversity, foresight, and cross-task transferability.
Forget hand-coded strategies: METRO uses LLMs to automatically learn dialogue strategies from expert transcripts, achieving state-of-the-art results in non-collaborative dialogue.
Developing non-collaborative dialogue agents traditionally requires the manual, unscalable codification of expert strategies. We propose \ours, a method that leverages large language models to autonomously induce both strategy actions and planning logic directly from raw transcripts. METRO formalizes expert knowledge into a Strategy Forest, a hierarchical structure that captures both short-term responses (nodes) and long-term strategic foresight (branches). Experimental results across two benchmarks show that METRO demonstrates promising performance, outperforming existing methods by an average of 9%-10%. Our further analysis not only reveals the success behind METRO (strategic behavioral diversity and foresight), but also demonstrates its robust cross-task transferability. This offers new insights into building non-collaborative agents in a cost-effective and scalable way. Our code is available at https://github.com/Humphrey-0125/METRO.