Search papers, labs, and topics across Lattice.
This paper introduces a novel framework for estimating staged event tree models by applying hierarchical clustering on the probability simplex. The approach leverages simplex-based divergences, including Total Variation, Hellinger, Fisher, and Kaniadakis, combined with various linkage methods. Through simulation experiments, the authors demonstrate that Total Variation divergence with Ward.D2 linkage achieves a superior balance of model fit, structure recovery, and computational efficiency compared to Backward Hill Climbing.
Forget computationally expensive methods: hierarchical clustering on the simplex with Total Variation divergence offers a surprisingly efficient route to estimating staged event tree models.
Staged tree models enhance Bayesian networks by incorporating context-specific dependencies through a stage-based structure. In this study, we present a new framework for estimating staged trees using hierarchical clustering on the probability simplex, utilizing simplex basesd divergences. We conduct a thorough evaluation of several distance and divergence metrics including Total Variation, Hellinger, Fisher, and Kaniadakis; alongside various linkage methods such as Ward.D2, average, complete, and McQuitty. We conducted the simulation experiments that reveals Total Variation, especially when combined with Ward.D2 linkage, consistently produces staged trees with better model fit, structure recovery, and computational efficiency. We assess performance by utilizing relative Bayesian Information Criterion (BIC), and Hamming distance. Our findings indicate that although Backward Hill Climbing (BHC) delivers competitive outcomes, it incurs a significantly higher computational cost. On the other, Total Variation divergence with Ward.D2 linkage, achieves similar performance while providing significantly better computational efficiency, making it a more viable option for large-scale or time sensitive tasks.