Search papers, labs, and topics across Lattice.
The paper introduces enhancements to the Projection Pursuit Tree (PPTree) classifier to improve its performance in complex, high-dimensional, multi-class classification problems. The enhancements address limitations in the original PPTree algorithm by allowing for more splits and flexible class groupings, particularly beneficial when dealing with unequal variance-covariance structures and nonlinear class separations. The authors validate the improvements through high-dimensional visualization techniques and an interactive web application, demonstrating the enhanced classifier's behavior on benchmark datasets.
PPTree classifiers get a glow-up with flexible splits and class groupings, plus interactive visualizations to prove they actually work in high dimensions.
This paper presents enhancements to the projection pursuit tree classifier and visual diagnostic methods for assessing their impact in high dimensions. The original algorithm uses linear combinations of variables in a tree structure where depth is constrained to be less than the number of classes -- a limitation that proves too rigid for complex classification problems. Our extensions improve performance in multi-class settings with unequal variance-covariance structures and nonlinear class separations by allowing more splits and more flexible class groupings in the projection pursuit computation. Proposing algorithmic improvements is straightforward; demonstrating their actual utility is not. We therefore develop two visual diagnostic approaches to verify that the enhancements perform as intended. Using high-dimensional visualization techniques, we examine model fits on benchmark datasets to assess whether the algorithm behaves as theorized. An interactive web application enables users to explore the behavior of both the original and enhanced classifiers under controlled scenarios. The enhancements are implemented in the R package PPtreeExt.