This paper investigates shortcut learning in machine learning models trained to classify geometric knots, revealing that models often rely on non-topological features present in Molecular Dynamics simulation data. The authors demonstrate that ML models can achieve high classification accuracy by exploiting these hidden, non-topological features rather than learning true topological invariants. To address this, they introduce a new dataset and code for generating knot embeddings that mitigate the influence of non-topological features, aiming to facilitate the development of more robust ML models for knot classification.
Machine learning models classifying geometric knots are cheating by exploiting hidden, non-topological features in simulation data, highlighting a need for more rigorous training datasets.
Classifying the topology of closed curves is a central problem in low-dimensional topology, with applications beyond mathematics spanning protein folding, polymer physics and even magnetohydrodynamics. The core question is how to determine whether two embeddings of a closed curve are equivalent under ambient isotopy. Given the striking ability of neural networks to solve complex classification tasks, it is natural to ask whether the knot classification problem can be tackled using Machine Learning (ML). In this paper, we investigate generic shortcut methods employed by ML to solve the knot classification challenge. Specifically, we discover hidden non-topological features in training data generated through Molecular Dynamics simulations of polygonal knots, which ML models use to arrive at positive classification results. We then provide a rigorous foundation for future attempts to tackle the knot classification challenge using ML by developing a publicly available (i) dataset that aims to remove the potential for non-topological feature classification and (ii) code that can generate knot embeddings that faithfully explore a chosen geometric state space with fixed knot topology. We expect that our work will accelerate the development of ML models that can solve complex geometric knot classification challenges.
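To make the notion of a non-topological feature concrete, the following minimal sketch builds two polygonal embeddings of the same knot type and shows that a simple geometric quantity differs between them. It is an illustration only: the abstract does not name the features the models exploit, so the radius of gyration is a hypothetical stand-in, and the function names `trefoil_polygon` and `radius_of_gyration`, the parametrisation, and the vertex count are all assumptions rather than part of the authors' dataset or code.

```python
import math


def trefoil_polygon(n_vertices=60, scale=1.0):
    """Return vertices of a closed polygonal trefoil.

    Uses a standard trefoil parametrisation; n_vertices=60 is an assumed
    resolution, taken to be fine enough to preserve the knot type.
    """
    vertices = []
    for k in range(n_vertices):
        t = 2.0 * math.pi * k / n_vertices
        x = math.sin(t) + 2.0 * math.sin(2.0 * t)
        y = math.cos(t) - 2.0 * math.cos(2.0 * t)
        z = -math.sin(3.0 * t)
        vertices.append((scale * x, scale * y, scale * z))
    return vertices


def radius_of_gyration(vertices):
    """Root-mean-square distance of the vertices from their centroid."""
    n = len(vertices)
    centroid = [sum(v[i] for v in vertices) / n for i in range(3)]
    mean_sq = sum(
        sum((v[i] - centroid[i]) ** 2 for i in range(3)) for v in vertices
    ) / n
    return math.sqrt(mean_sq)


# Two embeddings of the same knot type: the second is a uniform rescaling,
# which is an ambient isotopy and so leaves the topology unchanged.
rg_small = radius_of_gyration(trefoil_polygon(scale=1.0))
rg_large = radius_of_gyration(trefoil_polygon(scale=2.0))

print(f"R_g at scale 1: {rg_small:.3f}")
print(f"R_g at scale 2: {rg_large:.3f}")
```

Both curves are trefoils, yet the two printed values differ by the scale factor. Any quantity that changes like this under ambient isotopy carries no topological information by itself; if it happens to correlate with the knot label in a particular training set, a classifier can score well without learning an invariant.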