Search papers, labs, and topics across Lattice.
This paper reviews the shift from supervised learning to unsupervised and self-supervised learning (SSL) in biomedicine to overcome the annotation bottleneck. It highlights how SSL methods can learn directly from the intrinsic structure of biomedical data, such as MRI pixels or genomic sequence tokens. The review demonstrates that these methods enable novel phenotype discovery, linking morphology to genetics, and anomaly detection, often matching or surpassing the performance of supervised methods.
Unsupervised AI is poised to revolutionize biomedicine by unlocking insights from massive datasets, rivaling or surpassing supervised methods without relying on scarce expert annotations.
The dependence on expert annotation has long constituted the primary rate-limiting step in the application of artificial intelligence to biomedicine. While supervised learning drove the initial wave of clinical algorithms, a paradigm shift towards unsupervised and self-supervised learning (SSL) is currently unlocking the latent potential of biobank-scale datasets. By learning directly from the intrinsic structure of data - whether pixels in a magnetic resonance image (MRI), voxels in a volumetric scan, or tokens in a genomic sequence - these methods facilitate the discovery of novel phenotypes, the linkage of morphology to genetics, and the detection of anomalies without human bias. This article synthesises seminal and recent advances in "learning without labels," highlighting how unsupervised frameworks can derive heritable cardiac traits, predict spatial gene expression in histology, and detect pathologies with performance that rivals or exceeds supervised counterparts.