Bremen, KU, Leibniz | Feb 18, 2026 | arXiv:2602.16352

Machine Learning in Epidemiology

Marvin N. Wright, Lukas Burk, Pegah Golchian, Jan Kapar, Niklas Koenen, Sophie Hanna Langbein

AI Summary

This chapter introduces machine learning methodologies for analyzing complex epidemiological data, addressing the challenges posed by increasing data volume and dimensionality. It covers supervised and unsupervised learning principles, key ML methods, model evaluation strategies, hyperparameter optimization, and interpretable ML techniques. The chapter provides practical R code examples using a heart disease dataset to illustrate the application of these methods.

Key Contribution

Epidemiologists can leverage this practical guide, complete with R code, to navigate the complexities of applying machine learning to modern, high-dimensional health data.

Abstract

In the age of digital epidemiology, epidemiologists are faced with an increasing amount of data of growing complexity and dimensionality. Machine learning is a set of powerful tools that can help to analyze such enormous amounts of data. This chapter lays the methodological foundations for successfully applying machine learning in epidemiology. It covers the principles of supervised and unsupervised learning and discusses the most important machine learning methods. Strategies for model evaluation and hyperparameter optimization are developed and interpretable machine learning is introduced. All these theoretical parts are accompanied by code examples in R, where an example dataset on heart disease is used throughout the chapter.

Natural Language Processing | Scientific Discovery & Drug Design

Citation Metrics

Citations: 0

Influential citations: 0

References: 87

Year: 2026

Venue: N/A
