Skip to content

Sharing successes, challenges and daily happenings in the Department of Medicine

Medicine Matters Home Article of the Week Addressing Missingness in Predictive Models That Use Electronic Health Record Data

Addressing Missingness in Predictive Models That Use Electronic Health Record Data

ARTICLE: Addressing Missingness in Predictive Models That Use Electronic Health Record Data

AUTHORS: Shanshan Lin, Rolf H H Groenwold, Hemalkumar B Mehta, Ji Soo KimJodi B Segal

JOURNAL: Ann Intern Med. 2025 Oct;178(10):1451-1463. doi: 10.7326/ANNALS-24-01516. Epub 2025 Sep 9.

Abstract

Electronic health record (EHR) data are increasingly used to develop prediction models that guide clinical decision making at the point of care. These include algorithms that use high-frequency data, like in sepsis prediction, as well as simpler equations, such as the Pooled Cohort Equations for cardiovascular outcome prediction. Although EHR data used in prediction models are often highly granular and more current than other data, there is systematic and nonsystematic missingness in EHR data as there is with most data. Despite growing use for clinical decisions, algorithms implemented in EHRs are mostly unregulated and are often opaque to the user. Guidelines about the development, validation, implementation, and reporting on clinical prediction models are sparse in their recommendations regarding missing data. This article characterizes missingness in EHR data, summarizes methods for attending to missing data when developing prediction models, makes recommendations about validation and implementation of models in practice when data are missing, and identifies research needs in this field.

For the full article, click here.

For a link to the abstract, click here.

Kelsey Bennett

Kelsey Bennett

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.