TY - JOUR
T1 - Minimizing and quantifying uncertainty in AI-informed decisions
T2 - Applications in medicine
AU - Curtis, Samuel D.
AU - Panda, Sambit
AU - Li, Adam
AU - Xu, Haoyin
AU - Bai, Yuxin
AU - Ogihara, Itsuki
AU - O’Reilly, Eliza
AU - Wang, Yuxuan
AU - Dobbyn, Lisa
AU - Popoli, Maria
AU - Ptak, Janine
AU - Nehme, Nadine
AU - Silliman, Natalie
AU - Tie, Jeanne
AU - Gibbs, Peter
AU - Ho-Pham, Lan T.
AU - Tran, Bich N.H.
AU - Tran, Thach S.
AU - Nguyen, Tuan V.
AU - Irajizad, Ehsan
AU - Goggins, Michael
AU - Wolfgang, Christopher L.
AU - Wang, Tian Li
AU - Shih, Ie Ming
AU - Fader, Amanda
AU - Lennon, Anne Marie
AU - Hruban, Ralph H.
AU - Bettegowda, Chetan
AU - Gilbert, Lucy
AU - Kinzler, Kenneth W.
AU - Papadopoulos, Nickolas
AU - Vogelstein, Bert
AU - Vogelstein, Joshua T.
AU - Douville, Christopher
N1 - Publisher Copyright:
Copyright © 2025 the Author(s). Published by PNAS.
PY - 2025/8/26
Y1 - 2025/8/26
N2 - AI is now a cornerstone of modern dataset analysis. In many real world applications, practitioners are concerned with controlling specific kinds of errors, rather than minimizing the overall number of errors. For example, biomedical screening assays may primarily be concerned with mitigating the number of false positives rather than false negatives. Quantifying uncertainty in AI-based predictions, and in particular those controlling specific kinds of errors, remains theoretically and practically challenging. We develop a strategy called multidimensional informed generalized hypothesis testing (MIGHT) which we prove accurately quantifies uncertainty and confidence given sufficient data, and concomitantly controls for particular error types. Our key insight was that it is possible to integrate canonical cross-validation and parametric calibration procedures within a nonparametric ensemble method. Simulations demonstrate that while typical AI based-approaches cannot be trusted to obtain the truth, MIGHT can be. We apply MIGHT to answer an open question in liquid biopsies using circulating cell-free DNA (ccfDNA) in individuals with or without cancer: Which biomarkers, or combinations thereof, can we trust? Performance estimates produced by MIGHT on ccfDNA data have coefficients of variation that are often orders of magnitude lower than other state of the art algorithms such as support vector machines, random forests, and Transformers, while often also achieving higher sensitivity. We find that combinations of variable sets often decrease rather than increase sensitivity over the optimal single variable set because some variable sets add more noise than signal. This work demonstrates the importance of quantifying uncertainty and confidence—with theoretical guarantees—for the interpretation of real-world data.
AB - AI is now a cornerstone of modern dataset analysis. In many real world applications, practitioners are concerned with controlling specific kinds of errors, rather than minimizing the overall number of errors. For example, biomedical screening assays may primarily be concerned with mitigating the number of false positives rather than false negatives. Quantifying uncertainty in AI-based predictions, and in particular those controlling specific kinds of errors, remains theoretically and practically challenging. We develop a strategy called multidimensional informed generalized hypothesis testing (MIGHT) which we prove accurately quantifies uncertainty and confidence given sufficient data, and concomitantly controls for particular error types. Our key insight was that it is possible to integrate canonical cross-validation and parametric calibration procedures within a nonparametric ensemble method. Simulations demonstrate that while typical AI based-approaches cannot be trusted to obtain the truth, MIGHT can be. We apply MIGHT to answer an open question in liquid biopsies using circulating cell-free DNA (ccfDNA) in individuals with or without cancer: Which biomarkers, or combinations thereof, can we trust? Performance estimates produced by MIGHT on ccfDNA data have coefficients of variation that are often orders of magnitude lower than other state of the art algorithms such as support vector machines, random forests, and Transformers, while often also achieving higher sensitivity. We find that combinations of variable sets often decrease rather than increase sensitivity over the optimal single variable set because some variable sets add more noise than signal. This work demonstrates the importance of quantifying uncertainty and confidence—with theoretical guarantees—for the interpretation of real-world data.
KW - biomarkers
KW - biomedical assays
KW - cancer screening
KW - hypothesis testing
KW - predictive modeling
UR - https://www.scopus.com/pages/publications/105014336475
U2 - 10.1073/pnas.2424203122
DO - 10.1073/pnas.2424203122
M3 - 文章
C2 - 40833408
AN - SCOPUS:105014336475
SN - 0027-8424
VL - 122
SP - e2424203122
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 34
M1 - e2424203122
ER -