TY - JOUR
T1 - Using a machine learning approach to predict mortality in critically ill influenza patients
T2 - A cross-sectional retrospective multicentre study in Taiwan
AU - Hu, Chien An
AU - Chen, Chia Ming
AU - Fang, Yen Chun
AU - Liang, Shinn Jye
AU - Wang, Hao Chien
AU - Fang, Wen Feng
AU - Sheu, Chau Chyun
AU - Perng, Wann Cherng
AU - Yang, Kuang Yao
AU - Kao, Kuo Chin
AU - Wu, Chieh Liang
AU - Tsai, Chwei Shyong
AU - Lin, Ming Yen
AU - Chao, Wen Cheng
N1 - Publisher Copyright:
© Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
PY - 2020/2/25
Y1 - 2020/2/25
N2 - Current mortality prediction models used in the intensive care unit (ICU) have a limited role for specific diseases such as influenza, and we aimed to establish an explainable machine learning (ML) model for predicting mortality in critically ill influenza patients using a real-world severe influenza data set. Study design A cross-sectional retrospective multicentre study in Taiwan Setting Eight medical centres in Taiwan. Participants A total of 336 patients requiring ICU-admission for virology-proven influenza at eight hospitals during an influenza epidemic between October 2015 and March 2016. Primary and secondary outcome measures We employed extreme gradient boosting (XGBoost) to establish the prediction model, compared the performance with logistic regression (LR) and random forest (RF), demonstrated the feature importance categorised by clinical domains, and used SHapley Additive exPlanations (SHAP) for visualised interpretation. Results The data set contained 76 features of the 336 patients with severe influenza. The severity was apparently high, as shown by the high Acute Physiology and Chronic Health Evaluation II score (22, 17 to 29) and pneumonia severity index score (118, 88 to 151). XGBoost model (area under the curve (AUC): 0.842; 95% CI 0.749 to 0.928) outperformed RF (AUC: 0.809; 95% CI 0.629 to 0.891) and LR (AUC: 0.701; 95% CI 0.573 to 0.825) for predicting 30-day mortality. To give clinicians an intuitive understanding of feature exploitation, we stratified features by the clinical domain. The cumulative feature importance in the fluid balance domain, ventilation domain, laboratory data domain, demographic and symptom domain, management domain and severity score domain was 0.253, 0.113, 0.177, 0.140, 0.152 and 0.165, respectively. We further used SHAP plots to illustrate associations between features and 30-day mortality in critically ill influenza patients. Conclusions We used a real-world data set and applied an ML approach, mainly XGBoost, to establish a practical and explainable mortality prediction model in critically ill influenza patients.
AB - Current mortality prediction models used in the intensive care unit (ICU) have a limited role for specific diseases such as influenza, and we aimed to establish an explainable machine learning (ML) model for predicting mortality in critically ill influenza patients using a real-world severe influenza data set. Study design A cross-sectional retrospective multicentre study in Taiwan Setting Eight medical centres in Taiwan. Participants A total of 336 patients requiring ICU-admission for virology-proven influenza at eight hospitals during an influenza epidemic between October 2015 and March 2016. Primary and secondary outcome measures We employed extreme gradient boosting (XGBoost) to establish the prediction model, compared the performance with logistic regression (LR) and random forest (RF), demonstrated the feature importance categorised by clinical domains, and used SHapley Additive exPlanations (SHAP) for visualised interpretation. Results The data set contained 76 features of the 336 patients with severe influenza. The severity was apparently high, as shown by the high Acute Physiology and Chronic Health Evaluation II score (22, 17 to 29) and pneumonia severity index score (118, 88 to 151). XGBoost model (area under the curve (AUC): 0.842; 95% CI 0.749 to 0.928) outperformed RF (AUC: 0.809; 95% CI 0.629 to 0.891) and LR (AUC: 0.701; 95% CI 0.573 to 0.825) for predicting 30-day mortality. To give clinicians an intuitive understanding of feature exploitation, we stratified features by the clinical domain. The cumulative feature importance in the fluid balance domain, ventilation domain, laboratory data domain, demographic and symptom domain, management domain and severity score domain was 0.253, 0.113, 0.177, 0.140, 0.152 and 0.165, respectively. We further used SHAP plots to illustrate associations between features and 30-day mortality in critically ill influenza patients. Conclusions We used a real-world data set and applied an ML approach, mainly XGBoost, to establish a practical and explainable mortality prediction model in critically ill influenza patients.
KW - adult intensive & critical care
KW - adult intensive & critical care
KW - infectious diseases & infestations
KW - information technology
KW - thoracic medicine
UR - https://www.scopus.com/pages/publications/85080140146
U2 - 10.1136/bmjopen-2019-033898
DO - 10.1136/bmjopen-2019-033898
M3 - 文章
C2 - 32102816
AN - SCOPUS:85080140146
SN - 2044-6055
VL - 10
JO - BMJ Open
JF - BMJ Open
IS - 2
M1 - e033898
ER -