Prediction of colon cancer stages and survival period with machine learning approach

Pushpanjali Gupta, Sum Fu Chiang, Prasan Kumar Sahoo*, Suvendu Kumar Mohapatra, Jeng Fu You, Djeane Debora Onthoni, Hsin Yuan Hung, Jy Ming Chiang, Yenlin Huang, Wen Sy Tsai

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

78 Scopus citations

Abstract

The prediction of tumor in the TNM staging (tumor, node, and metastasis) stage of colon cancer using the most influential histopathology parameters and to predict the five years disease-free survival (DFS) period using machine learning (ML) in clinical research have been studied here. From the colorectal cancer (CRC) registry of Chang Gung Memorial Hospital, Linkou, Taiwan, 4021 patients were selected for the analysis. Various ML algorithms were applied for the tumor stage prediction of the colon cancer by considering the Tumor Aggression Score (TAS) as a prognostic factor. Performances of different ML algorithms were evaluated using five-fold cross-validation, which is an effective way of the model validation. The accuracy achieved by the algorithms taking both cases of standard TNM staging and TNM staging with the Tumor Aggression Score was determined. It was observed that the Random Forest model achieved an F-measure of 0.89, when the Tumor Aggression Score was considered as an attribute along with the standard attributes normally used for the TNM stage prediction. We also found that the Random Forest algorithm outperformed all other algorithms, with an accuracy of approximately 84% and an area under the curve (AUC) of 0.82 ± 0.10 for predicting the five years DFS.

Original languageEnglish
Article number2007
JournalCancers
Volume11
Issue number12
DOIs
StatePublished - 12 2019

Bibliographical note

Publisher Copyright:
© 2019 by the authors. Licensee MDPI, Basel, Switzerland.

Keywords

  • Artificial intelligence
  • Colon cancer
  • Disease-free survival
  • Machine learning
  • Prediction
  • TNM staging

Fingerprint

Dive into the research topics of 'Prediction of colon cancer stages and survival period with machine learning approach'. Together they form a unique fingerprint.

Cite this