Predicting metastasis in gastric cancer patients: machine learning-based approaches

Talebi, A., Celis‑Morales, C., Borumandnia, N., Abbasi, S., Pourhoseingholi, M. A., Akbari, A. and Yousefi, J. (2023) Predicting metastasis in gastric cancer patients: machine learning-based approaches. Scientific Reports, 13, 4163. (doi: 10.1038/s41598-023-31272-w) (PMID:36914697) (PMCID:PMC10011363)

Text: 294263.pdf - Published Version (2MB)
Available under License Creative Commons Attribution.

Abstract

Gastric cancer (GC), with a 5-year survival rate of less than 40%, is the fourth leading cause of cancer-related mortality worldwide. This study aims to develop predictive models using different machine learning (ML) classifiers, based on both demographic and clinical variables, to predict the metastasis status of patients with GC. The data comprised 733 GC patients diagnosed at Taleghani tertiary hospital, divided into training and test sets at a ratio of 8:2. To predict metastasis in GC, ML-based algorithms, including Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (RT) and Logistic Regression (LR), were applied with 5-fold cross validation. To assess model performance, the F1 score, precision, sensitivity, specificity, area under the receiver operating characteristic (ROC) curve (AUC) and precision-recall AUC (PR-AUC) were obtained. Of the 733 patients with GC, 262 (36%) experienced metastasis. Although all models performed well, the indices of the SVM model seem to be the most appropriate (training set: AUC: 0.94, Sensitivity: 0.94; testing set: AUC: 0.85, Sensitivity: 0.92). NN comes next, with the higher AUC among the ML approaches (training set: AUC: 0.98; testing set: AUC: 0.86). The RF model, which identifies tumor size and age as the two essential variables, is considered the third most efficient model because of its higher specificity and AUC (84% and 87%, respectively). Based on demographic and clinical characteristics, ML approaches can predict the metastasis status of GC patients. According to AUC, sensitivity and specificity, both SVM and NN can be regarded as the better algorithms among the six applied ML-based methods.
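
As an illustration of the evaluation indices named in the abstract, the following is a minimal sketch of how sensitivity, specificity, precision, F1 score and ROC AUC can be computed for a binary metastasis label. It is not the authors' code: the function names, the 0.5 decision threshold and the toy labels and scores are all assumptions made for this example and have no connection to the study data.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn), treating label 1 (metastasis) as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute sensitivity, specificity, precision and F1 from hard predictions."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0  # recall on the positive class
    specificity = tn / (tn + fp) if tn + fp else 0.0  # recall on the negative class
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the probability that a random positive case receives a
    higher score than a random negative case (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present to compute AUC")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (made-up values, NOT study data): 1 = metastasis, 0 = no metastasis.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.2, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

With these toy inputs, each of the four threshold-based indices is 0.75 and the AUC is 0.9375. The pairwise AUC formula is quadratic in the number of cases, which is acceptable for a cohort of a few hundred patients; a library implementation would normally be preferred in practice.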

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Celis, Dr Carlos and Talebi, Dr Atefeh
Authors: Talebi, A., Celis‑Morales, C., Borumandnia, N., Abbasi, S., Pourhoseingholi, M. A., Akbari, A., and Yousefi, J.
College/School:College of Medical Veterinary and Life Sciences > School of Cardiovascular & Metabolic Health
Journal Name:Scientific Reports
Publisher:Nature Research
ISSN:2045-2322
ISSN (Online):2045-2322
Copyright Holders:Copyright © 2023 The Authors
First Published:First published in Scientific Reports 13(1):4163
Publisher Policy:Reproduced under a Creative Commons license
