Using Machine Learning to Predict the Future Development of Disease

Miao, L., Guo, X., Abbas, H. T. , Qaraqe, K. A. and Abbasi, Q. H. (2020) Using Machine Learning to Predict the Future Development of Disease. In: 5th International Conference on the UK-China Emerging Technologies (UCET 2020), Glasgow, UK, 20-21 Aug 2020, ISBN 9781728194882 (doi:10.1109/UCET51115.2020.9205373)

[img] Text
221489.pdf - Accepted Version



The objective of this research is to develop a longterm risk model for the development of cardiovascular disease (CVD) because of type-2 diabetes (T2D). We use the support vector machine (SVM) and the K-nearest neighbours algorithms on the dataset collected from a longitudinal study called Framingham Heart Study, to develop the prediction models. The dataset was first balanced by the Synthetic Minority Oversampling Technique algorithm. The SVM algorithm was then used to train the model, and after tuning the parameters and training for 1000 times, the average accuracy to correctly predict the prevalence of CVD due to T2D came out as 96.5% and the average recall rate was 89.8%. Similarly, we also applied the KNN algorithm to train the dataset, and the recall rate even reaches 92.9%. The advantages of our model are: 1) it can predict with high accuracy both the risk of development of T2D and CVD simultaneously; 2) it can be used without the expensive and tedious oral glucose tolerance test. The model yielded high-performance results after training on the Framingham Heart Study dataset.

Item Type:Conference Proceedings
Glasgow Author(s) Enlighten ID:Abbas, Dr Hasan and Abbasi, Dr Qammer
Authors: Miao, L., Guo, X., Abbas, H. T., Qaraqe, K. A., and Abbasi, Q. H.
College/School:College of Science and Engineering > School of Engineering > Electronics and Nanoscale Engineering
Copyright Holders:Copyright © 2020 IEEE
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher
Related URLs:

University Staff: Request a correction | Enlighten Editors: Update this record