International Journal of Advances in Intelligent Informatics
Vol 6, No 3 (2020): November 2020

Predicting breast cancer recurrence using principal component analysis as feature extraction: an unbiased comparative analysis

Zuhaira Muhammad Zain (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Mona Alshenaifi (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Abeer Aljaloud (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Tamadhur Albednah (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Reham Alghanim (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Alanoud Alqifari (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)
Amal Alqahtani (College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University)



Article Info

Publish Date
06 Nov 2020

Abstract

Breast cancer recurrence is among the most noteworthy fears faced by women. Nevertheless, with modern innovations in data mining technology, early recurrence prediction can help relieve these fears. Although medical information is typically complicated, and simplifying searches to the most relevant input is challenging, new sophisticated data mining techniques promise accurate predictions from high-dimensional data. In this study, the performances of three established data mining algorithms: Naïve Bayes (NB), k-nearest neighbor (KNN), and fast decision tree (REPTree), adopting the feature extraction algorithm, principal component analysis (PCA), for predicting breast cancer recurrence were contrasted. The comparison was conducted between models built in the absence and presence of PCA. The results showed that KNN produced better prediction without PCA (F-measure = 72.1%), whereas the other two techniques: NB and REPTree, improved when used with PCA (F-measure = 76.1% and 72.8%, respectively). This study can benefit the healthcare industry in assisting physicians in predicting breast cancer recurrence precisely.

Copyrights © 2020






Journal Info

Abbrev

IJAIN

Publisher

Subject

Computer Science & IT

Description

International journal of advances in intelligent informatics (IJAIN) e-ISSN: 2442-6571 is a peer reviewed open-access journal published three times a year in English-language, provides scientists and engineers throughout the world for the exchange and dissemination of theoretical and ...