Claim Missing Document
Check
Articles

Found 3 Documents
Search
Journal : Indonesian Journal of Electrical Engineering and Computer Science

Performance analysis of 10 machine learning models in lung cancer prediction Zapata-Paulini, Joselyn; Cabanillas-Carbonell, Michael
Indonesian Journal of Electrical Engineering and Computer Science Vol 37, No 2: February 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v37.i2.pp1352-1364

Abstract

Lung cancer is one of the diseases with the highest incidence and mortality in the world. Machine learning (ML) models can play an important role in the early detection of this disease. This study aims to identify the ML algorithm that has the best performance in predicting lung cancer. The algorithms that were contrasted were logistic regression (LR), decision tree (DT), k-nearest neighbors (KNN), gaussian Naive Bayes (GNB), multinomial Naive Bayes (MNB), support vector classifier (SVC), random forest (RF), extreme gradient boosting (XGBoost), multilayer perceptron (MLP) and gradient boosting (GB). The dataset used was provided by Kaggle, with a total of 309 records and 16 attributes. The study was developed in several phases, such as the description of the ML models and the analysis of the dataset. In addition, the contrast of the models was performed under the metrics of specificity, sensitivity, F1 count, accuracy, and precision. The results showed that the SVC, RF, MLP, and GB models obtained the best performance metrics, achieving 98% accuracy, 98% precision, and 98% sensitivity.
Evaluation of machine learning algorithms in the early detection of Parkinson's disease: a comparative study Zapata-Paulini, Joselyn; Cabanillas-Carbonell, Michael
Indonesian Journal of Electrical Engineering and Computer Science Vol 35, No 1: July 2024
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v35.i1.pp222-237

Abstract

Parkinson's is a neurodegenerative disease that generally affects people over 60 years of age. The disease destroys neurons and increases the accumulation of α-synuclein in many parts of the brain stem, although at present its causes remain unknown. It is therefore a priority to identify a method that can detect the disease, and this is where machine learning models become important. This study aims to perform a comparative analysis of machine learning models focused on the early detection of Parkinson's disease. Logistic regression (LR), support vector machines (SVM), decision trees (DT), extra trees classifiers (ETC), K-nearest neighbors (KNN), random forests (RF), adaptive boosting (AdaBoost) and gradient boosting (GB) algorithms are described and developed to identify the one that offers the best performance. In the training stage, we used the Oxford University dataset for Parkinson's disease detection, which has a total of 23 attributes and 195 records on patient voice recordings. The article is structured into six sections, such as introduction, related work, methodology, results, discussions, and conclusions. The metrics of accuracy, sensitivity, F1 count, and precision were used to measure the models' performance. The results position the KNN model as the best predictor with 95% accuracy, precision, sensitivity, and F1 score.
Seeking best performance: a comparative evaluation of machine learning models in the prediction of hepatitis C Cabanillas-Carbonell, Michael; Zapata-Paulini, Joselyn
Indonesian Journal of Electrical Engineering and Computer Science Vol 39, No 1: July 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v39.i1.pp374-386

Abstract

Hepatitis C is a disease that affects millions of people worldwide. It is spread through contact with contaminated blood through injections, transfusions, or other means. It is estimated that with early detection patients have a higher rate of recovery. The objective of this study is to perform a comparative evaluation of different models focused on the prediction of hepatitis C, to determine which of the models offers better performance in accuracy, precision, and sensitivity. The models used were logistic regression (LR), random forest (RF), K-nearest neighbors (KNN), decision tree (DT), and gradient boosting (GB), aimed at hepatitis C prediction. The training of the models was carried out using a dataset composed of 615 records, which incorporate 14 attributes. The structure of the article is divided into six sections, including introduction, review of related articles, methodology, results, discussion, and conclusions. The performance of the models was evaluated through metrics such as accuracy, sensitivity, F1 count, and, mainly, precision. The results obtained place the DT model as the most efficient predictor, reaching a precision, accuracy, sensitivity, and F1-score of 95%.