Zapata-Paulini, Joselyn
Unknown Affiliation

Published : 4 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 4 Documents
Search

Performance analysis of 10 machine learning models in lung cancer prediction Zapata-Paulini, Joselyn; Cabanillas-Carbonell, Michael
Indonesian Journal of Electrical Engineering and Computer Science Vol 37, No 2: February 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v37.i2.pp1352-1364

Abstract

Lung cancer is one of the diseases with the highest incidence and mortality in the world. Machine learning (ML) models can play an important role in the early detection of this disease. This study aims to identify the ML algorithm that has the best performance in predicting lung cancer. The algorithms that were contrasted were logistic regression (LR), decision tree (DT), k-nearest neighbors (KNN), gaussian Naive Bayes (GNB), multinomial Naive Bayes (MNB), support vector classifier (SVC), random forest (RF), extreme gradient boosting (XGBoost), multilayer perceptron (MLP) and gradient boosting (GB). The dataset used was provided by Kaggle, with a total of 309 records and 16 attributes. The study was developed in several phases, such as the description of the ML models and the analysis of the dataset. In addition, the contrast of the models was performed under the metrics of specificity, sensitivity, F1 count, accuracy, and precision. The results showed that the SVC, RF, MLP, and GB models obtained the best performance metrics, achieving 98% accuracy, 98% precision, and 98% sensitivity.
Evaluation of machine learning algorithms in the early detection of Parkinson's disease: a comparative study Zapata-Paulini, Joselyn; Cabanillas-Carbonell, Michael
Indonesian Journal of Electrical Engineering and Computer Science Vol 35, No 1: July 2024
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v35.i1.pp222-237

Abstract

Parkinson's is a neurodegenerative disease that generally affects people over 60 years of age. The disease destroys neurons and increases the accumulation of α-synuclein in many parts of the brain stem, although at present its causes remain unknown. It is therefore a priority to identify a method that can detect the disease, and this is where machine learning models become important. This study aims to perform a comparative analysis of machine learning models focused on the early detection of Parkinson's disease. Logistic regression (LR), support vector machines (SVM), decision trees (DT), extra trees classifiers (ETC), K-nearest neighbors (KNN), random forests (RF), adaptive boosting (AdaBoost) and gradient boosting (GB) algorithms are described and developed to identify the one that offers the best performance. In the training stage, we used the Oxford University dataset for Parkinson's disease detection, which has a total of 23 attributes and 195 records on patient voice recordings. The article is structured into six sections, such as introduction, related work, methodology, results, discussions, and conclusions. The metrics of accuracy, sensitivity, F1 count, and precision were used to measure the models' performance. The results position the KNN model as the best predictor with 95% accuracy, precision, sensitivity, and F1 score.
Seeking best performance: a comparative evaluation of machine learning models in the prediction of hepatitis C Cabanillas-Carbonell, Michael; Zapata-Paulini, Joselyn
Indonesian Journal of Electrical Engineering and Computer Science Vol 39, No 1: July 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v39.i1.pp374-386

Abstract

Hepatitis C is a disease that affects millions of people worldwide. It is spread through contact with contaminated blood through injections, transfusions, or other means. It is estimated that with early detection patients have a higher rate of recovery. The objective of this study is to perform a comparative evaluation of different models focused on the prediction of hepatitis C, to determine which of the models offers better performance in accuracy, precision, and sensitivity. The models used were logistic regression (LR), random forest (RF), K-nearest neighbors (KNN), decision tree (DT), and gradient boosting (GB), aimed at hepatitis C prediction. The training of the models was carried out using a dataset composed of 615 records, which incorporate 14 attributes. The structure of the article is divided into six sections, including introduction, review of related articles, methodology, results, discussion, and conclusions. The performance of the models was evaluated through metrics such as accuracy, sensitivity, F1 count, and, mainly, precision. The results obtained place the DT model as the most efficient predictor, reaching a precision, accuracy, sensitivity, and F1-score of 95%.
Artificial intelligence applications in agriculture: a systematic review of literature Cabanillas-Carbonell, Michael; Zapata-Paulini, Joselyn
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 14, No 5: October 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v14.i5.pp3503-3519

Abstract

Artificial intelligence (AI) is transforming agriculture by offering innovative solutions to persistent challenges. This systematic literature review explores the most studied AI applications in agriculture, emphasizing crop management, agronomic decision-making, early detection of diseases and pests, and climate change adaptation. Using the preferred reporting items for systematic reviews and meta-analyses (PRISMA) methodology, 700 publications were retrieved from databases such as Scopus, ScienceDirect, and IEEE Xplore, with 104 relevant articles selected after applying strict inclusion and exclusion criteria. The findings underscore the importance of machine learning and image processing in tailoring agronomic practices to specific plot conditions and microclimates. These tools enable early identification and control of plant diseases and pests, reducing crop losses and dependence on chemicals. Nonetheless, challenges remain, particularly regarding accessibility for smallholder farmers, high implementation costs, and limited data infrastructure. While AI offers significant potential to enhance agricultural productivity, sustainability, and resilience, addressing these limitations is crucial. A balanced, inclusive approach is essential to ensure AI’s benefits are widely distributed and contribute to long-term food security and environmental sustainability.