Claim Missing Document
Check
Articles

Found 3 Documents
Search
Journal : Journal of Computer Networks, Architecture and High Performance Computing

COMPARISON OF RANDOM FOREST AND SUPPORT VECTOR MACHINE ALGORITHMS IN THE CLASSIFICATION OF DYSPEPSIA DISEASE Zahra, Fathima; Ichsan, Aulia; Riyadi, Sugeng
Journal of Computer Networks, Architecture and High Performance Computing Vol. 7 No. 4 (2025): Articles Research October 2025
Publisher : Information Technology and Science (ITScience)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47709/cnahpc.v7i4.6874

Abstract

Functional dyspepsia remains a prevalent gastrointestinal disorder globally, with a higher burden in low- and middle-income countries such as Indonesia. Diagnostic challenges are exacerbated by limited healthcare infrastructure and a low ratio of gastroenterologists. Machine learning approaches offer a promising solution to enhance diagnostic consistency and accuracy in resource-limited settings. This study aims to compare the performance of the Random Forest (RF) and Support Vector Machine (SVM) algorithms in differentiating dyspepsia from gastroenteritis using Indonesian clinical data. A quantitative experimental method was applied using patient medical records, including gastrointestinal disease categories, vital signs, and symptom profiles. Data preprocessing was carried out by handling missing values through imputation and Min-Max scaling normalization. The dataset was divided into 80% training data and 20% testing data using stratified random sampling. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. Random Forest demonstrated superior performance on all evaluation metrics compared to SVM. RF achieved 86.5% accuracy, 86.0% precision, 85.0% recall, and 85.5% F1-score, while SVM achieved 83.2% accuracy, 83.0% precision, 81.0% recall, and 82.0% F1-score. The 3.3 percentage point improvement in accuracy and 4.0 percentage point improvement in recall are clinically significant. Random Forest proved more effective in dyspepsia classification, showing better handling of complex clinical data interactions and more reliable diagnostic performance. These findings support the implementation of an RF-based decision support system in Indonesian healthcare facilities to improve diagnostic consistency and patient outcomes.
Enhancing Multi-Layer Perceptron Performance with K-Means Clustering Pardede, Doughlas; Ichsan, Aulia; Riyadi, Sugeng
Journal of Computer Networks, Architecture and High Performance Computing Vol. 6 No. 1 (2024): Article Research Volume 6 Issue 1, January 2024
Publisher : Information Technology and Science (ITScience)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47709/cnahpc.v6i1.3600

Abstract

Machine learning plays a crucial role in identifying patterns within data, with classification being a prominent application. This study investigates the use of Multilayer Perceptron (MLP) classification models and explores preprocessing techniques, particularly K-Means clustering, to enhance model performance. Overfitting, a common challenge in MLP models, is addressed through the application of K-Means clustering to streamline data preparation and improve classification accuracy. The study begins with an overview of overfitting in MLP models, highlighting the significance of mitigating this issue. Various techniques for addressing overfitting are reviewed, including regularization, dropout, early stopping, data augmentation, and ensemble methods. Additionally, the complementary role of K-Means clustering in enhancing model performance is emphasized. Preprocessing using K-Means clustering aims to reduce data complexity and prevent overfitting in MLP models. Three datasets - Iris, Wine, and Breast Cancer Wisconsin - are employed to evaluate the performance of K-Means as a preprocessing technique. Results from cross-validation demonstrate significant improvements in accuracy, precision, recall, and F1 scores when employing K-Means clustering compared to models without preprocessing. The findings highlight the efficacy of K-Means clustering in enhancing the discriminative power of MLP classification models by organizing data into clusters based on similarity. These results have practical implications, underlining the importance of appropriate preprocessing techniques in improving classification performance. Future research could explore additional preprocessing methods and their impact on classification accuracy across diverse datasets, advancing the field of machine learning and its applications
Analysis of Logistic Regression Regularization in Wild Elephant Classification with VGG-16 Feature Extraction Ichsan, Aulia; Riyadi, Sugeng; Pardede, Doughlas
Journal of Computer Networks, Architecture and High Performance Computing Vol. 6 No. 2 (2024): Articles Research Volume 6 Issue 2, April 2024
Publisher : Information Technology and Science (ITScience)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47709/cnahpc.v6i2.3789

Abstract

The research article explores the intersection of image-based wildlife classification and logistic regression regularization, focusing on the classification of wild elephant species. It begins by highlighting the significance of ecological research in biodiversity monitoring and conservation and introduces Convolutional Neural Networks (CNNs) as potent tools for feature extraction from images. The VGG-16 model is particularly emphasized for its ability to capture hierarchical representations of visual features crucial for classification tasks. The integration of VGG-16 feature extraction with logistic regression regularization is proposed as a compelling approach, offering a balance between sophisticated feature representation and efficient classification algorithms. The literature review delves into image-based wildlife classification, emphasizing the role of CNNs, especially VGG-16, in extracting discriminative features. It discusses the fusion of VGG-16 features with logistic regression and the challenges in this field, such as dataset annotation and environmental variability. The method section outlines the dataset acquisition, feature extraction using the VGG-16 architecture, and model configuration using logistic regression with lasso and ridge regularization. The process of finding the optimal regularization parameter (lambda) and model evaluation through cross-validation is detailed. Results showcase the optimal lambda values for lasso and ridge regularization and compare the performance of logistic lasso and logistic ridge models. Misclassification analysis reveals factors influencing classification accuracy, including feature variability and contextual complexity. The discussion reflects on the implications of the findings, emphasizing the importance of lambda selection and addressing challenges in wildlife classification. It suggests avenues for further research, such as advanced modeling techniques and feature engineering approaches. In conclusion, the study contributes to advancing wildlife classification efforts by leveraging state-of-the-art techniques and sheds light on opportunities to enhance classification accuracy in wildlife conservation.