Claim Missing Document
Check
Articles

Found 8 Documents
Search

Performance Analysis of the Decision Tree Classification Algorithm on the Water Quality and Potability Dataset Zaky, Umar; Naswin, Ahmad; Sumiyatun, Sumiyatun; Murdiyanto, Aris Wahyu
Indonesian Journal of Data and Science Vol. 4 No. 3 (2023): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v4i3.113

Abstract

Ensuring water potability is paramount for public health and safety. This research aimed to assess the efficacy of the Decision Tree classification algorithm in predicting water potability using the Water Quality and Potability dataset. Employing a 5-fold cross-validation technique, the model showcased a moderate performance with an average accuracy of approximately 54.33%. While the Decision Tree provides a baseline and interpretable mechanism for classification, the results emphasize the need for further exploration using more intricate models or ensemble methods. This study contributes to the broader effort of leveraging machine learning techniques for water quality assessment and provides insights into the potential and limitations of such models in predicting water safety
Evaluating Thresholding-Based Segmentation and Humoment Feature Extraction in Acute Lymphoblastic Leukemia Classification using Gaussian Naive Bayes Rismayanti, Nurul; Naswin, Ahmad; Zaky, Umar; Zakariyah, Muhammad; Purnamasari, Dwi Amalia
International Journal of Artificial Intelligence in Medical Issues Vol. 1 No. 2 (2023): International Journal of Artificial Intelligence in Medical Issues
Publisher : Yocto Brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijaimi.v1i2.99

Abstract

This study, titled "Evaluating Thresholding-Based Segmentation and HuMoment Feature Extraction in Acute Lymphoblastic Leukemia Classification using Gaussian Naive Bayes," investigates the application of image processing and machine learning techniques in the classification of Acute Lymphoblastic Leukemia (ALL). Utilizing a dataset of microscopic blood smear images, the research focuses on the efficacy of thresholding-based segmentation and Hu moment feature extraction in distinguishing between benign and malignant cases of ALL. Gaussian Naive Bayes, known for its simplicity and effectiveness, is employed as the classification algorithm. The study adopts a 5-fold cross-validation approach to evaluate the model's performance, with particular emphasis on metrics such as accuracy, precision, recall, and F1-score. Results indicate a high precision rate across all folds, averaging approximately 84.13%, while exhibiting variability in accuracy, recall, and F1-scores. These findings suggest that while the model is effective in identifying malignant cases, further refinements are necessary for improving overall accuracy and consistency. This research contributes to the field of medical image analysis by demonstrating the potential of combining simple yet efficient techniques for the automated diagnosis of hepatological diseases. It highlights the importance of integrating image processing with machine learning to enhance diagnostic accuracy in medical applications.
Classification of Mushroom Edibility Using K-Nearest Neighbors: A Machine Learning Approach Admojo, Fadhila Tangguh; Radhitya, Made Leo; Zein, Hamada; Naswin, Ahmad
Indonesian Journal of Data and Science Vol. 5 No. 3 (2024): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v5i3.199

Abstract

This study investigates the use of the K-Nearest Neighbors (KNN) algorithm for the binary classification of mushroom edibility using a cleaned version of the UCI Mushroom Dataset. The dataset underwent pre-processing techniques such as modal imputation, one-hot encoding, z-score normalization, and feature selection to ensure data quality. The model was trained on 80% of the dataset and evaluated on the remaining 20%, achieving an overall accuracy of 99%. Evaluation metrics, including precision, recall, and F1-score, confirmed the model's effectiveness in distinguishing between edible and poisonous mushrooms, with minimal misclassification errors. Despite its high performance, the study identified scalability as a limitation due to the computational complexity of KNN, suggesting that future research should explore alternative algorithms for enhanced efficiency. This research underscores the importance of pre-processing and hyperparameter optimization in building reliable classification models for food safety applications.
Pemetaan Lanskap Emosional Di Twitter: Visualisasi Sentimen Netral, Positif, dan Negatif Dengan Word Cloud mawaddah, Sitti; Naswin, Ahmad; Sulkifli, Sulkifli
RIGGS: Journal of Artificial Intelligence and Digital Business Vol. 4 No. 4 (2026): November - January
Publisher : Prodi Bisnis Digital Universitas Pahlawan Tuanku Tambusai

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31004/riggs.v4i4.3650

Abstract

Penelitian ini mengkaji penerapan teknik Word Clouds dalam ekstraksi sentimen pada platform Twitter, dengan fokus pada identifikasi kata dominan dalam cuitan yang dikategorikan sebagai netral, positif, dan negatif. Teknik visualisasi ini memungkinkan pemahaman yang lebih mendalam mengenai distribusi kata dan peranannya dalam representasi sentimen. Melalui Exploratory Data Analysis (EDA), penelitian ini berhasil memetakan karakteristik data, seperti tingginya kemiripan Jaccard untuk tweet netral dan pola distribusi panjang teks antara text dan selected_text. EDA juga mendasari penerapan aturan deterministik (rule-based gate) untuk menangani cuitan dengan kepastian tinggi dan membedakan kasus yang membutuhkan pendekatan berbasis model lebih lanjut, yaitu Named Entity Recognition (NER). Penerapan rekayasa fitur meta, termasuk pengukuran similarity Jaccard, selisih panjang kata, dan jumlah kata, berhasil meningkatkan presisi dalam pemetaan span sentimen pada teks mikro. Evaluasi model menggunakan similarity Jaccard menunjukkan performa yang sangat baik dengan akurasi 85%, diikuti oleh konsentrasi tinggi skor prediksi pada rentang 0,9–1,0. Temuan ini menegaskan bahwa pendekatan hibrida yang mengintegrasikan Word Clouds, analitik berbasis fitur, dan NER efektif dalam mengatasi tantangan kompleksitas bahasa informal di media sosial, seperti slang, elongation, dan ambiguity. Penelitian ini memberikan kontribusi signifikan dalam pengembangan metodologi sentiment extraction yang lebih presisi dan efisien untuk aplikasi Natural Language Processing (NLP) di platform sosial media
Machine Learning-Based Prediction of HIV/AIDS Infection and Treatment Effectiveness: A Clinical Dataset Analysis Jiwa Permana, Agus Aan; Wikranta Arsa, I Gusti Ngurah; Naswin, Ahmad; Sumiyatun
International Journal of Artificial Intelligence in Medical Issues Vol. 3 No. 2 (2025): International Journal of Artificial Intelligence in Medical Issues
Publisher : Yocto Brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijaimi.v3i2.362

Abstract

The early and accurate prediction of HIV/AIDS infection is critical to improving clinical decision-making and ensuring effective patient management. This study presents a comprehensive machine learning-based approach to predict HIV/AIDS infection status and evaluate the effectiveness of antiretroviral treatments using a well-documented clinical dataset from 1996, comprising 2,139 patient records and 34 features. Through rigorous preprocessing, exploratory data analysis, and feature engineering, several new clinically relevant attributes were constructed, such as CD4/CD8 ratios and immunological change metrics. Four machine learning models—Logistic Regression, Support Vector Machine, Random Forest, and Gradient Boosting—were trained and evaluated. Among these, the Gradient Boosting classifier achieved the highest ROC-AUC score of 0.9335, while Random Forest provided strong predictive performance with a ROC-AUC of 0.9180 and was selected for further evaluation due to its model transparency. Key features influencing infection prediction included CD4+ and CD8+ dynamics, baseline immunological levels, and treatment history. Additionally, the study examined treatment effectiveness by analyzing CD4+ cell count responses across different therapy types. The combination of ZDV and ddI emerged as the most effective regimen, improving immune outcomes and lowering infection rates, while ZDV monotherapy showed the least favorable results. This work underscores the potential of machine learning as a clinical decision support tool in HIV/AIDS care and provides data-driven insights into treatment optimization. Future studies should incorporate longitudinal patient data and real-world clinical environments for broader applicability.
Evaluating Service Quality Metrics with AdaBoost Classifier at Restaurant X Batubulan, Kadek Suarjuna; Pratama, I Putu Adi; Naswin, Ahmad
Jurnal Sistem Informasi dan Komputer Terapan Indonesia (JSIKTI) Vol 6 No 3 (2024): March
Publisher : INFOTEKS (Information Technology, Computer and Sciences)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33173/jsikti.234

Abstract

This paper explores the use of the AdaBoost classifier to evaluate service quality metrics in the restaurant industry, specifically at Restaurant X. The study focuses on how machine learning, particularly ensemble learning algorithms, can improve the understanding of customer satisfaction by analyzing various service attributes, such as food quality, staff behavior, wait times, and ambiance. By applying AdaBoost, the model combines multiple weak classifiers to create a stronger, more accurate prediction model that identifies key factors influencing customer experience. The research highlights the importance of real-time data and customer feedback in refining service quality metrics and suggests that incorporating sentiment analysis and other dynamic data sources can provide a more comprehensive view of customer satisfaction. The findings suggest that using machine learning algorithms, like AdaBoost, can enhance operational decision-making, improve customer service, and contribute to overall business success. Additionally, the study proposes the continuous updating of the model to reflect changing customer preferences and trends in the competitive food service industry. This approach can lead to better service, customer retention, and a strategic advantage for restaurants seeking to meet the evolving demands of the market.
ARIMA Model for Time Series Forecasting of Doge Coin Prices Batubulan, Kadek Suarjuna; Pratama, I Putu Adi; Naswin, Ahmad
Jurnal Sistem Informasi dan Komputer Terapan Indonesia (JSIKTI) Vol 7 No 1 (2024): September
Publisher : INFOTEKS (Information Technology, Computer and Sciences)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33173/jsikti.242

Abstract

The volatility and speculative nature of cryptocurrencies present significant challenges for accurate price forecasting. This study evaluates the performance of the AutoRegressive Integrated Moving Average (ARIMA) model in predicting Dogecoin (DOGE) prices based on historical data obtained from reputable cryptocurrency platforms such as Binance, Coinbase, and CoinGecko. The ARIMA(5,1,0) model demonstrated strong performance under stable market conditions, achieving a Mean Squared Error (MSE) of 0.0006656 and a Root Mean Squared Error (RMSE) of 0.0258, effectively capturing linear price trends. However, the model’s limitations in handling high volatility and non-linear dependencies—common characteristics of cryptocurrency markets—were also identified. To address these challenges, the study explores hybrid ARIMA–neural network models that integrate statistical and machine learning approaches, improving predictive accuracy during periods of market instability. The results suggest that while ARIMA provides a solid baseline for time series forecasting, hybrid and sentiment-aware models incorporating social media and blockchain metrics offer more robust and adaptive solutions for dynamic cryptocurrency markets.
Cataract Classification in Eye Images Using MobileNetV2 Batubulan, Kadek Suarjuna; Pratama, I Putu Adi; Naswin, Ahmad
Jurnal Sistem Informasi dan Komputer Terapan Indonesia (JSIKTI) Vol 8 No 2 (2025): December
Publisher : INFOTEKS (Information Technology, Computer and Sciences)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33173/jsikti.268

Abstract

Cataract remains one of the primary causes of visual impairment globally, with early detection being essential to prevent permanent blindness and improve patient quality of life. However, conventional diagnosis depends on ophthalmologists and clinical-grade imaging devices, which are often limited in remote or under-resourced areas. This condition highlights the need for an efficient, accessible, and automated screening solution. To address this challenge, this study utilizes the MobileNetV2 deep learning architecture to classify cataract conditions based on eye images. MobileNetV2 is selected because of its lightweight model structure and strong feature representation capabilities, making it suitable for deployment in portable or embedded medical systems. The dataset used consists of two cataract stages, namely immature and mature cataracts, with images undergoing preprocessing prior to model training. The proposed system demonstrates excellent performance, achieving an accuracy, precision, recall, and F1-score of 100% in distinguishing cataract stages. These results confirm that MobileNetV2 can effectively support cataract screening with high reliability while maintaining efficiency. Future work will involve extending the dataset to include additional cataract severity levels and non-cataract eye images, as well as integrating explainable artificial intelligence methods to provide visual diagnostic interpretations and enhance clinical trust in real-world applications.