Claim Missing Document
Check
Articles

Found 6 Documents
Search
Journal : Infolitika Journal of Data Science

Machine Learning Approach for Diabetes Detection Using Fine-Tuned XGBoost Algorithm Maulana, Aga; Faisal, Farassa Rani; Noviandy, Teuku Rizky; Rizkia, Tatsa; Idroes, Ghazi Mauer; Tallei, Trina Ekawati; El-Shazly, Mohamed; Idroes, Rinaldi
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.72

Abstract

Diabetes is a chronic condition characterized by elevated blood glucose levels which leads to organ dysfunction and an increased risk of premature death. The global prevalence of diabetes has been rising, necessitating an accurate and timely diagnosis to achieve the most effective management. Recent advancements in the field of machine learning have opened new possibilities for improving diabetes detection and management. In this study, we propose a fine-tuned XGBoost model for diabetes detection. We use the Pima Indian Diabetes dataset and employ a random search for hyperparameter tuning. The fine-tuned XGBoost model is compared with six other popular machine learning models and achieves the highest performance in accuracy, precision, sensitivity, and F1-score. This study demonstrates the potential of the fine-tuned XGBoost model as a robust and efficient tool for diabetes detection. The insights of this study advance medical diagnostics for efficient and personalized management of diabetes.
ANFIS-Based QSRR Modelling for Kovats Retention Index Prediction in Gas Chromatography Idroes, Rinaldi; Noviandy, Teuku Rizky; Maulana, Aga; Suhendra, Rivansyah; Sasmita, Novi Reandy; Muslem, Muslem; Idroes, Ghazi Mauer; Jannah, Raudhatul; Afidh, Razief Perucha Fauzie; Irvanizam, Irvanizam
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.73

Abstract

This study aims to evaluate the implementation and effectiveness of the Adaptive Neuro-Fuzzy Inference System (ANFIS) based Quantitative Structure Retention Relationship (QSRR) to predict the Kovats retention index of compounds in gas chromatography. The model was trained using 340 essential oil compounds and their molecular descriptors. The evaluation of the ANFIS models revealed promising results, achieving an R2 of 0.974, an RMSE of 48.12, and an MAPE of 3.3% on the testing set. These findings highlight the ANFIS approach as remarkably accurate in its predictive capacity for determining the Kovats retention index in the context of gas chromatography. This study provides valuable perspectives on the efficiency of retention index prediction through ANFIS-based QSRR methods and the potential practicality in compound analysis and chromatographic optimization.
Ensemble Machine Learning Approach for Quantitative Structure Activity Relationship Based Drug Discovery: A Review Noviandy, Teuku Rizky; Maulana, Aga; Idroes, Ghazi Mauer; Emran, Talha Bin; Tallei, Trina Ekawati; Helwani, Zuchra; Idroes, Rinaldi
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.91

Abstract

This comprehensive review explores the pivotal role of ensemble machine learning techniques in Quantitative Structure-Activity Relationship (QSAR) modeling for drug discovery. It emphasizes the significance of accurate QSAR models in streamlining candidate compound selection and highlights how ensemble methods, including AdaBoost, Gradient Boosting, Random Forest, Extra Trees, XGBoost, LightGBM, and CatBoost, effectively address challenges such as overfitting and noisy data. The review presents recent applications of ensemble learning in both classification and regression tasks within QSAR, showcasing the exceptional predictive accuracy of these techniques across diverse datasets and target properties. It also discusses the key challenges and considerations in ensemble QSAR modeling, including data quality, model selection, computational resources, and overfitting. The review outlines future directions in ensemble QSAR modeling, including the integration of multi-modal data, explainability, handling imbalanced data, automation, and personalized medicine applications while emphasizing the need for ethical and regulatory guidelines in this evolving field.
Predicting Obesity Levels with High Accuracy: Insights from a CatBoost Machine Learning Model Maulana, Aga; Afidh, Razief Perucha Fauzie; Maulydia, Nur Balqis; Idroes, Ghazi Mauer; Rahimah, Souvia
Infolitika Journal of Data Science Vol. 2 No. 1 (2024): May 2024
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v2i1.195

Abstract

This study aims to develop a machine learning model using the CatBoost algorithm to predict obesity based on demographic, lifestyle, and health-related features and compare its performance with other machine learning algorithms. The dataset used in this study, containing information on 2,111 individuals from Mexico, Peru, and Colombia, was used to train and evaluate the CatBoost model. The dataset included gender, age, height, weight, eating habits, physical activity levels, and family history of obesity. The model's performance was assessed using accuracy, precision, recall, and F1-score and compared to logistic regression, K-nearest neighbors (KNN), random forest, and naive Bayes algorithms. Feature importance analysis was conducted to identify the most influential factors in predicting obesity levels. The results indicate that the CatBoost model achieved the highest accuracy at 95.98%, surpassing other models. Furthermore, the CatBoost model demonstrated superior precision (96.08%), recall (95.98%), and F1-score (96.00%). The confusion matrix revealed that the model accurately predicted the majority of instances in each obesity level category. Feature importance analysis identified weight, height, and gender as the most influential factors in predicting obesity levels, followed by dietary habits, physical activity, and family history of overweight. The model's high accuracy, precision, recall, and F1-score and ability to handle categorical variables effectively make it a valuable tool for obesity risk assessment and classification. The insights gained from the feature importance analysis can guide the development of targeted obesity prevention and management strategies, focusing on modifiable risk factors such as diet and physical activity. While further validation on diverse populations is necessary, the CatBoost model's results demonstrate its potential to support clinical decision-making and inform public health initiatives in the fight against the global obesity epidemic.
Artificial Neural Network–Particle Swarm Optimization Approach for Predictive Modeling of Kovats Retention Index in Essential Oils Kurniadinur, Kurniadinur; Noviandy, Teuku Rizky; Idroes, Ghazi Mauer; Ahmad, Noor Atinah; Irvanizam, Irvanizam; Subianto, Muhammad; Idroes, Rinaldi
Infolitika Journal of Data Science Vol. 2 No. 2 (2024): November 2024
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v2i2.220

Abstract

The Kovats retention index is a critical parameter in gas chromatography used for the identification of volatile compounds in essential oils. Traditional methods for determining the Kovats retention index are often labor-intensive, time-consuming, and prone to inaccuracies due to variations in experimental conditions. This study presents a novel approach combining Artificial Neural Networks (ANN) with Particle Swarm Optimization (PSO) to predict the Kovats retention index of essential oil compounds more accurately and efficiently. The ANN-PSO hybrid model leverages the strengths of both techniques: the ANN's capacity to model complex nonlinear relationships and PSO's capability to optimize hyperparameters by finding the global optimum. The model was trained using a dataset of 340 essential oil compounds with molecular descriptors, with the performance evaluated based on Root Mean Squared Error (RMSE) and Mean Absolute Percentage Error (MAPE). Results indicate that a simpler ANN configuration with one hidden neuron achieved the lowest RMSE (80.16) and MAPE (5.65%), suggesting that the relationship between the molecular descriptors and the Kovats retention index is not overly complex. This study demonstrates that the ANN-PSO model can serve as an effective tool for predictive modeling of the Kovats retention index, reducing the need for experimental procedures and improving analytical efficiency in essential oil research.
Explainable Deep Learning with Lightweight CNNs for Tuberculosis Classification Noviandy, Teuku Rizky; Idroes, Ghazi Mauer; Zulfikar, Teuku; Idroes, Rinaldi
Infolitika Journal of Data Science Vol. 3 No. 1 (2025): May 2025
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v3i1.305

Abstract

Tuberculosis (TB) remains a major global health threat, particularly in low-resource settings where timely diagnosis is critical yet often limited by the lack of radiological expertise. Chest X-rays (CXRs) are widely used for TB screening, but manual interpretation is prone to errors and variability. While deep learning has shown promise in automating CXR analysis, most existing models are computationally intensive and lack interpretability, limiting their deployment in real-world clinical environments. To address this gap, we evaluated three lightweight and explainable CNN architectures, ShuffleNetV2, SqueezeNet 1.1, and MobileNetV3, for binary TB classification using a locally sourced dataset of 3,008 CXR images. Using transfer learning and Grad-CAM for visual explanation, we show that MobileNetV3 and ShuffleNetV2 achieved perfect test performance with 100% accuracy, sensitivity, specificity, precision, and F1-score, along with AUC scores of 1.00 and inference times of 94.66 and 103.63 seconds, respectively. SqueezeNet performed moderately, with a lower F1-score of 82.98% and several misclassifications. These results demonstrate that lightweight CNNs can deliver high diagnostic accuracy and transparency, supporting their use in scalable, AI-assisted TB screening systems for underserved healthcare settings.
Co-Authors Abas, Abdul Hawil Abd Rahman, Sunarti Ahmad, Noor Atinah Akmal Muhni Alfizar Alfizar Ali Bakri Anggi, Tiara Aprianto . Arkadinata, Teguh Asep Rusyana Azhar, Fauzul Bachtiar, Boy Muhclis Bahri, Ridzky Aulia Bako, Winanda Celik, Ismail Diah, Muhammad Diana Setya Ningsih, Diana Diana Setya Ningsih, Diana Setya Diki, Diki Eko Suhartono El-Shazly, Mohamed Emran, Talha Bin Erkata Yandri Faisal, Farassa Rani Fajar Fakri Fauziah, Niken Fazli, Qalbin Salim Hafni Zahara Harahap, Saima Putri Harera, Cheariva Firsa Hewindati, Yuni Tri Hizir Sofyan Idroes, Ghalieb Mutig Ifandi, Ilham Imelda, Eva Irvanizam, Irvanizam Irwana, Salman Jainury, Aldi Jauna, Jauna Kemala, Pati Khairan Khairan Khalijah Awang Kurniadinur, Kurniadinur Kusumo, Fitranto Lala, Andi Lukman Hakim Maria Paristiowati Marwan Marwan Maulana, Aga Maulydia, Nur Balqis Maysarah, Hilda Medyan Riza Mirda, Erisna Mirja, Mirja Misbullah, Alim Muhammad Adam, Muhammad Muhammad Ichsan Muhammad Ichsan Muhammad Sabri Muhammad Subianto Muhammad Yanis Muhammad Yusuf Mukhlisuddin Ilyas Muliadi Ramli Muslem Muslem, Muslem Musvira, Intan Natasya Natasya Nizamuddin Nizamuddin Nova Yanti Pasyamei Rembune Kala Patwekar, Mohsina Prasetio, Rasi Purnama, M. Risky Putri Raisah Raisah, Putri Raudhatul Jannah Razief Perucha Fauzie Afidh Rinaldi Idroes Rizkia, Tatsa Sasmita, Novi Reandy Shofi, Shofi Siti Maulina Rukmana Souvia Rahimah Suhendra , Rivansyah Suhendra, Rivansyah Suhendrayatna Suhendrayatna Surna, Muhammad Ipan Susanna Susanna Syamsiar, Syamsiar Taufiq Karma Teuku Rizky Noviandy Teuku Zulfikar Tjut Chamzurni TRINA EKAWATI TALLEI Wahyuni, Srie Wangi, Putri Ayu Sekar Wildan Seni, Wildan Wiwik Handayani Yustiana Yustiana, Yustiana Zahriah, Zahriah Zuchra Helwani, Zuchra Zulkarnain Jalil