Claim Missing Document
Check
Articles

Found 6 Documents
Search
Journal : Knowledge Engineering and Data Science

Comparison of Naïve Bayes Algorithm and Decision Tree C4.5 for Hospital Readmission Diabetes Patients using HbA1c Measurement Utomo Pujianto; Asa Luki Setiawan; Harits Ar Rosyid; Ali M. Mohammad Salah
Knowledge Engineering and Data Science Vol 2, No 2 (2019)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (655.467 KB) | DOI: 10.17977/um018v2i22019p58-71

Abstract

Diabetes is a metabolic disorder disease in which the pancreas does not produce enough insulin or the body cannot use insulin produced effectively. The HbA1c examination, which measures the average glucose level of patients during the last 2-3 months, has become an important step to determine the condition of diabetic patients. Knowledge of the patient's condition can help medical staff to predict the possibility of patient readmissions, namely the occurrence of a patient requiring hospitalization services back at the hospital. The ability to predict patient readmissions will ultimately help the hospital to calculate and manage the quality of patient care. This study compares the performance of the Naïve Bayes method and C4.5 Decision Tree in predicting readmissions of diabetic patients, especially patients who have undergone HbA1c examination. As part of this study we also compare the performance of the classification model from a number of scenarios involving a combination of preprocessing methods, namely Synthetic Minority Over-Sampling Technique (SMOTE) and Wrapper feature selection method, with both classification techniques. The scenario of C4.5 method combined with SMOTE and feature selection method produces the best performance in classifying readmissions of diabetic patients with an accuracy value of 82.74 %, precision value of 87.1 %, and recall value of 82.7 %.
Comparison of Indonesian Imports Forecasting by Limited Period Using SARIMA Method Harits Ar Rosyid; Mutyara Whening Aniendya; Heru Wahyu Herwanto
Knowledge Engineering and Data Science Vol 2, No 2 (2019)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (565.881 KB) | DOI: 10.17977/um018v2i22019p90-100

Abstract

The development of Indonesia's imports fluctuate over years. Inability to anticipate such rapid changes can cause economic slump due to inappropriate policy. For instance, recent years imports in rice led to the extermination of rice reserves. The reason is to maintain the market price of rice in Indonesia. To overcome these changes, forecasting the amount of imports should assist the Government in determining the optimum policy. This can be done by utilizing an algorithm to forecast time series data, in this case the amount of imports in the next few months with a high degree of accuracy. This study uses data obtained from the official website of the Indonesian Ministry of Trade. Then, Seasonal Autoregressive Integrated Moving Average (SARIMA) method is applied to forecast the imports. This method is suitable for the interconnected dependent variables, as well as in forecasting seasonal data patterns. The results of the experiment showed that 6-period forecast is the most accurate results compared to forecasting by 16 and 24 periods. The research resulted in the best model, that is ARIMA (0, 1, 3)(0, 1, 1)12 produces forecasting with a MAPE value of 7.210 % or an accuracy rate of 92.790 %. By applying this imports forecast model, the government can have a forward strategic plans such as selectively imports products and carefully decide the amount of the incoming products to Indonesia. Hence, it could maintain or improve the economic condition where local businesses can grow confidently. 
Performance of Ensemble Classification for Agricultural and Biological Science Journals with Scopus Index Nastiti Susetyo Fanany Putri; Aji Prasetya Wibawa; Harits Ar Rosyid; Agung Bella Putra Utama; Wako Uriu
Knowledge Engineering and Data Science Vol 5, No 2 (2022)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v5i22022p137-142

Abstract

The ensemble method is considered an advanced method in both prediction and classification. The application of this method is estimated to have a more optimal output than the previous classification method. This article aims to determine the ensemble's performance to classify journal quartiles. The subject of agriculture was chosen because Indonesia is an agricultural country, and the interest of researchers in this field shows a positive response. The data is downloaded through the Scimago Journal and Country Rank with the accumulation in 2020. Labels have four classes: Q1, Q2, Q3, and Q4. The ensemble applied is Boosting and Bagging with Decision Tree (DT) and Gaussian Naïve Bayes (GNB) algorithms compiled from 2144 instances. The Boosting meta-ensembles used are Adaboost and XGBoost. From this study, the Bagging Decision Tree has the highest accuracy score at 71.36, followed by XGBoost Decision Tree with 69.51. The third is XGBoost Gaussian Naïve Bayes with 68.82, Adaboost Decision Tree with 60.42, Adaboost Gaussian Naïve Bayes with 58.2, and Bagging Gaussian Naïve Bayes with 56.12 results. This paper shows that the Bagging Decision Tree is the ensemble method that works optimally in this subject classification. This result suggests that the ensemble method can still fail to produce an ideal outcome that approaches the SJR system.
Can Multinomial Logistic Regression Predicts Research Group using Text Input? Harits Ar Rosyid; Aulia Yahya Harindra Putra; Muhammad Iqbal Akbar; Felix Andika Dwiyanto
Knowledge Engineering and Data Science Vol 5, No 2 (2022)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v5i22022p150-159

Abstract

While submitting proposals in SISINTA, students often confuse or falsely submit their proposals to the less relevant or incorrect research group. There are 13 research groups for the students to choose from. We proposed a text classification method to help students find the best research group based on the title and/or abstract. The stages in this study include data collection, preprocessing data, classification using Logistic Regression, and evaluation of the results. Three scenarios in research group classification are based on 1) title only, 2) abstract only, and 3) title and abstract. Based on the experiments, research group classification using title-only input is the best overall. This scenario gets the most optimal results with accuracy, precision, recall, and f1-score successively at 63.68%, 64.91%, 63.68%, and 63.46%. This result is sufficient to help students find the best research group based on the text titles. In addition, lecturers can comment more elaborately since the proposals are relevant to the research group’s scope.
Optimizing Random Forest Algorithm to Classify Player's Memorisation via In-game Data Akmal Vrisna Alzuhdi; Harits Ar Rosyid; Mohammad Yasser Chuttur; Shah Nazir
Knowledge Engineering and Data Science Vol 6, No 1 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i12023p103-113

Abstract

Assessment of a player's knowledge in game education has been around for some time. Traditional evaluation in and around a gaming session may disrupt the players' immersion. This research uses an optimized Random Forest to construct a non-invasive prediction of a game education player's Memorization via in-game data. Firstly, we obtained the dataset from a 3-month survey to record in-game data of 50 players who play 4-15 game stages of the Chem Fight (a test case game). Next, we generated three variants of datasets via the preprocessing stages: resampling method (SMOTE), normalization (min-max), and a combination of resampling and normalization. Then, we trained and optimized three Random Forest (RF) classifiers to predict the player's Memorization. We chose RF because it can generalize well given the high-dimensional dataset. We used RF as the classifier, subject to optimization using its hyperparameter: n_estimators. We implemented a Grid Search Cross Validation (GSCV) method to identify the best value of  n_estimators. We utilized the statistics of GSCV results to reduce the weight of  n_estimators by observing the region of interest shown by the graphs of performances of the classifiers. Overall, the classifiers fitted using the BEST n_estimators (i.e., 89, 31, 89, and 196 trees) from GSCV performed well with around 80% accuracy. Moreover, we successfully identified the smaller number of n_estimators (OPTIMAL), at least halved the BEST  n_estimators. All classifiers were retrained using the OPTIMAL  n_estimators (37, 12, 37, and 41 trees). We found out that the performances of the classifiers were relatively steady at ~80%. This means that we successfully optimized the Random Forest in predicting a player's Memorization when playing the Chem Fight game. An automated technique presented in this paper can monitor student interactions and evaluate their abilities based on in-game data. As such, it can offer objective data about the skills used.
Constructing Qur’an Recitation Classification using Alexnet Algorithm Rosyid, Harits Ar; Abdullah, Dzulkifli; Alqahtani, Mohammed S.
Knowledge Engineering and Data Science Vol 7, No 2 (2024)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v7i22024p152-163

Abstract

The growing demands for accurate and efficient methods in the Qur'an recitation classification highlight the limitations of existing models, particularly in assisting the memorization process. This study aims to address these challenges by implementing the AlexNet Convolutional Neural Network architecture, widely recognized for its effectiveness in image classification, to classify the Qur'an recitations using the Mel-Frequency Cepstral Coefficient (MFCC) as the feature extraction method. The research involves several stages, including data collection, preprocessing (audio segmentation by verse), data augmentation, feature extraction, and classification using the AlexNet architecture, followed by performance evaluation. Key results demonstrate that the combination of MFCC and AlexNet yields promising accuracy in classifying Surah Al-Ikhlas recitations, suggesting its potential application for automatic reading correction. This approach significantly improves over traditional methods, contributing to more effective tools for Qur'an memorization assistance. Future work could explore its application in other significant improvement contexts and address potential challenges related to varying audio quality.
Co-Authors Abdullah, Dzulkifli Achmad Iffad Adhilaga, Hanif Aditya Galih Sulaksono, Aditya Galih Agung Bella Putra Utama Agusta Rakhmat Taufani Ahmad Adi Prasetyo Aji Prasetya Wibawa Akmal Vrisna Alzuhdi Ali M. Mohammad Salah Alqahtani, Mohammed S. Amalia Amalia Anie Yulistyorini Anik Nur Handayani Ardi Anugerah Wicaksana Aripriharta - Asa Luki Setiawan Asfani, Khoirudin Ashar, Muhammad Aulia Yahya Harindra Putra Aya Sofia Mufti Azhar Ahmad Smaragdina Azizah, Desi Fatkhi Brillianta Zayyan Muhammad Danang Rahmat Bachtiar Denny Kurniawan Diederik Rousseau Dyah Lestari Edwin Meinardi Trianto Elfonda Daffa Risqullah Elmiyadi Novia Farma Esther Irawati Setiawan Fajariani, Erna Fatma Yuniardini Fauzi, Rochmad Febrianto Alqodri Felix Andika Dwiyanto Ferdinand, Miftakhul Anggita Bima Gunawan Gunawan Gunawan Hakkun Elmunsyah Hartarto Junaedi Hendrawan Armanto Herman Thuan To Saurik Heru Wahyu Herwanto Joumil Aidil Saifuddin Khoiruddin Asfanie Khurin Nabila Kumalasari, Ira Kusuma Refa Haratama Liang, Yeoh Wen Lucyta Qutsyaning Rosydah M Baharuddin Yusuf Mohammad Musthofa Al Ansyorie Mohammad Yasser Chuttur Mokhtar , Norrima Binti Muchamad Andis Setiawan Muhammad Akbar Muhammad Iqbal Akbar Muhammad Naufal Farras Muladi Mursyit, Mohammad Mutyara Whening Aniendya Nastiti Susetyo Fanany Putri Novian Dwi syahrizal Hilmi Nur A’yuni Ramadhani Nur Hidayatullah Nur Sa’ida Kismurdiani Prasetyo, Ahmad Adi Prawidya, Della Murbarani Rahadyan Fannani Arif Sari, Tenty Luay Setumin , Samsul Shah Nazir Siti Sendari Suparman Syaad Patmanthara Teguh Andriyanto, Teguh Theodora Monica Timothy John Pattiasina Tinesa Fara Prihandini Utomo Pujianto Wahyu Irianto Wako Uriu Wiryawan, Muhammad Zaki Yudhistira, Moch Rajendra Yusmanto, Yunan Zaeni, Ilham Ari Elbaith