Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Journal of Applied Information, Communication and Technology (JAICT)

Classification of Type 2 Diabetes using Decission Tree Algorithm Ivandari, Ivandari; Maulana, Much. Rifqi; Al Karomi, M Adib
JAICT Vol. 8 No. 2 (2023)
Publisher : Politeknik Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32497/jaict.v8i2.4835

Abstract

Diabetes is a disease that causes many deaths. According to data from WHO, in 2019 there were 2 million deaths due to diabetes. The recording of the patient's condition has been carried out for medical purposes. The large number of records that are only used as stored data will only later become digital waste. Data mining offers a classification process to process data into new knowledge. The recognition of new patterns from existing data results from algorithmic calculation processes as well as statistics. This study uses the type 2 diabetes dataset from the uci repository which was released in 2020. Previous research was conducted using the KNN algorithm with an accuracy rate of 92.5%. For numerical datasets, the decision tree algorithm is proven to be superior and can represent it in a language that is easy for humans to understand. One of the best and widely used classification algorithms for high-dimensional datasets is the decision tree. The results showed that the accuracy of the decision tree algorithm for type 2 diabetes data classification was 95.96%. Another output of this study is a decision tree from the early stage diabetes risk prediction dataset.
Improving the Accuracy of the C45 Classification Algorithm Using Information Gain Ratio Feature Selection for Classification of Type 2 Diabetes Mellitus Disease Ivandari, Ivandari; Maulana, Much. Rifqi; Kurniawan, Ichwan; Al Karomi, M Adib
JAICT Vol. 9 No. 2 (2024)
Publisher : Politeknik Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32497/jaict.v9i2.5845

Abstract

Abstract”” Diabetes is a disease that can cause death. Diabetes can cause heart failure, chronic kidney disease, glaucoma that attacks the eyes and several other diseases. WHO data states that there were more than 2 million deaths due to diabetes in 2019. Data from the International Diabetes Federation shows that around 537 adults are recorded as living with diabetes. This condition must be treated immediately, considering that diabetes is one of the most deadly non-communicable diseases in the world. Patient registration is mostly done in hospitals. A lot of data will only become digital waste if it does not have more benefits. In 2020 Diabetes and Hospital in Sylhet donated patient data for further research. This data contains 520 patient records with 17 attributes that have been validated by specialist doctors. Early stage diabetes risk prediction data is released by the uci repository as public data and can be used for research testing. Research using this dataset has been widely carried out with the previous best accuracy level of 95.96%. In previous studies, all attributes were used in the classification process. The number of irrelevant attributes can affect the performance of the classification algorithm. This study uses the information gain ratio for feature selection of the early stage diabetes risk prediction dataset. The C45 algorithm is used for classification, evaluation using confusion matrix and validation using 10 folds cross validation. The results of this study improve the performance of C45 so that it obtains an accuracy level of 96.15%. This study also produces a decision tree for diabetes..