Building of Informatics, Technology and Science
Vol 4 No 4 (2023): March 2023

Text Classification of Indonesian Translated Hadith Using XGBoost Model and Chi-Square Feature Selection

Putri, Dita Julaika (Unknown)
Dwifebri, Mahendra (Unknown)
Adiwijaya, Adiwijaya (Unknown)



Article Info

Publish Date
29 Mar 2023

Abstract

Aside from the Holy Qur'an, Hadith is indeed a life guide that every Muslims in this world must follow. The technology for classifying texts and sentences, including categorizing hadiths, is evolving in tandem with the advancement of the times. The model used to perform classification has also been developed and optimized such as the use of the XGBoost algorithm which is more optimized than the previous tree algorithm. This can also make it easier for us as Muslims to study hadiths by categorizing them according to recommendations, prohibitions, and information. This study conducted text classification of Indonesian translations of hadith texts based on recommendations, prohibitions, and information using the XGBoost algorithm, TF-IDF for its feature extraction, and Chi-Square for its feature selection. In this study, experiments were carried out by changing the order of the preprocessing process for the stopword removal and stemming parts, performing the classification process with and without using chi-square as a feature selection, and adding parameter value during the modeling process with XGBoost and the highest final results obtained were 79% for accuracy, 79% for precision, 78% for recall and 78% for F1-score.

Copyrights © 2023






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...