Building of Informatics, Technology and Science
Vol 3 No 4 (2022): March 2022

A Multi-Label Classification of Al-Quran Verses Using Ensemble Method and Naïve Bayes

Choirulfikri, Muhammad Rizqi (Unknown)
Lhaksamana, Kemas Muslim (Unknown)
Faraby, Said Al (Unknown)



Article Info

Publish Date
31 Mar 2022

Abstract

Al-Quran is the holy book as a guide and also a source of law for muslims. Thus, understanding and studying Al-Quran is very important for muslims. To make it easier for muslims to understand and study the Qur'an, it is necessary to classify the verses of the Al-Qur'an. This study built a system that can perform multi-label classification of Al-Quran verses. Multi-label means that the classification will divide each verse of the Al-Quran into more than 1 topic. The model is built using the ensemble method by combining several Naïve Bayes algorithms. The ensemble method was chosen because research with different datasets can obtain good performance. The naïve Bayes algorithm was also chosen because it has a simple calculation so it requires a fairly short computation time. The preprocessing step is also carried out to see the comparison of performance results. To measure the performance of the system that has been built, the calculation of hamming loss is used. Based on the experimental results with several testing scenarios, the best performance results are obtained by combining Multinomial NB and Bernoulli NB with a hamming loss value of 0.1167. Thus, the use of the ensemble method can improve performance compared to without the ensemble method. This research can also of course build a multi-label classification model for the verses of Al-Quran with the ensemble method

Copyrights © 2022






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...