This Author published in this journals
All Journal Jurnal Infra
Anthony Setiawan
Program Studi Informatika

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Klasifikasi Artikel Berita Bahasa Indonesia Dengan Naive Bayes Classifier Anthony Setiawan; Leo Willyanto Santoso; Rudy Adipranata
Jurnal Infra Vol 8, No 1 (2020)
Publisher : Jurnal Infra

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Human access to latest news now becoming more easier and much more, caused by advanced technological development in latest years. But, the article categorization is still manually inserted by the writer, so sometimes by human error, some mistake can be happening, like inserting wrong category or sometimes the writer purposely insert wrong category just because that category is so popular just to boost his viewer count. That’s why there is an application in the form of website to automatically categorizing the article that fit mostly to their its category.This application is using N-Gram feature and Naïve Bayes Classifier method to classifying news content. N-Gram feature is a feature that group words based on the amount of N, like unigram or bigram. Naïve Bayes Classifier is a method that using probability to solve some problem.According to the test using Naïve Bayes Classifier, in dataset training and test with ratio of 50 : 50, at unigram section the correct accuracy result are 0.901,  and the bigram result are 0.508. In dataset ratio of 60 : 40, at unigram section the correct accuracy result are 0.904, and the bigram result are 0.498. In dataset ratio of 70 : 30, at unigram section the correct accuracy result are 0.947, and the bigram result are 0.519. In dataset ratio of 80 : 20, at unigram section the correct accuracy result are 0.887, and the bigram result are 0.507. So, the conclusion is dataset training and test with ratio of 70 : 30 yield highest accuracy, in unigram (0.947) and also bigram (0.519).