p-Index From 2020 - 2025
6.343
P-Index
This Author published in this journals
All Journal Jurnal Teknologi dan Manajemen Informatika TEKNOLOGI: Jurnal Ilmiah Sistem Informasi TELKOMNIKA (Telecommunication Computing Electronics and Control) Jurnal Ilmiah Kursor Register: Jurnal Ilmiah Teknologi Sistem Informasi Jurnal Teknologi dan Sistem Komputer Jurnal ELTIKOM : Jurnal Teknik Elektro, Teknologi Informasi dan Komputer INTEGER: Journal of Information Technology Teknika: Engineering and Sains Journal Knowledge Engineering and Data Science JICTE (Journal of Information and Computer Technology Education) SMARTICS Journal Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Konvergensi Jurnal Sisfokom (Sistem Informasi dan Komputer) INTECOMS: Journal of Information Technology and Computer Science Antivirus : Jurnal Ilmiah Teknik Informatika Journal of Information System,Graphics, Hospitality and Technology Jutisi: Jurnal Ilmiah Teknik Informatika dan Sistem Informasi Jurnal Teknologi Informasi dan Terapan (J-TIT) Jurnal Teknika Teknika Journal of Electrical Engineering and Computer (JEECOM) Best : Journal of Applied Electrical, Science and Technology Insyst : Journal of Intelligent System and Computation J-Intech (Journal of Information and Technology) Joutica : Journal of Informatic Unisla Jurnal Nasional Teknik Elektro dan Teknologi Informasi Insand Comtech : Information Science and Computer Technology Journal Jurnal Indonesia Sosial Teknologi JEECS (Journal of Electrical Engineering and Computer Sciences) Eksplorasi Teknologi Enterprise & Sistem Informasi (EKSTENSI) EduTech Journal
Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Jurnal Nasional Teknik Elektro dan Teknologi Informasi

Self-Training Naive Bayes Berbasis Word2Vec untuk Kategorisasi Berita Bahasa Indonesia Joan Santoso; Agung Dewa Bagus Soetiono; Gunawan; Endang Setyati; Eko Mulyanto Yuniarno; Mochamad Hariadi; Mauridhi Hery Purnomo
Jurnal Nasional Teknik Elektro dan Teknologi Informasi Vol 7 No 2: Mei 2018
Publisher : Departemen Teknik Elektro dan Teknologi Informasi, Fakultas Teknik, Universitas Gadjah Mada

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (1455.318 KB)

Abstract

News as one kind of information that is needed in daily life has been available on the internet. News website often categorizes their articles to each topic to help users access the news more easily. Document classification has widely used to do this automatically. The current availability of labeled training data is insufficient for the machine to create a good model. The problem in data annotation is that it requires a considerable cost and time to get sufficient quantity of labeled training data. A semi-supervised algorithm is proposed to solve this problem by using labeled and unlabeled data to create classification model. This paper proposes semi-supervised learning news classification system using Self-Training Naive Bayes algorithm. The feature that is used in text classification is Word2Vec Skip-Gram Model. This model is widely used in computational linguistics or text mining research as one of the methods in word representation. Word2Vec is used as a feature because it can bring the semantic meaning of the word in this classification task. The data used in this paper consists of 29,587 news documents from Indonesian online news websites. The Self-Training Naive Bayes algorithm achieved the highest F1-Score of 94.17%.