Journal of Computer Science and Informatics Engineering (J-Cosine)
Vol 6 No 2 (2022): December 2022

Term Weighting Based Indexing Class and Indexing Short Document for Indonesian Thesis Title Classification

Ana Tsalitsatun Ni'mah (Universitas Trunojoyo Madura)
Fahmi Syuhada (Universitas Qamarul Huda Badaruddin Bagu, West Nusa Tenggara)



Article Info

Publish Date
21 Dec 2022

Abstract

Document classification has become easier as newer methods produce increasingly strong results. Document classification with the TF-IDF-ICF term weighting method has been widely studied, but prior work has generally used long documents. When TF-IDF term weighting is applied to short text documents such as thesis titles, the classification results are less than ideal. The reason is that IDF assigns low weight to words that appear in many documents, and ICF assigns low weight to words that appear often in the classes, even though such words should carry high weight because they form the core of a short text document. This study therefore investigates term weighting based on class indexing and short-document indexing, namely TF-IDF-ICF-IDSF. Naïve Bayes and SVM are compared as classifiers. The dataset consists of thesis titles written by Informatics Education students at Trunojoyo Madura University. The test results show that classification with TF-IDF-ICF-IDSF term weighting outperforms the other term weighting methods, reaching 91% Precision, 93% Recall, 86% F1-Score, and 84% Accuracy with SVM.
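The down-weighting effect described in the abstract can be illustrated with a minimal sketch of TF-IDF-ICF weighting. It assumes one commonly used formulation, W = tf × (1 + log(D/df)) × (1 + log(C/cf)), which may differ from the exact formula in the paper. The function name `tf_idf_icf`, the toy titles, and the class labels are hypothetical and not taken from the study's dataset. The IDSF factor is not defined in the abstract, so it is left out.

```python
import math
from collections import Counter


def tf_idf_icf(docs, labels):
    """Return one {term: weight} dict per document.

    docs   : list of token lists (one list per short document)
    labels : class label of each document, same order as docs
    Assumed formula: tf * (1 + ln(D / df)) * (1 + ln(C / cf)).
    """
    n_docs = len(docs)
    classes = set(labels)
    n_classes = len(classes)

    # df: number of documents that contain each term.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    # cf: number of classes that contain each term.
    class_terms = {c: set() for c in classes}
    for tokens, label in zip(docs, labels):
        class_terms[label].update(tokens)
    cf = Counter()
    for terms in class_terms.values():
        cf.update(terms)

    weights = []
    for tokens in docs:
        tf = Counter(tokens)
        weights.append({
            t: tf[t]
            * (1 + math.log(n_docs / df[t]))
            * (1 + math.log(n_classes / cf[t]))
            for t in tf
        })
    return weights


# Hypothetical toy data: four tokenized titles in two made-up classes.
docs = [
    ["sistem", "informasi", "akademik"],
    ["sistem", "pakar", "diagnosa"],
    ["media", "pembelajaran", "interaktif"],
    ["media", "pembelajaran", "sistem"],
]
labels = ["SI", "SI", "MP", "MP"]

weights = tf_idf_icf(docs, labels)
for title, w in zip(docs, weights):
    print(title, {t: round(v, 3) for t, v in w.items()})
```

In this toy data, "sistem" occurs in three of the four titles and in both classes, so both the IDF and ICF factors shrink its weight well below that of a rare word such as "akademik", even though it may be central to each title. This is the behaviour the abstract identifies as a problem for short documents.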

Copyrights © 2022






Journal Info

Abbrev

jcosine

Publisher

Subject

Computer Science & IT

Description

Journal of Computer Science and Informatics Engineering (J-Cosine) is a journal that is published by Informatics Engineering Dept., Faculty of Engineering, University of Mataram (Program Studi Teknik Informatika, Fakultas Teknik Universitas Mataram) under online and print ISSN: 2541-0806 and ...