International Journal of Artificial Intelligence Research
Vol 3, No 2 (2019): December 2019

Journal Classification Using Cosine Similarity Method on Title and Abstract with Frequency-Based Stopword Removal 

Piska Dwi Nurfadila (Electrical Engineering Department, Universitas Negeri Malang)
Aji Prasetya Wibawa (Scopus ID : 56012410400
Dept Electrical Engineering, State University of Malang, Malang)

Ilham Ari Elbaith Zaeni (Electrical Engineering Department, Universitas Negeri Malang)
Andrew Nafalski (University of South Australia)



Article Info

Publish Date
30 Dec 2019

Abstract

Classification of economic journal articles has been done using the VSM (Vector Space Model) approach and the Cosine Similarity method. The results of previous studies are considered to be less optimal because Stopword Removal was carried out by using a dictionary of basic words (tuning). Therefore, the omitted words limited to only basic words. This study shows the improved performance accuracy of the Cosine Similarity method using frequency-based Stopword Removal. The reason is because the term with a certain frequency is assumed to be an insignificant word and will give less relevant results. Performance testing of the Cosine Similarity method that had been added to frequency-based Stopword Removal was done by using K-fold Cross Validation. The method performance produced accuracy value for 64.28%, precision for 64.76 %, and recall for 65.26%. The execution time after pre-processing was 0, 05033 second.

Copyrights © 2019






Journal Info

Abbrev

IJAIR

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal Of Artificial Intelligence Research (IJAIR) is a peer-reviewed open-access journal. The journal invites scientists and engineers throughout the world to exchange and disseminate theoretical and practice-oriented topics of Artificial intelligent Research which covers four (4) ...