JOINCS (Journal of Informatics, Network, and Computer Science)
Vol 3 No 1 (2020): April

Comparative Analysis of Text Mining Results With Tf ldf Features and SQL Like Operator in Indonesian News Search

Riwa Rambu Hada Enda (Prodi Teknik Informatika, Universitas Kristen Wira Wacana Sumba)
Fajar Hariadi (Prodi Teknik Informatika, Universitas Kristen Wira Wacana Sumba)



Article Info

Publish Date
29 Apr 2020

Abstract

Research on the implementation of text mining uses the TF IDF method to be used in the Information retrieval / Indonesian news search feature. The dataset used was sourced from NewsAPI and built a Codeigniter based website named "News Plus Six Dua". This study also uses the Vertor Space Model (VSM) method to overcome the weaknesses of the TF IDF method at the time of the sorting process. The results of this study explain that the search by the TF IDF method has higher accuracy when compared to SQL like operators. TF IDF produces a percentage of precision 100% and recall (sensitivity) 66.7% on searches with the keyword "Indonesian soccer schedule" while SQL like operators do not display search results or equal to 0%. But the TF IDF method has the disadvantage of running slower than SQL like operators. This has been tested using either the number of words or terms entered, the number of datasets, and the location of access. At the location of access, access via hosting is monitored faster when compared via localhost.

Copyrights © 2020






Journal Info

Abbrev

joincs

Publisher

Subject

Computer Science & IT

Description

JOINCS publishes original research papers in computer science and related subjects in system science, with consideration to the relevant mathematical theory. Applications or technical reports oriented papers may also be accepted and they are expected to contain deep analytic evaluation of the ...