JURNAL TEKNIK INFORMATIKA DAN SISTEM INFORMASI
Vol 11 No 1 (2024): JATISI (Jurnal Teknik Informatika dan Sistem Informasi)

KEYWORD EXTRACTION JUDUL BERITA ONLINE DI INDONESIA MENGGUNAKAN METODE TF-IDF

Wibowo, Ibnu Surya (Unknown)
Witanti, Arita (Unknown)
Susilawati, Indah (Unknown)



Article Info

Publish Date
15 Mar 2024

Abstract

Keyword Extraction is the process of identifying and extracting key words or important phrases from a text or document, aiming to quickly identify keyword information from a large amount of data. This research investigates three online news sites in Indonesia, namely Kompas.com, Detik.com, and Tempo.co. The keyword extraction process is strengthened by the TF-IDF (Term Frequency - Inverse Document Frequency) method, assigning a high weight to words that frequently appear in one document but rarely in others. The TF-IDF calculation results in different weights for each word. Many words were successfully extracted from the three sites, displaying 30 keywords from the dataset. The accuracy of the weight calculation was then tested using F1-Score, resulting in an F1-Score of 99.67%, with an accuracy rate of 99.76%.

Copyrights © 2024






Journal Info

Abbrev

jatisi

Publisher

Subject

Computer Science & IT

Description

JATISI bekerja sama dengan IndoCEISS dalam pengelolaannya. IndoCEISS merupakan wadah bagi para ilmuwan, praktisi, pendidik, dan penggemar dalam bidang komputer, elektronika, dan instrumentasi yang menaruh minat untuk memajukan bidang tersebut di Indonesia. JATISI diterbitkan 2 kali dalam setahun ...