LogicLink: Journal of Artificial Intelligence and Multimedia in Informatics
Vol. 1 No. 1, June 2024

Peringkasan Dokumen Teks Bilingual Sebagai Reduksi Fitur Untuk Klasifikasi Menggunakan Algoritma K-NN

Rahmawan Bagus Trianto (Unknown)
Agus Susilo Nugroho (Unknown)



Article Info

Publish Date
23 Jun 2024

Abstract

Summarizing text is a step to extract the essence of a text document with no more than half. Summarizing text has an important role in extracting the core information from a document in a more concise form. Summarizing text documents can be used as feature reduction in classifying text documents because it can reduce features that are considered irrelevant. Text documents are summarized using the Term Frequency-Inverse Document Frequency (TF-IDF) method, then classified using the K-Nearest Neighbor (K-NN) algorithm. One of the disadvantages of the K-NN algorithm is that it is not optimal in classification if the k value is not appropriate, as well as the selection of an inappropriate distance calculation method. By testing various k values ​​and using the Euclidean Distance distance measurement method, you can increase the accuracy of text document classification. Text document summarization using the proposed TF-IDF method is proven to increase when classification is carried out with K-NN. From the research results, it was found that the classification accuracy at the compression rate increased by 50% with a k value of 6 to 8 of 95.33%. This shows that text document summarization as feature reduction has a positive role in the classification process using the K-NN algorithm.

Copyrights © 2024






Journal Info

Abbrev

logiclink

Publisher

Subject

Computer Science & IT

Description

LogicLink : Journal of Artificial Intelligence and Multimedia in Informatics is free of fee, open access, and peer-reviewed journal, published by Informatics Department - UIN K.H. Abdurrahman Wahid Pekalongan Indonesia, which is a dissemination medium for research results from scientists and ...