Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer
Vol 2 No 12 (2018): Desember 2018

Clustering Dokumen Skripsi Dengan Menggunakan Hierarchical Agglomerative Clustering

Danang Aditya Wicaksana (Fakultas Ilmu Komputer, Universitas Brawijaya)
Putra Pandu Adikara (Fakultas Ilmu Komputer, Universitas Brawijaya)
Sigit Adinugroho (Fakultas Ilmu Komputer, Universitas Brawijaya)



Article Info

Publish Date
06 Aug 2018

Abstract

A minor thesis is a document of a scientific work compiled by a student at the level of stratum 1 which discusses a particular topic or field of research or development results that the student has undertaken in order to take the final examination to obtain a degree. In the Reading Room of the Faculty of Computer Science and the Central Library of Brawijaya University there is a problem that arises that there is no categorization of all minor thesis documents stored. Hierarchical Agglomerative Clustering (HAC) method is implemented for clustering minor thesis documents based on minor thesis title. HAC classifies iterative documents from the smallest cluster to the largest 1 cluster. Input data that is in the form of title of minor thesis document of Informatics Engineering Brawijaya University. The preprocessing stage is performed on the minor thesis title data to get the term feature. All the terms obtained are processed to get the weight of TF-IDF. The value of similarity between documents obtained from the value of cosine distance. The clustering process uses 3 distance options as the single linkage, complete linkage and average linkage parameters. The clustering results of each distance parameter are displayed on the label of each cluster generated and each cluster generated is evaluated using silhouette coefficient. From the test result on 100 minor thesis documents obtained the value of Silhouette Coefficient from single linkage is 0,10125, complete linkage is 0,155733 and average linkage is 0,160428. Average linkage is better in grouping documents than single linkage and complete linkage.

Copyrights © 2018






Journal Info

Abbrev

j-ptiik

Publisher

Subject

Computer Science & IT Control & Systems Engineering Education Electrical & Electronics Engineering Engineering

Description

Jurnal Pengembangan Teknlogi Informasi dan Ilmu Komputer (J-PTIIK) Universitas Brawijaya merupakan jurnal keilmuan dibidang komputer yang memuat tulisan ilmiah hasil dari penelitian mahasiswa-mahasiswa Fakultas Ilmu Komputer Universitas Brawijaya. Jurnal ini diharapkan dapat mengembangkan penelitian ...