Research or final assignment is a requirement of graduation students. Every year the research becomes increasing and allows the students to take the same or similar topics. Through this research developed an application to classify student thesis reports. The results of this grouping also indicate that the themes are varied and when the themes becomes non-varied. Student research reports or commonly called a thesis report can be grouped by theme, object or method of the research. The process of extracting this thesis is done by using text mining technology. Then the process of grouping thesis document can be done by using k-means clustering method on a set of thesis documents by taking abstract, keywords and table of contents as an important information that represents the content of the document. Then the document will be done preprocessing first by using text mining method. To process the preprocessing is divided into several parts, namely tokenisasi, filtering, stemming and term weighting. After the document passes through the preprocessing process, then the document can be grouped by using the method of k-means clustering. In this experiment, trials are conducted by entering the number of clusters that vary. From the results of the analysis by entering the different cluster values have obtained the optimal value by entering the number of with the resulting silhouette value 0,483695522.
Copyrights © 2018