Universitas Bhinneka PGRI Library has many collections in both printed and digital forms, which collections will increase over time. Thus the number of collections of books in the library will be more and more diverse, it will make the process of grouping existing collections difficult. The method used in this study is data mining with the K-Nearest Neighbors (K-NN) algorithm approach by combining TF-IDF as word frequency weighting. The stages of working on the K-NN method in this study went through 4 stages, namely: (1) text preprocessing by applying the tokenization method, case folding, stopword removal and stemming, (2) Word weighting using the TF-IDF method (3). Modeling the k value from a minimum limit of 1 to a maximum limit of 30. (4) Classification of data using the most optimal k value based on k value modeling. (5) discussion of classification results. Data collection techniques using literature studies and datasets. With this classification system, it is expected to provide useful information for users. In addition, this study also aims to implement the K-NN method by combining it with TF-IDF while at the same time knowing the accuracy of the sales prediction system. The results of this study are based on the highest accuracy value for the classification of book titles of 66.67% and the lowest accuracy value of 60% with an average accuracy value of 63.33%.Kata kunci— Data Mining, K-Nearest Neighbor (K-NN), TF-IDF
Copyrights © 2021