Tuberculosis is an infectious disease caused by the bacterium Mycobacterium tuberculosis. It is a serious global health problem and can be fatal if not treated properly. At the Sidorejo Health Center, patients are currently diagnosed using several benchmarks drawn from their medical history, covering complaints, symptoms, and risk factors, but the accuracy of the resulting diagnoses has not yet been measured. Comparing the K-nearest neighbor and Naïve Bayes algorithms for classifying tuberculosis can give the Sidorejo Health Center input on the accuracy of tuberculosis diagnosis based on medical information such as symptoms and medical history, with patient data processed in the RapidMiner application. The system development method used in this study is CRISP-DM, which consists of business understanding, data understanding, data preparation, modeling, evaluation, and deployment. The models were tested with a confusion matrix to measure their accuracy: the K-nearest neighbor algorithm achieved the higher accuracy, 98%, while the Naïve Bayes algorithm achieved the lower accuracy, 70% (0.70).
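
As an illustration of the evaluation step described above, the following is a minimal Python sketch of how accuracy can be derived from a binary confusion matrix. It is not the study's RapidMiner workflow. The function names `confusion_matrix` and `accuracy`, the class labels "TB" and "non-TB", and the ten toy labels are all hypothetical and are not taken from the study's data.

```python
from collections import Counter


def confusion_matrix(actual, predicted, positive="TB"):
    """Count true/false positives and negatives for a binary classifier."""
    counts = Counter(tp=0, fp=0, fn=0, tn=0)
    for a, p in zip(actual, predicted):
        if a == positive and p == positive:
            counts["tp"] += 1  # TB case correctly flagged
        elif a != positive and p == positive:
            counts["fp"] += 1  # non-TB case wrongly flagged as TB
        elif a == positive and p != positive:
            counts["fn"] += 1  # TB case that was missed
        else:
            counts["tn"] += 1  # non-TB case correctly cleared
    return counts


def accuracy(cm):
    """Accuracy = (TP + TN) / all predictions."""
    total = cm["tp"] + cm["fp"] + cm["fn"] + cm["tn"]
    return (cm["tp"] + cm["tn"]) / total if total else 0.0


# Hypothetical toy labels for illustration only (not the study's patient data).
actual = ["TB", "TB", "non-TB", "non-TB", "TB",
          "non-TB", "TB", "non-TB", "TB", "non-TB"]
predicted = ["TB", "TB", "non-TB", "TB", "TB",
             "non-TB", "non-TB", "non-TB", "TB", "non-TB"]

cm = confusion_matrix(actual, predicted)
print(dict(cm), accuracy(cm))
```

The same two functions could be applied to the predictions of each algorithm in turn, so that K-nearest neighbor and Naïve Bayes are compared on an identical test set.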
                        
                        
                        
                        
                            
Copyright © 2025