Jurnal Andalas: Rekayasa dan Penerapan Teknologi
Vol. 5 No. 1 (2025): Juni 2025

Leveraging Naive Bayes Classification for Early Detection of Breast Cancer: A Data-Centric Diagnostic Approach

Budi, Baik (Unknown)
Refki Budiman (Unknown)
Queen Hesti Ramadhamy (Unknown)



Article Info

Publish Date
01 Jun 2025

Abstract

This study aims to develop a breast cancer detection model using two distinct approaches: the Naive Bayes algorithm for classification and the K-Means algorithm for clustering. The methodology involves the collection of diagnostic clinical feature data, data preprocessing for normalization, and the separate training and evaluation of each model. Naive Bayes is employed to classify breast cancer as malignant or benign based on training and testing datasets, while K-Means is applied to unlabeled data as an additional analytical method. The performance of the Naive Bayes classifier is assessed using a confusion matrix, whereas the clustering results from K-Means are evaluated based on cluster validity metrics. The results indicate that Naive Bayes achieves a high level of accuracy (93%) in breast cancer classification, while K-Means offers additional insights through data pattern clustering. Together, these approaches demonstrate potential to effectively support the medical diagnostic process.

Copyrights © 2025






Journal Info

Abbrev

jarpet

Publisher

Subject

Education Electrical & Electronics Engineering Health Professions Industrial & Manufacturing Engineering Other

Description

This journal is open to submission from scholars and experts in the implementation of sciences and technologies to solve the real problems in the community, such as but not limited to: Small scale factory, Industry, Education, Agriculture, Environment, Sport, Tourism, Food, and ...