Lung disease is a health problem that greatly affects the quality of life and various types, such as pneumonia, bronchitis, tuberculosis, asthma and COPD require special attention. Accurate classification is essential to ensure effective treatment and prevent complications. The research used the C4.5 Decision Tree Algorithm method to classify lung cancer risk using a dataset that included 16 attributes, symptoms such as and risk factors including age, shortness of breath, and smoking habits, for a total of 309 data. The train_test_split method from Scikit-learn is used to split the data into 70% for training and 30% for testing. With 89% accuracy, 70% precision, and 74.5% recall on test data assessed using the Confusion Matrix, the C4.5 model demonstrated strong performance. These findings show that 83 of the 93 predictions in the test data were correct. This research concludes that the Decision Tree Algorithm has been proven to support the diagnosis of lung cancer. however, the model performance can be improved by comparing it with other algorithms to get more optimal results.
Copyrights © 2024