Lidina
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Peningkatan Klasifikasi Kanker Paru-Paru Melalui Rekayasa Fitur Interaksi Faktor Resiko Ananda Rizki Fitria; Agnes Prameswari; Aas Mirawati; Lidina
Prosiding SISFOTEK Vol 9 No 1 (2025): SISFOTEK IX 2025
Publisher : Ikatan Ahli Informatika Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

This study aims to improve the accuracy of lung cancer classification by applying a feature engineering-based machine learning approach from risk factor interactions. The data used comes from the Lung Cancer Risk Dataset on Kaggle, which contains 50,000 patient records with demographic, lifestyle, and medical condition variables. The preprocessing stage includes normalization, one-hot encoding, and the formation of interaction features that represent the nonlinear relationship between smoking habits, environmental exposure, and medical history. Two Random Forest models were compared: a baseline model without interaction features and an expanded model with interaction features. The results showed that the baseline model achieved an accuracy of 0.6973, while the model with interaction features achieved 0.6949, with better interpretability. Visualization through confusion matrices, feature importance plots, and SHAP analysis showed the contribution of engineered features to the interpretability of the model. These results indicate that interaction-based feature engineering can enrich model transparency and provide deeper clinical insights, and has the potential to be applied in clinical decision support systems and precision-based prediction models.