Claim Missing Document
Check
Articles

Found 12 Documents
Search

Model Machine Learning Untuk Prediksi Risiko Penyakit Liver Dengan Random Forest Teroptimasi Rizky Andrea Arifa; Nana Suarna; Agus Bahtiar; Nining Rahaningsih; Willy Prihartono
Jurnal Sistem Informasi dan Teknologi Vol 6 No 1 (2026): Jurnal Sistem Informasi dan Teknologi (SINTEK)
Publisher : LPPM STMIK KUWERA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56995/sintek.v6i1.204

Abstract

Penyakit liver merupakan salah satu kondisi kronis dengan tingkat mortalitas tinggi, sehingga diperlukan pendekatan prediksi yang akurat untuk mendukung deteksi dini. Penelitian ini bertujuan mengembangkan model machine learning untuk memprediksi risiko penyakit liver menggunakan algoritma Random Forest yang dioptimalkan dengan RandomizedSearchCV. Dataset yang digunakan terdiri dari 1.700 entri yang mencakup variabel klinis dan gaya hidup, termasuk usia, jenis kelamin, BMI, konsumsi alkohol, kebiasaan merokok, riwayat genetik, aktivitas fisik, diabetes, hipertensi, serta hasil Liver Function Test. Proses penelitian meliputi preprocessing, normalisasi skala, pembagian data menggunakan train-test split 80:20, pembangunan model baseline, dan optimasi hiperparameter. Hasil eksperimen menunjukkan bahwa optimasi menghasilkan peningkatan performa model, dengan akurasi 0.91, peningkatan recall sebesar 3.20%, dan AUC-ROC mencapai 0.96. Analisis feature importance menunjukkan bahwa LiverFunctionTest, BMI, dan AlcoholConsumption merupakan fitur paling berpengaruh terhadap prediksi risiko penyakit liver. Dengan demikian, Random Forest teroptimasi terbukti efektif dalam menghasilkan model prediksi yang akurat dan dapat digunakan sebagai alat pendukung keputusan dalam deteksi dini penyakit liver.
ALGORITMA RANDOM FOREST UNTUK PREDIKSI STATUS PINJAMAN BERDASARKAN SKOR KREDIT Attaufiqqurrohman, Hadit; Ade Irma Purnamasari; Denni Pratama; Nining Rahaningsih; Willy Prihartono
METHODIKA: Jurnal Teknik Informatika dan Sistem Informasi Vol. 12 No. 1 (2026): Volume 12 Nomor 1 Tahun 2026
Publisher : Universitas Methodist Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

The rapid development of financial technology has encouraged financial institutions to adopt data-driven credit scoring systems in order to minimize the risk of default. However, many loan eligibility prediction models still face challenges such as data imbalance (class imbalance) and the limited capability of traditional models to capture non-linear relationships among variables. This study aims to develop a loan status prediction model using the Random Forest algorithm combined with the Synthetic Minority Oversampling Technique (SMOTE) and One-Hot Encoding (OHE) to improve model accuracy and generalization capability. The data used in this study are secondary data obtained from the public Kaggle platform, consisting of 45,000 records with 14 demographic and financial attributes. The research method employs a supervised learning approach with several stages, including data acquisition and preprocessing (data cleaning, normalization, encoding, and data balancing), Random Forest model training, and performance evaluation using accuracy, precision, recall, F1-score, and AUC metrics. The results show that the combination of Random Forest, SMOTE, and OHE achieves high predictive performance, with an accuracy of 94.8%, precision of 95.6%, recall of 93.7%, F1-score of 94.6%, and an AUC value of 0.972. The most influential variables in loan status prediction are credit_score, person_income, and loan_amnt. This approach is proven to be effective in addressing data imbalance issues and improving classification accuracy in identifying creditworthy and non-creditworthy borrowers.