p-Index From 2021 - 2026
7.892
P-Index
Claim Missing Document
Check
Articles

Perbandingan Algoritma Naive Bayes dan Decission Tree untuk Prediksi Penyakit Kanker Paru-Paru Gusti Firmansyah, Mulia; Khairuddin, M.; fadillah, M; Efrizoni, Lusiana; Rahmaddeni , Rahmaddeni
Jurnal Dinamika Informatika Vol. 13 No. 1 (2024): Jurnal Dinamika Informatika
Publisher : Program Studi Informatika Universitas PGRI Yogyakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31316/jdi.v13i1.309

Abstract

In this study, we compared the performance of two machine learning algorithms, Naïve Bayes and Decission Tree, for diagnosing lung diseases using patient health datasets. The main objective of this study is to evaluate the accuracy, precision, recall, and F1 score of the two algorithms to determine which method is more effective in predicting lung diseases. The results showed that the tree classification algorithm outperformed Naïve Bayes in terms of accuracy, reaching 95% in an 80:20 split, compared to the 78% accuracy achieved by Naïve Bayes on the same data. Further analysis showed that most patients in this dataset were high risk with 365 patients, followed by risk with 332 patients, and low risk with 303 patients. The decision tree structure proved to be more effective in handling the complexity of the data and produced more accurate predictions, improving efficiency by creating a new "Risk_Score". These results show that decision trees are a better method than Naïve Bayes for diagnosing lung diseases and can provide a solid foundation for developing accurate machine learning models for future health research.
Penerapan K-Means Clustering untuk Mengelompokkan Risiko Diabetes Berdasarkan Gaya Hidup dan Kesehatan Sigit, Rapel Aprilius; Rio, Unang; Efrizoni, Lusiana; Ali, Edwar
JATISI Vol 12 No 4 (2025): JATISI (Jurnal Teknik Informatika dan Sistem Informasi)
Publisher : Universitas Multi Data Palembang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35957/jatisi.v12i4.13292

Abstract

Diabetes mellitus is a chronic disease with a globally increasing prevalence, driven by modern lifestyle changes. Early detection of diabetes risk is crucial in preventing and mitigating long-term complications. This study aims to cluster individuals based on their diabetes risk levels using the K-Means Clustering algorithm by considering lifestyle and health condition attributes. The dataset used was obtained from the Kaggle platform, consisting of 5,452 entries and 22 attributes. The pre-processing stage involved data cleaning, normalization, and manual feature selection. The optimal number of clusters was determined using the Elbow Method, which indicated the best result at k = 3. Cluster quality evaluation was performed using the Davies-Bouldin Index (DBI), which yielded a score of 0.7678, indicating a reasonably good level of cluster compactness and separation. The final output formed three risk clusters: low, medium, and high, with distributions of 424, 819, and 615 records, respectively. This segmentation is expected to serve as a basis for healthcare institutions in designing more targeted and data-driven preventive interventions.
Komparasi Ekstraksi Fitur dalam Klasifikasi Teks Multilabel Menggunakan Algoritma Machine Learning Lusiana Efrizoni; Sarjon Defit; Muhammad Tajuddin; Anthony Anggrawan
MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer Vol. 21 No. 3 (2022)
Publisher : Universitas Bumigora

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/matrik.v21i3.1851

Abstract

Ektraksi fitur dan algoritma klasifikasi teks merupakan bagian penting dari pekerjaan klasifikasi teks, yang memiliki dampak langsung pada efek klasifikasi teks. Algoritma machine learning tradisional seperti Na¨ıve Bayes, Support Vector Machines, Decision Tree, K-Nearest Neighbors, Random Forest, Logistic Regression telah berhasil dalam melakukan klasifikasi teks dengan ektraksi fitur i.e. Bag ofWord (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), Documents to Vector (Doc2Vec), Word to Vector (word2Vec). Namun, bagaimana menggunakan vektor kata untuk merepresentasikan teks pada klasifikasi teks menggunakan algoritma machine learning dengan lebih baik selalumenjadi poin yang sulit dalam pekerjaan Natural Language Processing saat ini. Makalah ini bertujuan untuk membandingkan kinerja dari ekstraksi fitur seperti BoW, TF-IDF, Doc2Vec dan Word2Vec dalam melakukan klasifikasi teks dengan menggunakan algoritma machine learning. Dataset yang digunakan sebanyak 1000 sample yang berasal dari tribunnews.com dengan split data 50:50, 70:30, 80:20 dan 90:10. Hasil dari percobaan menunjukkan bahwa algoritma Na¨ıve Bayes memiliki akurasi tertinggi dengan menggunakan ekstraksi fitur TF-IDF sebesar 87% dan BoW sebesar 83%. Untuk ekstraksi fitur Doc2Vec, akurasi tertinggi pada algoritma SVM sebesar 81%. Sedangkan ekstraksi fitur Word2Vec dengan algoritma machine learning (i.e. i.e. Na¨ıve Bayes, Support Vector Machines, Decision Tree, K-Nearest Neighbors, Random Forest, Logistic Regression) memiliki akurasi model dibawah 50%. Hal ini menyatakan, bahwa Word2Vec kurang optimal digunakan bersama algoritma machine learning, khususnya pada dataset tribunnews.com.
Komparasi Algoritma K-Nearest Neighbors dan Naïve Bayes dalam Klasifikasi Penyakit Diabetes Gestasional Ermy Pily, Annisa Khoirala; Oktavianda; Aprilia, Fanesa; Rahmaddeni; Efrizoni, Lusiana
The Indonesian Journal of Computer Science Vol. 13 No. 1 (2024): The Indonesian Journal of Computer Science
Publisher : AI Society & STMIK Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33022/ijcs.v13i1.3714

Abstract

Diabetes merupakan penyakit metabolik dengan gejala hiperglikemia akibat gangguan sekresi insulin dan aksi insulin. Diabetes gestasional adalah gangguan toleransi glukosa pada wanita hamil. Saat kehamilan, plasenta menghasilkan hormon baru seperti human placental lactogen (HPL), hormon estrogen, dan hormon peningkat resistensi insulin. Gejala diabetes gestasional tidak selalu mudah dikenali, dan seringkali penderitanya mengalami gejala awal secara tidak sadar. Penelitian ini bertujuan untuk membandingkan performa dua algoritma yaitu K-NN dan Naïve Bayes dengan Feature Selection dalam mengklasifikasikan penderita diabetes gestasional. Hasil error terendah dari feature selection dengan iterasi K=4, memperoleh MAE 0.317, MSE 0.142, dan RMSE 0.377. Hasil akurasi pada model KNN dengan K=5 , tanpa Feature Selection sebesar 80% dan K-NN dengan Feature Selection sebesar 77%. Sementara itu, Naïve Bayes tanpa Feature Selection sebesar 77% dan Naïve Bayes dengan Feature Selection sebesar 80%. Dari hasil tersebut K-NN tanpa Feature Selection dan Naïve Bayes dengan Feature Selection mendapatkan hasil yang lebih baik.
Optimizing Content Recommendations Using a Hybrid Filtering Algorithm to Enhance User Relevance and Engagement Efrizoni, Lusiana; Junadhi; Agustin
MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer Vol. 25 No. 2 (2026)
Publisher : Universitas Bumigora

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/matrik.v25i2.5988

Abstract

Recommender systems play an important role in helping users discover relevant content in environments characterized by information overload. However, existing approaches often struggle to balance recommendation relevance and user engagement. Collaborative filtering is constrained by data sparsity and the cold-start problem, whereas content-based methods that rely on textual features may not fully capture dynamic user preferences. This study aims to develop a hybrid deep learning-based recommendation model that improves both recommendation relevance and user engagement. The proposed method integrates collaborative filtering via Neural Matrix Factorization (NeuMF) with content-based filtering via a Long Short-Term Memory (LSTM) text encoder, employing an early-fusion strategy. An experimental research method was applied using synthetic user–item interaction data. Model performance was evaluated using ranking metrics (Precision@10, Recall@10, and NDCG@10) and engagement metrics (Click-Through Rate and Average Completion Ratio). The results show that the hybrid model outperforms the baseline models. It achieves Precision@10 of 0.143, Recall@10 of 0.112, and NDCG@10 of 0.139, which exceed those of both the NeuMF-only and LSTM-only models. In terms of engagement, the hybrid model also records the best performance with a CTR of 0.0017 and an ACR of 0.0090. These findings indicate that integrating user–item interaction patterns with semantic content representations can significantly enhance recommendation quality and user engagement, providing a more effective solution for content-rich digital platforms.
Co-Authors -, Dwi Haryono Afrinanda, Rizky Agung Marinda Agustin Agustin Agustin Agustin Agustin Agustin Agustin, Endy Wulan Ahmad - Fauzan Ahmad Fauzan Ahmad Rizali Anam, M Khairul Andhika, Imam Anthony Anggrawan Anugraha, Yoga Safitra Aprilia, Fanesa Arifin, Muhammad Amirul Armoogum , Sheeba Aulia, Rahma Azhari, Zahra Cikita, Putri Dadynata, Eric Deni, Rahmad Devi Puspita Sari, Devi Puspita Dewi, Deshinta Arrova Dhini Septhya Djamalilleil, Said Azka Fauzan Edwar Ali Erlinda, Susi Ermy Pily, Annisa Khoirala ester nababan fadillah, m Fadly Fadly Farhan Pratama Fauzan, Aulia Filza Izzati Finanta Okmayura Firdaus, Muhammad Bambang Firman, Muhammad Aditya Fransiskus Zoromi Fransiskus Zoromi, Fransiskus Gusti Firmansyah, Mulia Habibie, Dedi Rahman Hadi Asnal, Hadi Handayani, Nadya Satya Haviluddin Haviluddin Helda Yenni, Helda Hidaya Spitri Hutasoit, Josua Iftar Ramadhan Ihsan, Raja Muhammad Ike Yunia Pasa Irwanda Syahputra Julianti, Nadea Junadhi Junadhi Junadhi Junadhi Junadhi, Junadhi Karpen Kartina Diah K. W. Khairuddin, M. Kharisma Rahayu Koko Harianto Kurniawan, Tri Basuki Lathifah, Lathifah Lestari, Fika Ayu Lili Marlia M. Azzuhri Dinata M. Irpan Marhadi, Nanda Maulana, Fitra Melva Suryani Muhammad Bambang Firdaus Muhammad Oase Ansharullah Muhammad Syaifullah MUHAMMAD TAJUDDIN Munawir Munawir Muslim Muslim Nanda, Annisa Nasution , Zikri Hardyan Novfuja, Elma Nurul fadillah, Nurul Oktavianda Panguluri, Padmavathi Praveen, S Phani Purnama, Muhammad Adji Putantri, Nazlah Sari Putra, Febrianda Putri, Adinda Dwi Putri, Siti Faradila R. Guntur Surya Yuwana - Rabbani, Salsabila Rahmaddeni , Rahmaddeni Rahmaddeni Rahmaddeni Rahmaddeni, - Rahmiati Rahmiati Rais Amin Ramadhani, Jilang Rati Rahmadani Ratna Andini Husen Revaldo, Bagus Tri Riadhil Jannah Rini Yanti, Rini Risky Harahap Risman Risman Rizki Astuti Rohmatulloh, Vanda Rometdo Muzawi, Rometdo Safitri, Dea Sahelvi, Elza Sapina, Nur Sapitri, Riska Mela Sari, Atalya Kurnia Sarjon Defit Sarjon Defit Setiawan , Andri Shahreen Kasim, Shahreen Sholekhah, Fitriana Sigit, Rapel Aprilius Sirisha, Uddagiri Sularno Supian, Acuan Susandri, Susandri Susanti Susanti Susanti, Susanti Susi Erlinda Syahrul Imardi Syarifuddin Elmi Tahiyat, Hafsah Fulaila Tashid Tawa Bagus, Wahyu Torkis Nasution Tri Putri Lestari, Tri Putri Tri Revaldo, Bagus Triyani Arita Fitri Try Puspa Siregar, Farida Ulfa, Arvan Izzatul Unang Rio Uthami, Kurnia Vindi Fitria Wirta Agustin Wirta Wirta Yanti, Rini Yoyon Efendi Yulli Zulianda Zahra Azhari Zakaria , Mohd Zaki Zakaria, Mohd Zaki Zega, Wilman Zikri Hadryan nst Zulafwan Zuriatul Khairi Zuriatul Khairi