Claim Missing Document
Check
Articles

Found 3 Documents
Search

PREDIKSI POTENSIAL GEMPA BUMI INDONESIA MENGGUNAKAN METODE RANDOM FOREST DAN FEATURE SELECTION Tantyoko, Henri; Sari, Dian Kartika; Wijaya, Andreas Rony
IDEALIS : InDonEsiA journaL Information System Vol 6 No 2 (2023): Jurnal IDEALIS Juli 2023
Publisher : Universitas Budi Luhur

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.36080/idealis.v6i2.3036

Abstract

Gempa bumi adalah suatu peristiwa alamiah yang terjadi saat terjadi pelepasan energi secara tiba-tiba dalam kerak bumi, mengakibatkan getaran dan guncangan pada permukaan bumi. Gempa bumi merupakan salah satu bencana alam yang dapat menyebabkan kerusakan fisik yang besar, dampak ekonomi yang signifikan, dan hilangnya nyawa manusia. Beberapa penyebab gempa bumi antara lain aktivitas tektonik lempeng bumi, pergerakan lempeng tektonik, dan deformasi kerak bumi. Untuk mengurangi jumlah korban jiwa, perlu dilakukan prediksi kapan gempa bumi akan terjadi di suatu wilayah. Salah satu cara untuk memprediksi ialah dengan menggunakan metode Machine Learning yaitu Random Forest (RF), metode ini memanfaatkan beberapa pohon keputusan yang selanjutnya dilakukan voting untuk menentukan keputusan akhir prediksi . Model yang baik adalah model yang menghasilkan kesalahan seminimal mungkin. Oleh karena itu, penulis melakukan skema seleksi fitur untuk mengolah fitur-fitur yang memiliki korelasi yang kuat. Prediksi menggunakan RF dengan seleksi fitur menghasilkan F1 score sebesar 92.23%, yang lebih baik 5.02% dibandingkan tanpa menggunakan seleksi fitur. Metode RF + Seleksi Fitur ini juga jauh lebih baik jika dibandingkan metode machine learning tradisional lainnya seperti SVM, Naïve Bayes, dan Decision Tree.
Comparison of the Word2vec Skipgram Model Method Linkaja Application Review using Bidirectional LSTM Algorithm and Support Vector Machine Ayuningtyas, Puji; Tantyoko, Henri
JUSTIN (Jurnal Sistem dan Teknologi Informasi) Vol 12, No 1 (2024)
Publisher : Jurusan Informatika Universitas Tanjungpura

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26418/justin.v12i1.72530

Abstract

Word embedding is a phase in word processing that seeks to convert each word into a vector representation. Word2Vec is a sort of word embedding that is frequently utilized in natural language processing research. Choosing the proper algorithm can help increase the performance of the word embedding method while doing text data categorization tasks. This research uses the Bidirectional LSTM deep learning algorithm and the Support Vector Machine (SVM) machine learning algorithm. The crawling approach was used to obtain data by accessing the LinkAja Application ID on the Google Play Store. The total number of rows in the dataset was 35560. Labeling data involves categorizing it into two target classes: positive (score 1) and negative (score 0). This study employs the Word2Vec approach with skipgram architecture during the vectorization stage. Vector size, window, min count, and sg are the four parameters employed. The bidirectional LSTM architecture employs a sequential model that consists of three neural network layers: embedding, bidirectional, and dense. In the meanwhile, the SVM architecture employs the Radial Basis Function (RBF) kernel parameters. For the final stage of algorithm testing, the accuracy of the bidirectional LSTM (BiLSTM) algorithm was 0.9505, which means it was higher than the support vector machine (SVM) algorithm with an accuracy value of 0.93.
Handling Imbalance Data using Hybrid Sampling SMOTE-ENN in Lung Cancer Classification Latief, Muhammad Abdul; Nabila, Luthfi Rakan; Miftakhurrahman, Wildan; Ma'rufatullah, Saihun; Tantyoko, Henri
International Journal of Engineering and Computer Science Applications (IJECSA) Vol. 3 No. 1 (2024): March 2024
Publisher : Universitas Bumigora Mataram-Lombok

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/ijecsa.v3i1.3758

Abstract

The classification problem is one instance of a problem that is typically handled or resolved using machine learning. When there is an imbalance in the classes within the data, machine learning models have a tendency to overclassify a greater number of classes. The model will have low accuracy in a few classes and high accuracy in many classes as a result of the issue. The majority of the data has the same number of classes, but if the difference is too great, it will differ. The issue of data imbalance is also evident in the data on lung cancer, where there are 283 positive classes and negative classes 38. Therefore, this research aims to use a hybrid sampling technique, combining Synthetic Minority Over-sampling Technique (SMOTE) with Edited Nearest Neighbors (ENN) and Random Forest, to balance the data of lung cancer patients who experience class imbalance. This research method involves the SMOTE-ENN preprocessing method to balance the data and the Random Forest method is used as a classification method to predict lung cancer by dividing training data and testing 10-fold cross validation. The results of this study show that using SMOTE-ENN with Random Forest has the best performance compared to SMOTE and without oversampling on all metrics used. The conclusion is using the SMOTE-ENN hybrid sampling technique with the Random Forest model significantly improves the model's ability to identify and classify data.