Claim Missing Document
Check
Articles

Penerapan Algortitma C4.5 untuk Klasifikasi Sentimen Masyarakat terhadap #RUUKUHP pada Twitter Vusuvangat, Imam; Kurnia Gusti, Siska; Syafira, Fadhilah; Novriyanto, Novriyanto; Insani, Fitri
Jurnal Teknologi Sistem Informasi dan Aplikasi Vol. 6 No. 4 (2023): Jurnal Teknologi Sistem Informasi dan Aplikasi
Publisher : Program Studi Teknik Informatika Universitas Pamulang

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Social media, especially Twitter, has developed into an important tool for people to share their opinions and feelings widely. Users often use hashtags to share messages related to certain topics. Some of the issues that lead to the need for sentiment analysis of the Draft Criminal Code are social impact, Public disapproval, Potential legal uncertainty, Potential abuse, Support and criticism. By conducting a sentiment analysis of the draft Penal Code, the government and policymakers can better understand the views of the public, identify possible problems and address them, and make necessary improvements or clarifications to the draft law. This can help ensure that the draft Penal Code has greater public support and adheres to good legal principles. The classification of public responses to this hashtag provides a significant snapshot of public attitudes and perspectives. This study aims to classify public sentiment towards the RUUKUHP hashtag on the Twitter platform using the C4.5 algorithm. This study uses a collection of tweets with the hashtag RUUKUHP which are manually categorized into two and three sentiment categories, namely positive, negative and positive, negative and neutral. In this study, data preprocessing is carried out before training the model which includes removing links, special characters, removing stopwords, and word tokenization. Furthermore, this research uses text representation methods such as TF-IDF to extract features from the tweet text and convert them into numerical vectors used by the C4.5 algorithm. After training the classification model using the C4.5 algorithm with the classified dataset, it evaluates the performance of the model with the metrics of accuracy, recall, precision, and F1 score. Experimental results using 2 categories of Negative and Positive show that the model applied with the C4.5 algorithm achieved an accuracy of 96.6% with a recall of 96.6%, a percision of 97.1% and an F1 score of 96.8. And experiments using 3 categories of Negative, Positive and Neutral achieved an accuracy of 67%, a recall of 67%, a precision of 65%, and an F1 score of 66%. Thus it can be concluded that the results of the RUUKUHP hashtag sentiment classification with 2 class predictions are more relevant than 3 sentiment class predictions with a value reaching 96.6%.
Perbandingan Teknik Penyeimbang Kelas Pada Multi-Layer Perceptron (MLP) Berbasis Backpropagation Untuk Klasifikasi Diabetes Mellitus Robby Azhar; Siska Kurnia Gusti; Iis Afrianty; Elvia Budianita
Bulletin of Computer Science Research Vol. 5 No. 6 (2025): October 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v5i6.804

Abstract

Diabetes Mellitus (DM) is a chronic disease that can lead to serious complications if not detected early; therefore, early diagnosis is highly important. One of the methods that can be applied for early diagnosis is the classification technique in data mining. However, the classification process often faces challenges due to class imbalance, which can reduce model performance. This study aims to analyze the effect of class balancing techniques on the performance of the Backpropagation Neural Network (BPNN) in classifying DM cases. BPNN is a form of Multi-Layer Perceptron (MLP) with a simple structure and the ability to solve complex problems with good accuracy. The dataset used in this study is the Pima Indians Diabetes Dataset, consisting of 768 instances, including 500 non-diabetic and 268 diabetic cases. The research was conducted using three scenarios: without balancing, Synthetic Minority Over-sampling Technique (SMOTE), and Random Under Sampling (RUS). The BPNN model was designed with two architectural variations (one hidden layer and two hidden layers), three learning rate values (0.1, 0.01, and 0.001), and a varying number of neurons. The dataset was divided using the 10-Fold Cross Validation technique. The results show that applying SMOTE achieved the best performance, with an average accuracy of 90.89%, precision of 91.22%, recall of 90.89%, and F1-score of 90.89% on the BPNN architecture with one hidden layer. Furthermore, the single hidden layer architecture proved more stable than the two hidden layers, especially when the dataset size decreased due to RUS. Therefore, the combination of SMOTE and BPNN with one hidden layer provides better performance in classifying Diabetes Mellitus cases.
Penerapan Seleksi Fitur Information Gain dan Metode Backpropagation Neural Network Untuk Klasifikasi Atrisi Karyawan Dinyah Fithara; Elvia Budianita; Iis Afrianty; Siska Kurnia Gusti
Bulletin of Computer Science Research Vol. 6 No. 1 (2025): December 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v6i1.922

Abstract

Employee attrition management is a critical challenge for organizations as it involves costs, time, and the risk of decision-making errors. This problem requires a data-driven business strategy to achieve more accurate predictions of employees who are potentially at risk of termination. This study applies the Information Gain feature selection method and the Backpropagation Neural Network (BPNN) algorithm in the employee attrition classification process with the aim of increasing the accuracy and efficiency of the prediction model. BPNN is chosen due to its simpler architecture, faster training time, and greater stability for small to medium sized datasets.  With the assistance of Information Gain feature selection, BPNN is able to achieve optimal performance without requiring a complex architecture. The dataset used consist of 35 attributes and 1.470 employee records covering various factor such as age, income level, and employment status. The research stages include feature selection based on information gain values with specific thresholds, data partitioning using k-fold cross validation, and model training using BPNN with variations of learning rates and hidden neuron counts. The results show that the combination of Information Gain and BPNN improves classification accuracy compared to models without feature selection, achieving the highest average accuracy of 87.28% when using 25 selected attributes, with a BPNN configuration of learning rate 0.001, 35 hidden neurons, and 50 epochs. The attributes with the highest Information Gain score include JobLevel, OverTime, MaritalStatus, and MonthlyIncome. This study demonstrates that the proposed approach successfully enhances the prediction performance of employee attrition and can serve as a foundation for developing data-driven models that support employee retention efforts.
Implementasi Metode RBMT dalam Penerjemahan Bahasa Indonesia ke Bahasa Makassar Hanif, Wan Muhammad; Yusra, Yusra; Muhammad Fikry; Febi Yanto; Siska Kurnia Gusti
Bulletin of Computer Science Research Vol. 6 No. 1 (2025): December 2025
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bulletincsr.v6i1.935

Abstract

?This research was conducted to address the limited availability of linguistic resources for regional languages, particularly Makassar Language, which does not yet have adequate automatic translation support. The main problem addressed in this study is the absence of a reliable automatic translation system for Makassar Language. The objective of this research is to apply a rule-based translation method to translate text from Indonesian into Makassar Language. This study focuses on the implementation of the Rule-Based Machine Translation (RBMT) method for translating Indonesian text into Makassar Language using the Python programming language. The RBMT implementation involves tokenization, morphological analysis, vocabulary matching, and the application of grammatical rules, including the identification of prefixes and suffixes. The data used consist of a bilingual dictionary compiled from various sources and a set of test sentences representing everyday sentence structures. Translation evaluation was carried out using the Word Error Rate (WER) method, yielding a result of 0.289, and the Character Error Rate (CER) method, with a result of 0.21, which fall into the “Good” category based on the evaluation scale. The main findings indicate that the application of the RBMT method is capable of producing reasonably accurate translations at both the word and character levels. These findings demonstrate that a rule-based approach can be effectively applied to regional languages with limited digital data and provide an initial overview of the potential use of rule-based methods to support the development and preservation of regional languages.
Clustering of Halal MSME Aid Recipients: Uncovering Patterns and Characteristics Using the K-Medoids Method Vitriani, yelfi; Gusti, Siska Kurnia
Jurnal CoreIT: Jurnal Hasil Penelitian Ilmu Komputer dan Teknologi Informasi Vol 11, No 2 (2025): December 2025
Publisher : Fakultas Sains dan Teknologi, Universitas Islam Negeri Sultan Syarif Kasim Riau

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24014/coreit.v11i2.38634

Abstract

The rapid growth of the halal industry has strengthened the strategic role of Micro, Small, and Medium Enterprises (MSMEs) in meeting market expansion. However, the absence of structured insights regarding the characteristics and patterns of halal MSME aid recipients has hindered the formulation of effective and targeted support programs. This study aims to identify the clustering patterns of halal MSME beneficiaries in Indonesia using the K-Medoids algorithm optimized with Principal Component Analysis (PCA). A total of 129 MSME datasets were collected through validated questionnaires consisting of demographic variables, aid history, business performance, and operational challenges. Preprocessing included data cleaning, transformation, and dimensionality reduction using PCA. The optimal PCA dimension was determined as two components based on the Davies-Bouldin Index (0.1737). K-Medoids clustering produced three optimal clusters validated using Silhouette (0.4602), Davies-Bouldin Index (0.7861), and Elbow Method (K=3). Each cluster shows distinctive characteristics in income range, business legality, type of aid received, challenges, and performance outcomes. The novelty of this research lies in the application of PCA-optimized K-Medoids for halal MSME segmentation, providing insightful foundations for evidence-based policymaking.
KLASIFIKASI MINAT BACA MENGGUNAKAN METODE NAÏVE BAYES CLASSIFIER Hafiz, Muhammad; Budianita, Elvia; Nazir, Alwis; Gusti, Siska Kurnia
Journal of Economic, Bussines and Accounting (COSTING) Vol. 9 No. 1 (2026): COSTING : Journal of Economic, Bussines and Accounting
Publisher : Institut Penelitian Matematika, Komputer, Keperawatan, Pendidikan dan Ekonomi (IPM2KPE)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31539/gcqhr236

Abstract

Minat baca merupakan faktor penting dalam mendukung keberhasilan akademik siswa, namun pengukurannya masih sering dilakukan secara subjektif. Penelitian ini bertujuan untuk mengklasifikasikan tingkat minat baca siswa MTsN 1 Payakumbuh menggunakan metode Naïve Bayes Classifier. Atribut yang digunakan sebagai input berupa 20 butir pernyataan kuesioner skala Likert yang merepresentasikan kebiasaan, frekuensi, motivasi, serta preferensi membaca siswa. Data penelitian diperoleh dari 911 responden yang dikelompokkan ke dalam tiga kelas tingkat minat baca, yaitu tinggi (329 data), sedang (501 data), dan rendah (81 data). Pengujian model dilakukan menggunakan tiga skema pembagian data latih dan data uji, yaitu 90:10, 80:20, dan 70:30. Evaluasi performa model menggunakan confusion matrix menunjukkan bahwa skema 90:10 menghasilkan akurasi sebesar 96,74%, skema 80:20 sebesar 97,81%, dan skema 70:30 sebesar 98,18%. Hasil tersebut menunjukkan bahwa metode Naïve Bayes Classifier memiliki performa yang sangat baik dan konsisten dalam mengklasifikasikan tingkat minat baca siswa berdasarkan data kuesioner.
Thyroid Disease Classification Using Support Vector Machine and Recursive Feature Elimination Method Citra Wulandari; lis Afrianty; Elvia Budianita; Siska Kurnia Gusti
bit-Tech Vol. 8 No. 2 (2025): bit-Tech
Publisher : Komunitas Dosen Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32877/bt.v8i2.3454

Abstract

Thyroid disease is a common endocrine disorder that can cause serious metabolic and cardiovascular complications, so accurate early detection is clinically essential. This study proposes a Support Vector Machine (SVM) classifier enhanced with Recursive Feature Elimination (RFE) to select the most informative attributes and Adaptive Synthetic Sampling (ADASYN) to handle class imbalance in a Kaggle thyroid dataset of 3,771 clinical records. The data contain 25 diagnostic attributes with a strongly skewed distribution between healthy and thyroid cases. The model’s robustness was examined using three train–test split ratios. The best configuration, SVM with a Linear kernel and 20 RFE-selected features under an 80:20 split, achieved 98.39% accuracy, with precision, recall, and F1-score all reaching 0.98, indicating consistently strong performance across classes. RFE contributes by removing redundant or weakly relevant variables, helping the classifier construct a more stable and interpretable decision boundary. ADASYN further improves the representation of the minority class, yielding higher recall and F1-score for thyroid cases and reducing the risk of missed diagnoses. Overall, the combined use of feature selection and adaptive oversampling produces a balanced and computationally efficient model for thyroid disease classification. These findings suggest that the proposed approach can support clinical decision-making, reduce diagnostic errors in imbalanced data settings, and strengthen early detection efforts in endocrine health assessment. By offering high sensitivity for thyroid cases while maintaining robust specificity for healthy patients, the model is well suited for integration into clinical decision-support and routine screening workflows.
Klasifikasi Tingkat Keberhasilan Produksi Ayam Broiler di Riau Menggunakan Algoritma K-Nearest Neighbor Beni Basuki; Alwis Nazir; Siska Kurnia Gusti; Lestari Handayani; Iwan Iskandar
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 4 No. 3 (2023): Maret 2023
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/json.v4i3.5800

Abstract

Livestock is a crucial component of the Indonesian agriculture sector. One of the most widely practiced types of livestock farming is broiler chicken farming. The production of broiler chickens continues to increase due to the increasing consumption of broiler chickens. Presently, companies are facing an urgent requirement to support farmers, regardless of their level of experience, whether they are newly entering the sector or have been established for some time. Core companies encounter challenges in modeling the success rate of broiler chicken farmer production because of the vast quantity of data coming from collaborating farmers, which makes it arduous for the company to establish the success rate of broiler chicken production. Establishing the level of production success is very helpful in selecting the appropriate farmers to be guided, thus enabling accurate decision-making. A classification procedure utilizing data mining and K-Nearest Neighbor (KNN) algorithm is necessary to manage the growing volume of data. The study examined 927 livestock production data from Riau, where the data was divided into two sets, with 80% allocated for training and the remaining 20% for testing purposes. The findings of the confusion matrix analysis showed that the optimal result was achieved at k = 3, with an accuracy rate of 86.49%, precision of 75.00%, and recall of 70.21%.
Klasifikasi Sentimen Transformasi dan Reformasi Sepak Bola Indonesia Pada Twitter Menggunakan Algoritma Bernoulli Naïve Bayes Destri Putri Yani; Siska Kurnia Gusti; Febi Yanto; Muhammad Affandes
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 4 No. 3 (2023): Maret 2023
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/json.v4i3.5829

Abstract

Federation Internationale de Football Association (FIFA) carried out Transformations and Reformations to Indonesian Football with one of them Indonesia was chosen as the Host of the U-20 World Cup in 2023. The transformations and reformations carried out cause people to often provide opinions through social media Twitter. Opinions given by the public can be positive or negative. The research uses Text Mining to classify sentiment in 2 categories with the Bernoulli Naïve Bayes algorithm. This research aims to classify positive and negative sentiments and determine the level of accuracy value of the sentiment classification results of Indonesian Football Transformation and Reformation. The research stages carried out are data collection, text preprocessing, data labeling, TF-IDF weighting, Bernoulli Naïve Bayes classification, and evaluation. Based on the research results from 4907 data there is duplicate data and only uses 2125 data which is divided into 90% training data and 10% testing data, so as to get accuracy with a high category value of 88%. The classification results show that many tweets are positive sentiments.
Klasifikasi Sentimen Tragedi Kanjuruhan Pada Twitter Menggunakan Algoritma Naïve Bayes Iqbal Salim Thalib; Siska Kurnia Gusti; Febi Yanto; Muhammad Affandes
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 4 No. 3 (2023): Maret 2023
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/json.v4i3.5852

Abstract

The Kanjuruhan Malang incident occurred on October 1 and resulted in 132 deaths, 96 serious injuries and 484 minor injuries. The cause of the riot occurred due to provocation between Arema Malang supporters and Persebaya Surabaya supporters who mentioned harsh words and other provocative actions that caused anger on both sides. Sentiment analysis of the Kanjuruhan tragedy using the Naive Bayes method was conducted through tweets taken through Twitter to understand the public's perception of the incident. The Naïve Bayes algorithm is performed for the sentiment classification of tweet data which is applied by processing the tweet text and classifying it into positive, negative, and neutral. In this study using data as much as 4843 data and carried out with tweet data that has been crawled resulting in 2,042 data. This research aims to classify sentiment and determine the level of accuracy in the Multinomial Naïve Bayes algorithm in the Kanjuruhan tragedy using a dataset in the form of tweets from twitter social media. The processed tweet data is divided into two types, namely 90% training data and 10% test data.  The results of this classification get a Naïve Bayes accuracy of 75% with a precission of 73%, recall of 75%, and f1-score value of 74%. The results of the tweet data used in this study can be concluded that the Naïve Bayes algorithm has a fairly good accuracy value.
Co-Authors Abdul Wahid Abdullah Abdullah Abdullah, Said Noor Abdussalam Al Masykur Adi Mustofa Al Rasyid, Nabila Alfaiza, Raihan Zia Alfin Hernandes Alwaliyanto Alwis Nazir Alwis Nazir Alwis Nazir Amelia, Felina Anggi Vasella Azhima, Mohd Baehaqi Beni Basuki Citra Wulandari Cut Lira Kabaatun Nisa Destri Putri Yani Devi Julisca Sari Dina Septiawati Dinyah Fithara efni humairah Eka Pandu Cynthia Eka Pandu Cynthia Elin Haerani Elin Haerani Elin Haerani Elin Haerani Elvia Budianita Erni Rouza, Erni Fadhilah Syafria Faska, Ridho Mahardika Febi Yanto Fitri Insani Fitri Insani Fitri Wulandari Fitri, Anisa Gusti, Gogor Putra Hafi Puja Hanif, Wan Muhammad Iis Afrianty Iis Afrianty Iqbal Salim Thalib Irsyad (Scopus ID: 57204261647), Muhammad Iwan Iskandar Jasril Jasril Jasril Jasril Khair, Nada Tsawaabul Kurniansyah, Juliandi Lestari Handayani lis Afrianty M Wandi Dwi Wirawan Maemonah, Maemonah Morina Lisa Pura Muhammad Affandes Muhammad Affandes Muhammad Fauzan Muhammad Fikry Muhammad Hafiz Muhammad Irsyad Muhammad Khairy Dzaky Muhammad Rifaldo Al Magribi Nazir, Alwis Norhiza, Fitra Lestari Novriyanto Novriyanto Nurul Ikhsan Okfalisa Okfalisa Pizaini Pizaini Prima Yohana Rahmah Miya Juwita Raja Indra Ramoza Ramadhani, Astrid Risfi Ayu Sandika Robbi Nanda Robby Azhar Sardi, Hajra Satria Bumartaduri Sayyid Muhammad Habib Siti Ramadhani Siti Ramadhani Siti Ramadhani Surya Agustian Suwanto Sanjaya Syafira, Fadhilah Syafria, Fadhillah Syahbudin Hamwar Syaputra, Muhammad Dwiky Umam, Isnaini Hadiyul Vitriani, Yelfi Vusuvangat, Imam Wulandari, Fitri Yayuk Wulandari Yelfi Yelfi Yola, Melfa Yusra Yusra, - Yusra, Yusra