Claim Missing Document
Check
Articles

Found 9 Documents
Search
Journal : Journal of Electronics, Electromedical Engineering, and Medical Informatics

Implementation of Information Gain Ratio and Particle Swarm Optimization in the Sentiment Analysis Classification of Covid-19 Vaccine Using Support Vector Machine Muhamad Fawwaz Akbar; Muhammad Itqan Mazdadi; Muliadi; Triando Hamonangan Saragih; Friska Abadi
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 5 No 4 (2023): October
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v5i4.328

Abstract

In the current digital era, sentiment analysis has become an effective method for identifying and interpreting public opinions on various topics, including public health issues such as COVID-19 vaccination. Vaccination is a crucial measure in tackling this pandemic, but there are still a number of people who are skeptical and reluctant to receive the COVID-19 vaccine. This public perception is largely influenced by, including information received from social media and online platforms. Therefore, sentiment analysis of the COVID-19 vaccine is one way to understand the public's perception of the COVID-19 vaccine. This research has the purpose to enhance the classification performance in sentiment analysis of COVID-19 vaccines by implementing Information Gain Ratio (IGR) and Particle Swarm Optimization (PSO) on the Support Vector Machine (SVM). With a dataset of 2000 entries consisting of 1000 positive labels and 1000 negative labels, validation was performed through a combination of data splitting with an 80:20 ratio and stratified 10-Fold cross-validation. Applying the basic SVM, an accuracy of 0.794 and an AUC value of 0.890 were obtained. Integration with Information Gain Ratio (IGR) feature selection improved the accuracy to 0.814 and an AUC of 0.907. Furthermore, through the combination of SVM based on PSO and IGR, the accuracy significantly improved to 0.837 with an AUC of 0.913. These results demonstrate that the combination of feature selection techniques and parameter optimization can enhance the performance of sentiment classification towards COVID-19 vaccines. The conclusions drawn from this research indicate that the integration of IGR and PSO positively contributes to the effectiveness and predictive capability of the SVM model in sentiment classification tasks.
Feature Selection Using Firefly Algorithm With Tree-Based Classification In Software Defect Prediction Maulida, Vina; Herteno, Rudy; Kartini, Dwi; Abadi, Friska; Faisal, Mohammad Reza
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 5 No 4 (2023): October
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v5i4.315

Abstract

Defects that occur in software products are a universal occurrence. Software defect prediction is usually carried out to determine the performance, accuracy, precision and performance of the prediction model or method used in research, using various kinds of datasets. Software defect prediction is one of the Software Engineering studies that is of great concern to researchers. This research was conducted to determine the performance of tree-based classification algorithms including Decision Trees, Random Forests and Deep Forests without using feature selection and using firefly feature selection. And also know the tree-based classification algorithm with firefly feature selection which can provide better software defect prediction performance. The dataset used in this study is the ReLink dataset which consists of Apache, Safe and Zxing. Then the data is divided into testing data and training data with 10-fold cross validation. Then feature selection is performed using the Firefly Algorithm. Each ReLink dataset will be processed by each tree-based classification algorithm, namely Decision Tree, Random Forest and Deep Forest according to the results of the firefly feature selection. Performance evaluation uses the AUC value (Area under the ROC Curve). Research was conducted using google collab and the average AUC value generated by Firefly-Decision Tree is 0.66, the average AUC value generated by Firefly-Random Forest is 0.77, and the average AUC value generated by Firefly-Deep Forest is 0, 76. The results of this study indicate that the approach using the Firefly algorithm with Random Forest classification can work better in predicting software damage compared to other tree-based algorithms. In previous studies, tree-based classification with hyperparameter tuning on software defect prediction datasets obtained quite good results. In another study, the classification performance of SVM, Naïve Bayes and K-nearest neighbor with firefly feature selection resulted in improved performance. Therefore, this research was conducted to determine the performance of a tree-based algorithm using the firefly selection feature.
LSTM and Bi-LSTM Models For Identifying Natural Disasters Reports From Social Media Yunida, Rahmi; Faisal, Mohammad Reza; Muliadi; Indriani, Fatma; Abadi, Friska; Budiman, Irwan; Prastya, Septyan Eka
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 5 No 4 (2023): October
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v5i4.319

Abstract

Natural disaster events are occurrences that cause significant losses, primarily resulting in environmental and property damage and in the worst cases, even loss of life. In some cases of natural disasters, social media has been utilized as the fastest information bridge to inform many people, especially through platforms like Twitter. To provide accurate categorization of information, the field of text mining can be leveraged. This study implements a combination of the word2vec and LSTM methods and the combination of word2vec and Bi-LSTM to determine which method is the most accurate for use in the case study of news related to disaster events. The utility of word2vec lies in its feature extraction method, transforming textual data into vector form for processing in the classification stage. On the other hand, the LSTM and Bi-LSTM methods are used as classification techniques to categorize the vectorized data resulting from the extraction process. The experimental results show an accuracy of 70.67% for the combination of word2vec and LSTM and an accuracy of 72.17% for the combination of word2vec and Bi-LSTM. This indicates an improvement of 1.5% achieved by combining the word2vec and Bi-LSTM methods. This research is significant in identifying the comparative performance of each combination method, word2vec + LSTM and word2vec + Bi-LSTM, to determine the best-performing combination in the process of classifying data related to earthquake natural disasters. The study also offers insights into various parameters present in the word2vec, LSTM, and Bi-LSTM methods that researchers can determine.
Implementation of Monarch Butterfly Optimization for Feature Selection in Coronary Artery Disease Classification Using Gradient Boosting Decision Tree Siti Napi'ah; Triando Hamonangan Saragih; Dodon Turianto Nugrahadi; Dwi Kartini; Friska Abadi
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 5 No 4 (2023): October
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v5i4.331

Abstract

Coronary artery disease, a prevalent type of cardiovascular disease, is a significant contributor to premature mortality globally. Employing the classification of coronary artery disease as an early detection measure can have a substantial impact on reducing death rates caused by this ailment. To investigate this, the Z-Alizadeh dataset, consisting of clinical data from patients afflicted with coronary artery disease, was utilized, encompassing a total of 303 data points that comprise 55 predictive attribute features and 1 target attribute feature. For the purpose of classification, the Gradient Boosting Decision Tree (GBDT) algorithm was chosen, and in addition, a metaheuristic algorithm called monarch butterfly optimization (MBO) was implemented to diminish the number of features. The objective of this study is to compare the performance of GBDT before and after the application of MBO for feature selection. The evaluation of the study's findings involved the utilization of a confusion matrix and the calculation of the area under the curve (AUC). The outcomes demonstrated that GBDT initially attained an accuracy rate of 87.46%, a precision of 83.85%, a recall of 70.37%, and an AUC of 82.09%. Subsequent to the implementation of MBO, the performance of GBDT improved to an accuracy of 90.26%, a precision of 86.82%, a recall of 80.79%, and an AUC of 87.33% with the selection of 31 features. This improvement in performance leads to the conclusion that MBO effectively addresses the feature selection issue within this particular context.
An Approach to ECG-based Gender Recognition Using Random Forest Algorithm Arif, Nuuruddin Hamid; Faisal, Mohammad Reza; Farmadi, Andi; Nugrahadi, Dodon; Abadi, Friska; Ahmad, Umar Ali
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 6 No 2 (2024): April
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v6i2.363

Abstract

Human-Computer Interaction (HCI) has witnessed rapid advancements in signal processing research within the health domain, particularly in signal analyses like electrocardiogram (ECG), electromyogram (EMG), and electroencephalogram (EEG). ECG, containing diverse information about medical history, identity, emotional state, age, and gender, has exhibited potential for biometric recognition. The Random Forest method proves essential to facilitate gender classification based on ECG. This research delves into applying the Random Forest method for gender classification, utilizing ECG data from the ECG ID Database. The primary aim is to assess the efficacy of the Random Forest algorithm in gender classification. The dataset employed in this study comprises 10,000 features, encompassing both raw and filtered datasets, evaluated through 10-fold cross-validation with Random Forest Classification. Results reveal the highest accuracy for raw data at 55.000%, with sensitivity at 46.452% and specificity at 63.548%. In contrast, the filtered data achieved the highest accuracy of 65.806%, with sensitivity and specificity at 67.097%. These findings conclude that the most significant impact on gender classification in this study lies in the low sensitivity value in raw data. The implications of this research contribute to knowledge by presenting the performance results of the Random Forest algorithm in ECG-based gender classification.
Gender Classification on Social Media Messages Using fastText Feature Extraction and Long Short-Term Memory Sa’diah, Halimatus; Faisal, Mohammad Reza; Farmadi, Andi; Abadi, Friska; Indriani, Fatma; Alkaff, Muhammad; Abdullayev, Vugar
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 6 No 3 (2024): July
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v6i3.407

Abstract

Currently, social media is used as a platform for interacting with many people and has also become a source of information for social media researchers or analysts. Twitter is one of the platforms commonly used for research purposes, especially for data from tweets written by individuals. However, on Twitter, user information such as gender is not explicitly displayed in the account profile, yet there is a plethora of unstructured information containing such data, often unnoticed. This research aims to classify gender based on tweet data and account description data and determine the accuracy of gender classification using machine learning methods. The method used involves FastText as a feature extraction method and LSTM as a classification method based on the extracted data, while to achieve the most accurate results, classification is performed on tweet data, account description data, and a combination of both. This research shows that LSTM classification on account description data and combined data obtained an accuracy of 70%, while tweet data classification achieved 69%. This research concludes that FastText feature extraction with LSTM classification can be implemented for gender classification. However, there is no significant difference in accuracy results for each dataset. However, this research demonstrates that both methods can work well together and yield optimal results.
Impact of a Synthetic Data Vault for Imbalanced Class in Cross-Project Defect Prediction Putri Nabella; Rudy Herteno; Setyo Wahyu Saputro; Mohammad Reza Faisal; Friska Abadi
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 6 No 2 (2024): April
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v6i2.409

Abstract

Software Defect Prediction (SDP) is crucial for ensuring software quality. However, class imbalance (CI) poses a significant challenge in predictive modeling. This study delves into the effectiveness of the Synthetic Data Vault (SDV) in mitigating CI within Cross-Project Defect Prediction (CPDP). Methodologically, the study addresses CI across ReLink, MDP, and PROMISE datasets by leveraging SDV to augment minority classes. Classification utilizing Decision Tree (DT), Logistic Regression (LR), K-Nearest Neighbors (KNN), Naive Bayes (NB), and Random Forest (RF), also model performance is evaluated using AUC and t-Test. The results consistently show that SDV performs better than SMOTE and other techniques in various projects. This superiority is evident through statistically significant improvements. KNN dominance in average AUC results, with values 0.695, 0.704, and 0.750. On ReLink, KNN show 16.06% improvement over the imbalanced and 12.84% over SMOTE. Similarly, on MDP, KNN 20.71% improvement over the imbalanced and a 10.16% over SMOTE. Moreover, on PROMISE, KNN 13.55% improvement over the imbalanced and 7.01% over SMOTE. RF displays moderate performance, closely followed by LR and DT, while NB lags behind. The statistical significance of these findings is confirmed by t-Test, all below the 0.05 threshold. These findings underscore SDV's potential in enhancing CPDP outcomes and tackling CI challenges in SDV. With KNN as the best classification algorithm. Adoption of SDV could prove to be a promising tool for enhancing defect detection and CI mitigation
Optimization of Backward Elimination for Software Defect Prediction with Correlation Coefficient Filter Method Muhammad Noor; Radityo Adi Nugroho; Setyo Wahyu Saputro; Rudy Herteno; Friska Abadi
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 6 No 4 (2024): October
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v6i4.466

Abstract

Detecting software defects is a crucial step for software development not only to reduce cost and save time, but also to mitigate more costly losses. Backward Elimination is one method for detecting software defects. Notably Backward Elimination may remove features that may later become significant to the outcome affecting the performance of Backward Elimination. The aim of this study is to improve Backward Elimination performance. In this study, several features were selected based on their correlation coefficient, with the selected feature applied to improve Backward Elimination final model performance. The final model was validated using cross validation with Naïve Bayes as the classification method on the NASA MDP dataset to determine the accuracy and Area Under the Curve (AUC) of the final model. Using top 10 correlation feature and Backward Elimination achieve an average result of 86.6% accuracy and 0.797 AUC, while using top 20 correlation feature and Backward Elimination achieved an average result of 84% accuracy and 0.812 AUC. Compare to using Backward Elimination and Naïve Bayes respectively the improvement using top 10 correlation feature as follows: AUC:1.52%, 13.53% and Accuracy: 13%, 12.4% while the improvement using top 20 correlation feature as follows: AUC:3.43%, 15.66% and Accuracy: 10.4%, 9.8%. Results showed that selecting the top 10 and top 20 feature based on its correlation before using Backward Elimination have better result than only using Backward Elimination. This result shows that combining Backward Elimination with correlation coefficient feature selection does improve Backward Elimination’s final model and yielding good results for detecting software defects.
Optimizing Input Window Length and Feature Requirements for Machine Learning-Based Postprandial Hyperglycemia Prediction Maulana, Muhammad Rafly Alfarizqy; Indriani, Fatma; Abadi, Friska; Kartini, Dwi; Mazdadi, Muhammad Itqan
Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol 8 No 1 (2026): January
Publisher : Department of Electromedical Engineering, POLTEKKES KEMENKES SURABAYA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/jeeemi.v8i1.1401

Abstract

Continuous glucose monitoring systems currently generate alerts only after blood glucose thresholds are breached, limiting their utility for proactive diabetes management. Predicting postprandial glucose excursions before they occur requires determining the optimal amount of historical data and identifying which features contribute most to prediction accuracy. This study systematically evaluates how the length of the pre-meal observation window and feature composition affect machine-learning predictions of hyperglycemia events 60 minutes after eating. We analyzed 1,642 meal events from 45 adults wearing continuous glucose sensors, constructing features from pre-meal glucose trajectories, meal macronutrients, time of day, and health status. Four observation windows (15, 30, 45, 60 minutes) and three feature sets (all features, glucose-only, meal-only) were evaluated using Random Forest, XGBoost, and CatBoost with 5-fold group cross-validation. CatBoost with a 30-minute window achieved the best performance: 72.6% F1-macro, 79.6% accuracy, and 64.0% recall for hyperglycemia detection. Extending windows beyond 30 minutes did not yield consistent benefits, whereas 15-minute windows yielded comparable results. Glucose trajectory features alone retained 94% of full model performance (68.5% F1-macro), whereas meal composition alone proved insufficient (59.4% F1-macro). These findings demonstrate that recent glucose history dominates short-term prediction, enabling practical real-time systems with minimal data requirements. A 30-minute observation window with glucose and meal features offers an effective balance between prediction accuracy and system responsiveness.
Co-Authors A.A. Ketut Agung Cahyawan W AA Sudharmawan, AA Abdullayev, Vugar Achmad Zainudin Nur Adi Mu'Ammar, Rifqi Aflaha, Rahmina Ulfah Ahmad Juhdi Alfando, Muhammad Alvin Amalia, Raisa Andi Farmadi Andi Farmandi Arif, Nuuruddin Hamid Athavale, Vijay Anant budiman, irwan Deni Kurnia Dodon Turianto Nugrahadi Dwi Kartini Dwi Kartini, Dwi Emma Andini Faisal, Mohammad Reza Fathmah, Siti Fatma Indriani Fauzan Luthfi, Achmad Febrian, Muhamad Michael Halimah Halimah Halimah Herteno, Rudy Herteno, Rudy Indriani, Fatma Irwan Budiman Irwan Budiman Itqan Mazdadi, Muhammad Kartika, Najla Putri M Kevin Warendra Mafazy, Muhammad Meftah Martalisa, Asri Maulana, Muhammad Rafly Alfarizqy Mera Kartika Delimayanti Muhamad Fawwaz Akbar Muhammad Alkaff Muhammad Azmi Adhani Muhammad Denny Ersyadi Rahman Muhammad Fikri Muhammad Haekal Muhammad Itqan Mazdadi Muhammad Khairin Nahwan Muhammad Mirza Hafiz Yudianto Muhammad Nazar Gunawan Muhammad Noor Muhammad Reza Faisal, Muhammad Reza Muhammad Sholih Afif Muliadi Muliadi Muliadi Aziz Muliadi Muliadi Nabella, Putri Nor Indrani Nugrahadi, Dodon Nurlatifah Amini Nursyifa Azizah Prastya, Septyan Eka Pratama, Muhammad Yoga Adha Putri Nabella Raditya, Virgi Atha Radityo Adi Nugroho Rahman Hadi Rahman Rahmat Ramadhani Rahmayanti Rahmayanti Ramadhan, Muhammad Rizky Aulia Reina Alya Rahma Rinaldi Riza Susanto Banner Rizal, Muhammad Nur Rizky Ananda, Muhammad Rizky, Muhammad Hevny Rudy Herteno SALLY LUTFIANI Saputro, Setyo Wahyu Saputro, Setyo Wahyu Saragih, Triando Hamonangan Sarah Monika Nooralifa Sa’diah, Halimatus Septyan Eka Prastya Setyo Wahyu Saputro Siti Napi'ah Tri Mulyani Ulya, Azizatul Umar Ali Ahmad Vina Maulida, Vina Wahyu Dwi Styadi Wahyu Saputro, Setyo Yunida, Rahmi