Claim Missing Document
Check
Articles

Found 16 Documents
Search

Using genetic algorithm feature selection to optimize XGBoost performance in Australian credit Pertiwi, Dwika Ananda Agustina; Ahmad, Kamilah; Salahudin, Shahrul Nizam; Annegrat, Ahmed Mohamed; Muslim, Much Aziz
Journal of Soft Computing Exploration Vol. 5 No. 1 (2024): March 2024
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joscex.v5i1.302

Abstract

To reduce credit risk in credit institutions, credit risk management practices need to be implemented so that lending institutions can survive in the long term. Data mining is one of the techniques used for credit risk management. Where data mining can find information patterns from big data using classification techniques with the resulting level of accuracy. This research aims to increase the accuracy of classification algorithms in predicting credit risk by applying genetic algorithms as the best feature selection method. Thus, the most important feature will be used to search for credit risk information. This research applies a classification method using the XGBoost classifier on the Australian credit dataset, then carries out an evaluation by measuring the level of accuracy and AUC. The results show an increase in accuracy of 2.24%, with an accuracy value of 89.93% after optimization using a genetic algorithm. So, through research on genetic algorithm feature selection, we can improve the accuracy performance of the XGBoost algorithm on the Australian credit dataset.
A new CNN model integrated in onion and garlic sorting robot to improve classification accuracy Lestari, Apri Dwi; Khan, Atta Ullah; Pertiwi, Dwika Ananda Agustina; Muslim, Much Aziz
Journal of Soft Computing Exploration Vol. 5 No. 1 (2024): March 2024
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joscex.v5i1.304

Abstract

The profit share of the vegetable market, which is quite large in the agricultural industry, needs to be equipped with the ability to classify types of vegetables quickly and accurately. Some vegetables have a similar shape, such as onions and garlic, which can lead to misidentification of these types of vegetables. Through the use of computer vision and machine learning, vegetables, especially onions, can be classified based on the characteristics of shape, size, and color. In classifying shallot and garlic images, the CNN model was developed using 4 convolutional layers, with each layer having a kernel matrix of 2x2 and a total of 914,242 train parameters. The activation function on the convolutional layer uses ReLu and the activation function on the output layer is softmax. Model accuracy on training data is 0.9833 with a loss value of 0.762.
Comparison of gridsearchcv and bayesian hyperparameter optimization in random forest algorithm for diabetes prediction Muzayanah, Rini; Pertiwi, Dwika Ananda Agustina; Ali, Muazam; Muslim, Much Aziz
Journal of Soft Computing Exploration Vol. 5 No. 1 (2024): March 2024
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joscex.v5i1.308

Abstract

Diabetes Mellitus (DM) is a chronic disease whose complications have a significant impact on patients and the wider community. In its early stages, diabetes mellitus usually does not cause significant symptoms, but if it is detected too late and not handled properly, it can cause serious health problems. To overcome these problems, diabetes detection is one of the solutions used. In this research, diabetes detection was carried out using Random Forest with gridsearchcv and bayesian hyperparameter optimization. The research was carried out through the stages of study literature, model development using Kaggle Notebook, model testing, and results analysis. This study aims to compare GridSearchCV and Bayesian hyperparameter optimizations, then analyze the advantages and disadvantages of each optimization when applied to diabetes prediction using the Random Forest algorithm. From the research conducted, it was found that GridSearchCV and Bayesian hyperparameter optimization have their own advantages and disadvantages. The GridSearchCV hyperparameter excels in terms of accuracy of 0.74, although it takes longer for 338,416 seconds. On the other hand, Bayesian hyperparameter optimization has a lower accuracy rate than GridSearchCV optimization with a difference of 0.01, which is 0.73 and takes less time than GridSearchCV for 177,085 seconds.
Extreme Gradient Boosting Model with SMOTE for Heart Disease Classification Dullah, Ahmad Ubai; Darmawan, Aditya Yoga; Pertiwi, Dwika Ananda Agustina; Unjung, Jumanto
JISKA (Jurnal Informatika Sunan Kalijaga) Vol. 10 No. 1 (2025): January 2025
Publisher : UIN Sunan Kalijaga Yogyakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.14421/jiska.2025.10.1.48-62

Abstract

Heart disease is one of the leading causes of death worldwide. According to data from the World Health Organisation (WHO), the number of victims who die from heart disease reaches 17.5 million people every year. However, the method of diagnosing heart disease in patients is still not optimal in determining the right treatment. Along with technology development, various models of machine learning algorithms and data processing techniques have been developed to find models that can produce the best precision in classifying heart disease. This research aims to develop a machine learning algorithm model in classifying heart disease to improve the effectiveness of diagnosis and help in determining the right treatment for patients. This research also aims to overcome the limitations of accuracy in existing diagnosis methods by identifying models capable of providing the best results in processing and analysing health data, especially in terms of heart disease classification. In this study, the XGBoost model was identified as the most superior, with an accuracy of 99%. These results show that the XGBoost model has a higher accuracy rate than previous methods, making it a promising solution to improve the accuracy of future heart disease diagnosis and classification.
Operational Supply Chain Risk Management on Apparel Industry Based on Supply Chain Operation Reference (SCOR) Pertiwi, Dwika Ananda Agustina; Yusuf, Muhammad; Efrilianda, Devi Ajeng
Journal of Information System Exploration and Research Vol. 1 No. 1 (2023): January 2023
Publisher : shmpublisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joiser.v1i1.103

Abstract

The occurrence of uncertainty requires proper handling to avoid the adverse effects called risk. Risk tends to arise in the supply chain process called supply chain risk. The purpose of this research is to identify the possible level of risk that occurs and has the potential to disrupt supply chain activities, determine priority risk sources based on Supply Chain Operation References (SCOR). The object of this research is the apparel industry, which is a company engaged in fashion and apparel production. This study uses a qualitative and quantitative approach, the value of the instrument is assessed based on the results of the Aggregate Risk Potential (ARP) calculation in the House of Risk method phase 1.  The results showed that there were 39 correlations between risk events and risk agents, with 22 correlations with a high scale and 1 correlation with a low scale, and 15 correlations on a medium scale.
Sentiment analysis spotify applications on google play store with naïve bayes and neural network methods Syahra, Syahra Audiyani Fitra; Pertiwi, Dwika Ananda Agustina
Journal of Student Research Exploration Vol. 3 No. 2 (2025): July 2025
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/josre.v3i2.416

Abstract

Digital advancements have significantly changed the way music is accessed and enjoyed, with streaming platforms such as Spotify emerging as one of the most widely used applications worldwide. Along with this growth, user reviews on platforms like the Google Play Store have become an important source of information, offering insights into user satisfaction and areas for improvement. In this study, sentiment analysis was conducted on Spotify reviews using two classification methods, Naïve Bayes and Neural Networks. The reviews were collected, processed, and then analyzed with both approaches to evaluate their performance. The results show that Neural Networks outperformed in terms of accuracy, F1-score, and recall, while Naïve Bayes performed better in AUC, precision, and MCC. Analysis of the dataset also revealed that negative reviews dominated at 52.8%, followed by positive at 28.3%, and neutral at 19%. These findings highlight the value of sentiment analysis in understanding user perspectives and can support developers in improving application quality and user experience.