Claim Missing Document
Check
Articles

Found 4 Documents
Search

Enhancing uncollateralized loan risk assessment accuracy through feature selection and advanced machine learning techniques Salahudin, Shahrul Nizam; Dasril, Yosza; Arisandy, Yosy
Indonesian Journal of Electrical Engineering and Computer Science Vol 38, No 2: May 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v38.i2.pp1149-1161

Abstract

Accuracy in evaluating the risk of credit applications is crucial for lenders, particularly when dealing with unsecured loans. Accuracy can be enhanced by selecting suitable features for a machine learning model. To better identify high-risk borrowers, this study applies an elaborate feature selection technique. This study uses the light gradient boosting machine (LGBM) Classifier model with boosting type gradient boosting decision tree (GBDT) algorithm and n_estimator value 100 for feature selection process. This work uses advanced machine learning techniques namely stacking to improve accuracy model perform. The dataset consists of 307,506 applicants from European lenders who have applied for loans in Southeast Asia. Each applicant is described by 126 different features. Using GDBT algorithm GBDT, 30 best features were selected based on their maximum accuracy compared to another feature. By employing a stacking technique that combines the LGBM, gradient boosting (GB), and random forest (RF) models, and utilizing logistic regression (LR) as the final estimator, an accuracy of 0.99637 was reached. This study demonstrates an improved the accuracy compared to previous research. This discovery indicates that utilizing feature selection and stacking method can provide one of the most precise choices for modelling the binary class classification among the current models.
Accuracy of Malaysia Public Response to Economic Factors During the Covid-19 Pandemic Using Vader and Random Forest Jumanto, Jumanto; Muslim, Much Aziz; Dasril, Yosza; Mustaqim, Tanzilal
Journal of Information System Exploration and Research Vol. 1 No. 1 (2023): January 2023
Publisher : shmpublisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joiser.v1i1.104

Abstract

This study conducted a sentiment analysis of the impact of the Covid-19 pandemic in the economic sector on people's lives through social media Twitter. The analysis was carried out on 23,777 tweet data collected from 13 states in Malaysia from 1 December 2019 to 17 June 2020. The research process went through 3 stages, namely pre-processing, labeling, and modeling. The pre-processing stage is collecting and cleaning data. Labeling in this study uses Vader sentiment polarity detection to provide an assessment of the sentiment of tweet data which is used as training data. The modeling stage means to test the sentiment data using the random forest algorithm plus the extraction count vectorizer and TF-IDF features as well as the N-gram selection feature. The test results show that the polarity of public sentiment in Malaysia is predominantly positive, which is 11,323 positive, 4105 neutral, and 8349 negative based on Vader labeling. The accuracy rate from the random forest modeling results was obtained 93.5 percent with TF-IDF and 1 gram.
Portfolio Selection Strategies in Bursa Malaysia Based on Quadratic Programming Ling, Liang Pei; Dasril, Yosza
Journal of Information System Exploration and Research Vol. 1 No. 2 (2023): July 2023
Publisher : shmpublisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joiser.v1i2.178

Abstract

The study aims to select the efficient portfolio on stock listed in Bursa Malaysia by using the quadratic programming method. It can help the investors to gain expected returns from the diversification portfolio. However, there are some problems that should be considered such as the measurement of inputs for Mean-Variance Models (MVM), use of portfolio models through time and consistency with management objectives in the portfolio. These problems will affect the performance of selected portfolio and cause the loss problem. Therefore, this study implements a quadratic programming approach to select an efficient portfolio on stocks listed in Bursa Malaysia. The study will choose 15 potential companies which have the best performance in the Bursa Malaysia. Quadratic programming (QP) model can solve any type of mathematical optimisation problem in the study. Therefore, investors can optimise the investment portfolio returns by using QP methods. However, we can observe the efficient frontier which is a graph that representing a list of portfolios that optimising expected return for a different level of portfolio risk so can help the investors make a good decision. The findings of this study will give important inputs, especially to the investors to maximise their portfolio return at different level of risks.
Credit Risk Assessment in P2P Lending Using LightGBM and Particle Swarm Optimization Dasril, Yosza; Muslim, Much Aziz; Hakim, M. Faris Al; Jumanto , Jumanto; Prasetiyo, Budi
Register: Jurnal Ilmiah Teknologi Sistem Informasi Vol 9 No 1 (2023): January
Publisher : Information Systems - Universitas Pesantren Tinggi Darul Ulum

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26594/register.v9i1.3060

Abstract

The credit risk evaluation is a vital task in the P2P Lending platform. An effective credit risk assessment method in a P2P lending platform can significantly influence investors' decisions. The machine learning algorithm that can be used to evaluate credit risk as LightGBM, however, the results in evaluating P2P lending need to be improved. The aim of this research is to improve the accuracy of the LightGBM algorithm by combining the Particle Swarm Optimization (PSO) algorithm. The novelty developed in this research is combining LightGBM with PSO for large data from the Lending Club Dataset which can be accessed on Kaggle.com. The highest accuracy also presented satisfactory results with 98.094% of accuracy, 90.514% of Recall, and 97.754% of NPV respectively. The combination of LightGBM and PSO shows better results.