Sarah Rosdiana Tambunan
Unknown Affiliation

Published : 4 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Journal of Information Systems and Informatics

Enhancing Credit Risk Classification Using LightGBM with Deep Feature Synthesis Tambunan, Sarah Rosdiana; Amalia, Junita; Sitorus, Kristina Margaret; Sibuea, Yehezchiel Abed Rafles; Hutabarat, Lucas Ronaldi
Journal of Information System and Informatics Vol 6 No 4 (2024): December
Publisher : Universitas Bina Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51519/journalisi.v6i4.902

Abstract

In the digital financial services era, Peer-to-Peer (P2P) lending has emerged as a significant innovation in fintech. However, credit risk remains a major concern due to the potential for payment failures, which can cause losses for platforms and investors. This research explores the impact of Deep Feature Synthesis (DFS) on credit risk classification and evaluates the performance of the Light Gradient Boosting Machine (LightGBM) algorithm with and without DFS. The data used in this study was sourced from Kaggle, a peer-to-peer lending company based in San Francisco, California, United States. The dataset contains 74 attributes, with a total of 887,379 rows. DFS automatically generates new attributes, while LightGBM is used for selecting the most important features, aiming to optimize credit risk predictions and simplify the model's complexity. The results of credit risk classification models using DFS and without it. Findings reveal that DFS enhances the accuracy of the credit risk classification, achieving a 0.99 accuracy rate compared to 0.97 without DFS, achieving a recall and F1-score of 0.94 and 0.96 with DFS and 0.68 and 0.81 without DFS. These results suggest that DFS is an effective feature engineering technique for boosting credit risk model performance. This research contributes significantly to the P2P lending industry by demonstrating that combining DFS with LightGBM can improve credit risk management, making it a valuable approach for financial platforms.
Enhancing Hate Speech Detection: Leveraging Emoji Preprocessing with BI-LSTM Model Amalia, Junita; Tambunan, Sarah Rosdiana; Purba, Susi Eva Maria; Simanjuntak, Walker Valentinus
Journal of Information System and Informatics Vol 7 No 2 (2025): June
Publisher : Universitas Bina Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51519/journalisi.v7i2.1147

Abstract

Microblogging platforms like Twitter enable users to rapidly share opinions, information, and viewpoints. However, the vast volume of daily user-generated content poses challenges in ensuring the platform remains safe and inclusive. One key concern is the prevalence of hate speech, which must be addressed to foster a respectful and open environment. This study explores the effectiveness of the Emoji Description Method (EMJ DESC), which enhances tweet classification by converting emojis into descriptive text or sentences. These descriptions are then encoded into numerical vector matrices that capture the meaning and emotional tone of each emoji. Integrated into a basic text classification model, these vectors help improve detection performance. The research examines how different emoji preprocessing strategies affect the performance of a BI-LSTM model for hate speech classification. Results show that removing emojis significantly reduces accuracy (68%) and weakens the model’s ability to distinguish between hate and non-hate speech, due to the loss of valuable semantic context. In contrast, retaining emoji semantics either through textual descriptions or embeddings boosts classification accuracy to 93% and 94%, respectively. The highest performance is achieved through emoji embedding, highlighting its ability to capture subtle non-verbal cues critically for accurate hate speech detection. Overall, the findings emphasize the importance of incorporating emoji-aware preprocessing techniques to enhance the effectiveness of social media content classification.