Claim Missing Document
Check
Articles

Emoji-Based Sentiment Classification Using Ensemble Learning with Cross-Validation: A Lightweight Approach for Social Media Analysis: Klasifikasi Sentimen Berbasis Emoji Menggunakan Ensemble Learning dengan Validasi Silang: Pendekatan Ringan untuk Analisis Media Sosial Alamsyah, Nur; Bayu Wibisono, Gunthur; Parama Yoga, Titan; Budiman; Hendra, Acep
NUANSA INFORMATIKA Vol. 19 No. 2 (2025): Nuansa Informatika 19.2 Juli 2025
Publisher : FKOM UNIKU

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25134/ilkom.v19i2.396

Abstract

The increasing use of emojis in online communication reflects emotional expression that is often more immediate and intuitive than text. This study proposes a lightweight sentiment classification approach that utilizes only emoji features extracted from social media posts, without relying on textual content. The importance of this research lies in its relevance to short-form digital content, where textual sentiment cues are minimal or absent. To address the classification problem, we implement and compare multiple machine learning models including Random Forest (RF), Support Vector Machine, and an ensemble Voting Classifier combining both. Emoji tokens were vectorized using character-level count vectorization, and performance was evaluated using 5-fold cross-validation to ensure robustness and generalizability. Results show that the ensemble model achieved the highest average accuracy of 93.6%, outperforming the individual classifiers. These findings confirm that emojis alone can serve as reliable indicators of sentiment and support the deployment of fast, interpretable, and scalable models for social media sentiment analysis.
A Bidirectional GRU Approach with Hyperparameter Optimization for Sentiment Classification in Game Reviews : Pendekatan GRU Dua Arah dengan Optimasi Hiperparameter untuk Klasifikasi Sentimen dalam Ulasan Game Alamsyah, Nur; Titan Parama Yoga; Budiman; Imannudin Akbar; Hendra, Acep; Januantara Prima, Alif
NUANSA INFORMATIKA Vol. 19 No. 2 (2025): Nuansa Informatika 19.2 Juli 2025
Publisher : FKOM UNIKU

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25134/ilkom.v19i2.399

Abstract

Sentiment analysis plays a vital role in understanding user perspectives, especially in domains such as game reviews where user feedback influences product perception and engagement. This study presents a comparative approach using Gated Recurrent Unit (GRU), hyperparameter-tuned GRU, and Bidirectional GRU models to classify sentiments in a dataset of game reviews. The experiment begins with standard preprocessing and tokenization steps, followed by vectorization and supervised training. Hyperparameter optimization is conducted using Keras Tuner to identify the most effective configuration of embedding dimensions, GRU units, dropout rates, and learning rates. The best model, a Bidirectional GRU with tuned parameters, achieves a validation accuracy of 85.37% and shows superior performance across key metrics such as precision, recall, and F1-score. Despite the relatively small and imbalanced dataset, the Bidirectional GRU model demonstrates robust generalization. This study also highlights future directions, including class balancing techniques and the integration of pretrained word embeddings to further improve model performance.
A Data-Driven Approach to Comparative Evaluation of Regression Models for Accurate House Price Prediction: Pendekatan Berbasis Data untuk Evaluasi Komparatif Model Regresi untuk Prediksi Harga Rumah yang Akurat Permata Hati, Tiara; Budiman, Budiman; Akbar, Imannudin; Alamsyah, Nur
NUANSA INFORMATIKA Vol. 19 No. 2 (2025): Nuansa Informatika 19.2 Juli 2025
Publisher : FKOM UNIKU

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25134/ilkom.v19i2.411

Abstract

This study aims to develop and evaluate a property price prediction model in Bandung by applying machine learning (ML) algorithms. The need for more accurate property price predictions is increasing due to fluctuations in the property market. This study analyzes property characteristics, including the number of bedrooms, bathrooms, land area, building area, and location, as well as their impact on house prices. The study evaluates four regression algorithms, including linear regression, K-Nearest Neighbors (KNN), Random Forest, and XGBoost. Finally, this study proposes price_per_m2 and building_land_ratio as new features recommended for improvement in accuracy. The bottleneck method is derived from the data collection area of the Rumah123.com website, encompassing data preprocessing and data exploration. The following metrics will be used to evaluate each model: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R-squared (R²). Based on our study, we conclude that both Random Forest Regression and XGBoost Regression achieve the highest accuracy, with R² values of 0.9941 and 0.9955, respectively, after adjustment. Conversely, Linear Regression and KNN Regression have the lowest accuracy, with KNN Regression being the least accurate. The primary contribution of this study is the development of a more accurate house price prediction model that can be applied in cities with similar market characteristics. These findings provide practical insights for property developers and buyers when making investment decisions.
ISOLATION FOREST PARAMETER TUNING FOR MOBILE APP ANOMALY DETECTION BASED ON PERMISSION REQUESTS Kaunang, Valencia Claudia Jennifer; Alamsyah, Nur; Nursyanti, Reni; Budiman, Budiman; Danestiara, Venia R; Setiana, Elia
Jurnal Pilar Nusa Mandiri Vol. 21 No. 2 (2025): Pilar Nusa Mandiri : Journal of Computing and Information System Publishing Pe
Publisher : LPPM Universitas Nusa Mandiri

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33480/pilar.v21i2.6647

Abstract

Ensuring mobile app security needs the capability to detect apps that request excessive or inappropriate permissions. This research proposes an anomaly detection approach using Isolation Forest, enhanced through hyperparameter tuning, to identify suspect apps based on permission request patterns. The dataset is processed into binary features, followed by exploratory data analysis (EDA) to examine the distribution and highlight sensitive permissions. The Isolation Forest model is then optimized by tuning parameters such as contamination level, number of estimators, and sample size. The fine-tuned model achieved a more accurate separation between normal and anomaly applications, detecting 10 anomalies out of 200 applications, with anomaly applications averaging 125.10 permits compared to 42.76 in normal applications. These anomalies often requested permissions related to network, storage, contacts and microphone, indicating potential privacy risks. The results show that parameter tuning improves the detection performance of Isolation Forest, providing a practical solution for mobile security monitoring. After tuning, the number of false positives decreased by 50%, and the model successfully reduced detected anomalies from 20 to 10, increasing the precision of anomaly detection from 70% to 90%. Future work could include improving feature selection and integration into real-time detection systems. 
Fine-Tuned Autoencoder Neural Network for Anomaly Detection in Accounting Transactions Nur Alamsyah; Budiman, Budiman; Rahmani, Hani Fitria; Erpurini, Wala
Paradigma - Jurnal Komputer dan Informatika Vol. 27 No. 2 (2025): September 2025 Period
Publisher : LPPM Universitas Bina Sarana Informatika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31294/p.v27i2.8697

Abstract

Anomaly detection in accounting transactions plays a crucial role in identifying irregularities that may signal fraud, errors, or unusual financial behavior. Traditional rule-based and statistical methods often struggle to detect complex and hidden patterns in large-scale financial datasets. This paper presents a fine-tuned Autoencoder Neural Network for detecting anomalies in structured accounting records. The model processes feature such as date, account type, debit, credit, transaction category, and payment method. Preprocessing includes handling missing values, encoding categorical data, and extracting temporal features. The Autoencoder architecture was optimized using multiple hidden layers and dropout regularization to prevent overfitting. Reconstruction errors were used to determine anomaly scores, with a dynamic threshold set at the 98th percentile. Experimental results show that the model accurately distinguishes normal and anomalous transactions, identifying 2,000 outliers from a total of 100,000 records. Additional analysis indicates that anomalies often occur during weekends or holidays and involve unusual payment methods. These findings demonstrate the potential of the fine-tuned Autoencoder as a scalable and intelligent anomaly detection framework to support auditors and financial analysts in proactive fraud prevention.
OPTIMIZED DEEP AUTOENCODER WITH L1 REGULARIZATION AND DROPOUT FOR ANOMALY DETECTION IN 6G NETWORK SLICING Jennifer Kaunang, Valencia Claudia; Alamsyah, Nur; Parama Yoga, Titan; Hendra, Acep; Budiman, Budiman
Jurnal Techno Nusa Mandiri Vol. 20 No. 2 (2025): Techno Nusa Mandiri : Journal of Computing and Information Technology Period o
Publisher : Lembaga Penelitian dan Pengabdian Pada Masyarakat

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33480/techno.v20i2.6912

Abstract

The increasing complexity of 6G network slicing introduces new challenges in identifying abnormal behavior within highly virtualized and dynamic network infrastructures. This study aims to address the anomaly detection problem in 6G slicing environments by comparing the performance of three models: a supervised random forest classifier, a basic unsupervised autoencoder, and an optimized deep autoencoder enhanced with L1 regularization and dropout techniques. The optimized autoencoder is trained to reconstruct normal data patterns, with anomaly detection performed using a threshold- based reconstruction error approach. Reconstruction errors are evaluated across different percentile thresholds to determine the optimal boundary for classifying abnormal behavior. All models are tested on a publicly available 6G Network Slicing Security dataset. Results show that the optimized autoencoder outperforms both the baseline autoencoder and the random forest in terms of anomaly sensitivity. Specifically, the optimized model achieves an F1- score of 0.1782, a recall of 0.2095, and an accuracy of 0.714. These results indicate that introducing regularization and dropout significantly improves the ability of autoencoders to generalize and isolate anomalies, even in highly imbalanced datasets. This approach provides a lightweight and effective solution for unsupervised anomaly detection in next- generation network environments.
A Metaheuristic Wrapper Approach to Feature Selection with Genetic Algorithm for Enhancing XGBoost Classification in Diabetes Prediction Alamsyah, Nur; Budiman; Danestiara, Venia Restreva; Yoga, Titan Parama; Nursyanti, Reni; Kaunang, Valencia
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Vol. 10, No. 4, November 2025
Publisher : Universitas Muhammadiyah Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22219/kinetik.v10i4.2366

Abstract

This study addressed the problem of selecting the most relevant features for improving the accuracy of diabetes classification using health indicator data. The research focused on a binary classification task based on the Behavioral Risk Factor Surveillance System dataset, which comprised over seventy thousand records and twenty-one predictive features related to individual health behaviors and conditions. A metaheuristic wrapper approach was developed by integrating a Genetic Algorithm for feature selection with an XGBoost classifier to evaluate the predictive quality of each feature subset. The fitness function was defined as the average classification accuracy obtained through cross-validation. In addition to feature selection, hyperparameter optimization of the XGBoost model was carried out using a Bayesian-based search strategy to further enhance performance. The proposed method successfully identified a subset of fourteen optimal features that contributed most significantly to the prediction of diabetes. The final model, combining the selected features and optimized parameters, achieved an accuracy of 0.753, outperforming both the baseline models trained on all features and models using features selected through deterministic methods. These results confirmed the effectiveness of combining evolutionary feature selection with model tuning to build efficient and interpretable predictive models for medical data classification. This approach demonstrated a practical solution for managing high-dimensional data in the context of chronic disease prediction.
Optimization Of Micro, Small and Medium Enterprise Financial Management Through Android-Based Zazan Mobile Application for Efficiency of Digital Economy Sustainability Yoga, Titan Parama; Setiana, Elia; Hamzah, Encep; Budiman, Budiman; Sarifiyono, Aggi Panigoro
Jurnal Computech & Bisnis (e-journal) Vol. 18 No. 2 (2024): Jurnal Computech & Bisnis (e-Journal)
Publisher : LPPM STMIK Mardira Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56447/95x8j738

Abstract

In order to sustain and grow a firm, financial management is essential. This procedure is essential for obtaining profit and loss data, preventing employee and business partner fraud, and separating personal and business funds to determine the company's financial situation accurately. Informal MSME actors multitask and operate as small business owners, entrepreneurs, and managers of all business issues. So, there is not enough time to document the financials of a corporation. Making financial records requires basic knowledge, which makes the work feel challenging, intricate, and time-consuming. One way to address this issue is making the Android mobile application "Zazan" to assist in managing the finances of medium-sized, tiny, and microbusinesses. A descriptive strategy was used to gather the data required for this investigation. An integrated application that could record business activity transactions, record business activity schedules, and connect with customers was required, as determined by the analysis and implementation findings. Furthermore, clients can quickly learn more about business players by accessing location information. Interaction between customers and business transaction activities facilitates the automation of business activity recording, enabling the application to complete financial recording. Entrepreneurs will find it easier to manage their financial spending with the help of the company's financial reports. Having employee and multi-store functionalities would be preferable because business actors occasionally have multiple business branches and employees. The implications of this research extend beyond merely providing a tool for financial management; they highlight the necessity of accessible financial solutions for MSMEs. By bridging the gap in financial literacy and time constraints, the "Zazan" application not only empowers business owners to maintain accurate financial records but also fosters better decision-making and strategic planning. This can lead to enhanced business performance, improved relationships with stakeholders, and ultimately, sustainable growth in the competitive market landscape.