Articles

Optimizing SMS Spam Detection Using Machine Learning: A Comparative Analysis of Ensemble and Traditional Classifiers
Airlangga, Gregorius
Journal of Computer Networks, Architecture and High Performance Computing Vol. 6 No. 4 (2024): Articles Research October 2024
Publisher : Information Technology and Science (ITScience)

DOI: 10.47709/cnahpc.v6i4.4822

Abstract

With the rapid rise of mobile communication, Short Message Service (SMS) has become an essential platform for transmitting information. However, the growing volume of unsolicited and harmful spam messages presents significant challenges for both users and mobile network operators. This study explores the effectiveness of various machine learning models, including Random Forest, Gradient Boosting, AdaBoost, Support Vector Machine (SVM), Logistic Regression, and an Ensemble Voting Classifier, in detecting SMS spam. A dataset containing 5,572 SMS messages, labeled as either spam or ham (legitimate), was used to evaluate these models. Hyperparameter tuning was performed on each model to optimize accuracy, and the models were assessed using metrics such as precision, recall, F1-score, and accuracy. The results indicated that the SVM and Ensemble Voting Classifier achieved the highest performance, with accuracies of 0.9857 and 0.9848, respectively. Both models demonstrated superior recall for spam messages, making them highly effective for real-world spam detection systems. While Random Forest, Gradient Boosting, and AdaBoost also performed well, their slightly lower recall for spam suggests that they may misclassify some spam as legitimate messages. The study highlights the effectiveness of machine learning models in addressing the SMS spam problem, particularly when using ensemble methods. Future research should focus on addressing class imbalance and exploring deep learning approaches to further enhance model performance. These findings offer valuable insights for developing more accurate and scalable SMS spam detection systems.
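The voting step mentioned in the abstract above can be sketched as follows. This is a minimal illustration that assumes hard (majority) voting; the abstract does not say whether hard or soft voting was used. The function name `majority_vote` and the three prediction lists are hypothetical.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one label list by majority vote.

    predictions: list of lists, one inner list of labels per model.
    On a tie, the label that was counted first (in model order) wins.
    """
    combined = []
    for labels in zip(*predictions):
        # most_common(1) returns [(label, count)] for the top label.
        combined.append(Counter(labels).most_common(1)[0][0])
    return combined


# Hypothetical outputs of three base classifiers on four messages.
model_a = ["spam", "ham", "ham", "spam"]
model_b = ["spam", "ham", "spam", "ham"]
model_c = ["ham", "ham", "spam", "spam"]

ensemble = majority_vote([model_a, model_b, model_c])
print(ensemble)
```

With an odd number of voters and two classes, as here, ties cannot occur, so the tie-breaking rule only matters for other configurations.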
A Comparative Analysis of Deep Learning Models for SMS Spam Detection: CNN-LSTM, CNN-GRU, and ResNet Approaches
Airlangga, Gregorius
Journal of Computer Networks, Architecture and High Performance Computing Vol. 6 No. 4 (2024): Articles Research October 2024
Publisher : Information Technology and Science (ITScience)

DOI: 10.47709/cnahpc.v6i4.4827

Abstract

Spam messages have become a growing challenge in mobile communication, threatening user security and data privacy. Traditional spam detection methods, including rule-based and machine learning techniques, are increasingly insufficient due to the evolving sophistication of spam tactics. This research evaluates the effectiveness of advanced deep learning models such as CNN-LSTM, CNN-GRU, and ResNet for SMS spam detection. The dataset used consists of diverse SMS messages labeled as either spam or legitimate (ham), ensuring broad coverage of real-world spam patterns. The study employs a robust ten-fold cross-validation approach to assess the generalization capabilities of the models, measuring performance based on accuracy, precision, recall, and F1 score. The results indicate that ResNet outperformed the other models, achieving an average accuracy of 99.08% and an F1 score of 0.9646, making it the most reliable model for spam detection. CNN-GRU demonstrated competitive performance with a balance between accuracy (98.97%) and computational efficiency, making it suitable for real-time applications. CNN-LSTM, while highly accurate (98.92%), showed a slightly lower recall compared to the other models, indicating a more cautious approach to detecting spam. These findings highlight the potential of hybrid deep learning models in addressing the complexities of SMS spam detection. Future research could focus on optimizing these models for deployment in resource-constrained environments, such as mobile devices, and further exploring the integration of residual connections for more effective spam filtering.
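The ten-fold cross-validation protocol described in the abstract above partitions the data into ten test folds and trains on the remaining nine each time. A minimal sketch of the index bookkeeping follows; it assumes contiguous, unshuffled folds, whereas the study may well have shuffled or stratified the data. The helper name `k_fold_indices` and the sample count of 25 are hypothetical.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs for k contiguous folds.

    The first n_samples % k folds receive one extra sample so that
    every sample lands in exactly one test fold.
    """
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


# Hypothetical dataset size, chosen only to show uneven fold sizes.
folds = list(k_fold_indices(25, k=10))
for train_idx, test_idx in folds:
    print(len(train_idx), len(test_idx))
```

The reported average accuracy and F1 score would then be the mean of the per-fold scores obtained by training and evaluating a model on each pair.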
Comparative Analysis of Machine Learning Models for Intrusion Detection in Internet of Things Networks Using the RT-IoT2022 Dataset
Airlangga, Gregorius
MALCOM: Indonesian Journal of Machine Learning and Computer Science Vol. 4 No. 2 (2024): MALCOM April 2024
Publisher : Institut Riset dan Publikasi Indonesia

DOI: 10.57152/malcom.v4i2.1304

Abstract

This research investigates the performance of various machine learning models in developing an Intrusion Detection System (IDS) for the complex and evolving security landscape of Internet of Things (IoT) networks. Employing the RT-IoT2022 dataset, which captures a diverse array of IoT devices and attack methodologies, we meticulously evaluated four prominent models: Gradient Boosting, Random Forest, Logistic Regression, and Multi-Layer Perceptron (MLP). Our results indicate that both Gradient Boosting and Random Forest achieved perfect scores with an accuracy, precision, recall, and F1 score of 1.00, suggesting their superior ability to classify and predict security incidents within the dataset. Logistic Regression demonstrated commendable consistency with scores of 0.96 across all metrics, proposing a balance between model complexity and performance. The MLP model closely followed, with an accuracy, precision, recall, and F1 score of 0.99, highlighting its potential in capturing complex, nonlinear data relationships. These findings underscore the critical role of machine learning in fortifying IoT networks against cyber threats and the need for continuous model evaluation against real-world data. The study provides a pathway for future research to refine these IDS models for operational efficiency and sustainability in the dynamic IoT security domain. 
Advancing fake news detection: a comparative study of RNN, LSTM, and Bidirectional LSTM Architectures
Airlangga, Gregorius
Jurnal Teknik Informatika C.I.T Medicom Vol 16 No 1 (2024): March: Intelligent Decision Support System (IDSS)
Publisher : Institute of Computer Science (IOCS)

DOI: 10.35335/cit.Vol16.2024.696.pp13-23

Abstract

In the era of information overload, the exponential growth of digital content has coincided with the proliferation of 'fake news,' posing a critical challenge to online information credibility. This study addresses the pressing need for robust fake news detection systems by conducting a comparative analysis of three neural network architectures: Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM), and Bidirectional LSTM (BiLSTM). Our primary objective is to assess their effectiveness in identifying fake news in a binary classification setting. To achieve this goal, we employed advanced neural network models and a dataset of news titles. Our applied research method included data preprocessing and the utilization of RNN, LSTM, and BiLSTM models, each tailored to handle sequential data and capture temporal dependencies. We rigorously assessed the performance of the RNN, LSTM, and BiLSTM models using a range of metrics, including accuracy, precision, recall, and F1-score. To achieve a comprehensive evaluation, we divided our dataset into training and testing subsets. Specifically, we allocated 67% of the data for training purposes and the remaining 33% for testing. Our research findings reveal that all three models consistently achieved high accuracy levels, approximately 91%, with slight variations in precision and recall. Notably, the LSTM model exhibited a marginal improvement in recall, which is crucial when the consequences of missing deceptive content outweigh false alarms. Conversely, the RNN model demonstrated slightly better precision, making it suitable for applications where minimizing false positives is paramount. Surprisingly, the BiLSTM model did not significantly outperform the unidirectional models, suggesting that, for our dataset, processing information bidirectionally may not be essential. In conclusion, our study contributes valuable insights to the field of fake news detection. It underscores the significance of model selection based on specific task requirements and dataset characteristics.
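The precision and recall trade-off discussed in the abstract above comes down to how false positives and false negatives are counted. The sketch below computes the four reported metrics for a binary task. The labels are made up for illustration and are not taken from the study; `binary_metrics` is a hypothetical helper name.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Precision falls with false positives; recall falls with false negatives.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = (tp + tn) / len(pairs)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels: 1 = fake, 0 = real.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

A model tuned to miss fewer fake items trades false negatives for false positives, which raises recall at the cost of precision, and the reverse holds for a more conservative model.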
Enhancing Facial Emotion Recognition on FER2013 Using Attention-based CNN and Sparsemax-Driven Class-Balanced Architectures
Suwartono, Christiany; Bata, Julius Victor Manuel; Airlangga, Gregorius
Buletin Ilmiah Sarjana Teknik Elektro Vol. 7 No. 4 (2025): December
Publisher : Universitas Ahmad Dahlan

DOI: 10.12928/biste.v7i4.14510

Abstract

Facial emotion recognition plays a critical role in various human–computer interaction applications, yet remains challenging due to class imbalance, label noise, and subtle inter-class visual similarities. The FER2013 dataset, containing seven emotion classes, is particularly difficult because of its low resolution and heavily skewed label distribution. This study presents a comparative investigation of advanced deep learning architectures against traditional machine-learning baselines on FER2013 to address these challenges and improve recognition performance. Two novel architectures are proposed. The first is an attention-based convolutional neural network (CNN) that integrates Mish activations and squeeze-and-excitation (SE) channel recalibration to enhance the discriminative capacity of intermediate features. The second, FastCNN-SE, is a refined extension designed for computational efficiency and minority-class robustness, incorporating Sparsemax activation, Poly-Focal loss, class-balanced reweighting, and MixUp augmentation. The research contribution is demonstrating how combining attention, sparse activations, and imbalance-aware learning improves FER performance under challenging real-world conditions. Both models were extensively evaluated: the attention-CNN under 10-fold cross-validation, achieving 0.6170 accuracy and 0.555 macro-F1, and FastCNN-SE on the held-out test set, achieving 0.5960 accuracy and 0.5138 macro-F1. These deep models significantly outperform PCA-based Logistic Regression, Linear SVC, and Random Forest baselines (≤0.37 accuracy and ≤0.29 macro-F1). We additionally justify the differing evaluation protocols by emphasizing cross-validation for architectural stability and held-out testing for generalization and note that FastCNN-SE contains ~3M parameters, enabling efficient inference. These findings demonstrate that architecture-level fusion of SE attention, Sparsemax, and Poly-Focal loss improves balanced emotion recognition, offering a strong foundation for future studies on efficient and robust affective-computing systems.
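For readers unfamiliar with the Sparsemax activation named in the abstract above, the sketch below shows the standard sort-and-threshold formulation, which projects a score vector onto the probability simplex and can assign exactly zero probability to low-scoring classes. It is a plain-Python illustration only. Where and how FastCNN-SE applies Sparsemax is not stated in the abstract, and the input scores here are invented.

```python
def sparsemax(z):
    """Sparsemax of a list of scores: a probability vector that may contain zeros."""
    z_sorted = sorted(z, reverse=True)
    cumulative = 0.0
    tau = 0.0
    for k, z_k in enumerate(z_sorted, start=1):
        cumulative += z_k
        # Support condition: the k-th largest score stays in the support
        # while 1 + k * z_k exceeds the running sum of the top-k scores.
        if 1 + k * z_k > cumulative:
            tau = (cumulative - 1) / k  # threshold for this support size
    # Scores at or below the threshold are clipped to exactly zero.
    return [max(v - tau, 0.0) for v in z]


# Invented class scores for illustration.
probs = sparsemax([0.5, 0.3, -1.0])
print(probs)
```

Unlike softmax, which gives every class a strictly positive probability, this output keeps only the classes whose scores clear the threshold, and the kept values still sum to one.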
Improved Text Classification for Indonesian Hate Speech Detection: FastText-LSTM Model with Easy Data Augmentation
Wicaksana, Hilman Singgih; Huda, Khairul; Airlangga, Gregorius
Jurnal Sistem Komputer dan Informatika (JSON) Vol. 7 No. 3 (2026): Maret 2026
Publisher : Universitas Budi Darma

DOI: 10.30865/json.v7i3.9637

Abstract

The swift expansion of social media in Indonesia has led to a significant rise in hate speech, highlighting the urgent need for effective automated detection techniques. This research evaluates the performance of the proposed FastText-Long Short-Term Memory with Easy Data Augmentation (FastText-LSTM-WE) compared with the baseline model, FastText-Convolutional Neural Network with Easy Data Augmentation (FastText-CNN-WE). To further investigate the impact of data augmentation, the effectiveness of both FastText-Long Short-Term Memory without Easy Data Augmentation (FastText-LSTM-WO) and FastText-Convolutional Neural Network without Easy Data Augmentation (FastText-CNN-WO) was also assessed. Bayesian Optimization was employed to identify the best hyperparameter configurations for each model. The experiments were carried out on a dataset comprising 14,306 samples while maintaining consistent experimental conditions. Model performance was measured using precision, recall, F1-score, and accuracy derived from the confusion matrix. The results indicate that FastText-LSTM-WE achieved the highest performance, with precision, recall, F1-score, and accuracy of 84.02%, 83.16%, 83.59%, and 81.37%, respectively. These findings demonstrate that the proposed model provides a robust and reliable solution for detecting hate speech within the Indonesian context, thereby improving automated content moderation systems in practical applications.
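The F1-score reported in the abstract above is the harmonic mean of precision and recall, so it can be cross-checked from the two reported values. The short sketch below does that arithmetic. It uses only the percentages quoted in the abstract; the function name `f1_from` is hypothetical.

```python
def f1_from(precision, recall):
    """Harmonic mean of precision and recall (same units in, same units out)."""
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for FastText-LSTM-WE, in percent.
reported_precision = 84.02
reported_recall = 83.16
reported_f1 = 83.59

recomputed_f1 = f1_from(reported_precision, reported_recall)
print(round(recomputed_f1, 2))
```

The recomputed value agrees with the reported 83.59% to two decimal places, which indicates that the three figures are internally consistent.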
EVALUATING MACHINE LEARNING MODELS FOR PREDICTING SLEEP DISORDERS IN A LIFESTYLE AND HEALTH DATA CONTEXT
Gregorius Airlangga
JIKO (Jurnal Informatika dan Komputer) Vol 7 No 1 (2024)
Publisher : Program Studi Teknik Informatika Universitas Khairun

DOI: 10.33387/jiko.v7i1.7870

Abstract

Sleep disorders significantly impact public health, but their detection is often complicated by the multifaceted nature of causative factors. This study investigates the efficacy of various machine learning (ML) models in identifying sleep disorders based on comprehensive lifestyle and health data. We employed a dataset comprising 400 individual records with features including demographic information, sleep metrics, lifestyle factors, and health parameters. The dataset distinguished between individuals with no sleep disorder, insomnia, and sleep apnea. We evaluated a broad spectrum of ML models including logistic regression, decision trees, ensemble methods like RandomForest and GradientBoosting, support vector machines, and neural networks. The models' performances were assessed using accuracy, precision, recall, and F1 score metrics. Results indicated that ensemble methods, particularly RandomForest and XGBClassifier, outperformed other models in terms of accuracy, precision, and F1 scores, achieving values as high as 0.93. These methods proved effective in managing the complexity and variability of the dataset, thereby suggesting their robustness in clinical predictive analytics. The study's findings advocate for the use of advanced ensemble techniques in developing diagnostic tools for sleep disorders, highlighting their potential to enhance predictive accuracy and reliability in real-world healthcare settings. Further research is recommended to optimize these models and explore their integration into clinical practice.
EVALUATING HYBRID NEURAL NETWORK ARCHITECTURES FOR PREDICTING SLEEP DISORDERS FROM STRUCTURED DATA
Gregorius Airlangga
JIKO (Jurnal Informatika dan Komputer) Vol 7 No 1 (2024)
Publisher : Program Studi Teknik Informatika Universitas Khairun

DOI: 10.33387/jiko.v7i1.7873

Abstract

The accurate diagnosis of sleep disorders is crucial for effective treatment and management, yet current methods often rely on subjective assessments and are not always reliable. This research examines the efficacy of various neural network architectures, including dense networks, convolutional neural networks (CNNs), recurrent neural networks (RNNs), and innovative hybrid models, in predicting sleep disorders from structured health data. Our study focuses on comparing the performance of these models using metrics such as accuracy, precision, recall, and F1 score across a dataset comprising 400 individuals with detailed sleep and lifestyle data. Our findings demonstrate that while traditional models like dense networks and CNNs for structured data yield robust results, hybrid models, particularly the CNN-Transformer, significantly outperform others. This model effectively integrates convolutional layers with Transformer’s attention mechanisms, excelling in handling complex data interactions and providing superior predictive accuracy with an F1 score and accuracy reaching as high as 0.91. Conversely, RNN models, designed to capture temporal data dependencies, showed less efficacy, underscoring the importance of model selection aligned with data characteristics. This suggests that for datasets not exhibiting strong temporal features, models leveraging spatial relationships or advanced attention mechanisms are more suitable. This study not only advances our understanding of neural network applications in medical diagnostics but also highlights the potential of hybrid models in enhancing diagnostic accuracy. These insights could lead to significant improvements in the early detection and treatment of sleep disorders, thereby enhancing patient outcomes and contributing to the broader field of medical informatics.