Abi Mas'ud
Magister Ilmu Komputer, Universitas Bumigora

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Prediksi Gender Berdasarkan Nama Menggunakan Kombinasi Model IndoBERT, Convolutional Neural Network (CNN) dan Bidirectional Long Short-Term Memory (BiLSTM) Abi Mas'ud; Bambang Krismono Triwijoyo; Dadang Priyanto
Jurnal Teknologi Informasi dan Multimedia Vol. 7 No. 3 (2025): August
Publisher : Sekawan Institut

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35746/jtim.v7i3.736

Abstract

This study proposes a name-based gender prediction model in the Indonesian language by combining the architectures of Indonesian Bidirectional Encoder Representations from Transformers (IndoBERT), Convolutional Neural Network (CNN), and Bidirectional Long Short-Term Memory (BiLSTM). The non-standardized and diverse structure of Indonesian names presents a significant challenge for text-based gender classification tasks. To address this, a hybrid approach was developed to leverage the contextual representation power of IndoBERT, the local pattern extraction capability of CNN, and the sequential dependency modeling strength of BiLSTM. The dataset consists of 4,796 student names from Universitas Bumigora, collected between 2018 and 2023. The preprocessing steps include lowercasing, punctuation removal, label encoding, and train-test splitting. Evaluation results based on accuracy, precision, recall, and F1-score indicate that the IndoBERT-CNN-BiLSTM model achieved the best performance, with an accuracy of 90.94%, F1-score of 91.03%, and training stability without signs of overfitting. This model demonstrates high effectiveness in name-based gender classification and holds strong potential for applications such as population information systems, service personalization, and name-based demographic analysis.