Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control

Evaluating Synonym Augmentation Impact on SBERT Performance for Indonesian Social Media Style Classification Pamput, Jessicha Putrianingsih; Muthmainnah, Aindri Rizky; Surianto, Dewi Fatmarani; Budiarti, Nur Azizah Eka; Wahid, Abdul
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control Vol. 11, No. 2, May 2026 (Article in Progress)
Publisher : Universitas Muhammadiyah Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22219/kinetik.v11i2.2580

Abstract

Language on social media reflected the identity and characteristics of its users, including differences in language style between generations. Millennials and Generation Z were two dominant demographic groups in digital communication that exhibited linguistic variations, which often caused gaps in understanding during online interactions. Variations in language structure and expression posed challenges in understanding the context of cross-generational communication. Therefore, this study aimed to classify linguistic styles across generations in social media texts by combining Sentence-BERT (SBERT). FastText-based synonym augmentation in Indonesian, and Support Vector Machine (SVM) as a margin-based classification model that utilizes embedding representations from SBERT. The results showed that synonym augmentation improved model accuracy from 85% to 93%, with a similarity threshold of 0.7 providing the best balance between data diversity and semantic consistency. These findings confirmed that synonym-based augmentation and SBERT semantic adaptation were effective in capturing generational linguistic differences in informal Indonesian. This approach had the potential to be applied in other NLP tasks that required contextual understanding of social language variation, such as sentiment analysis and cross-generational dialect detection.