"Journal of Data Science
Vol. 4 No. 01 (2026): Journal Of Data Science, 2026

Phonological Features in Indonesian Social Media Marketing Texts: A Natural Language Processing Approach

Rumaisa, Fitrah (Unknown)
Darajat, Zakiah (Unknown)
Yan Puspitarani (Unknown)



Article Info

Publish Date
04 May 2026

Abstract

The expansion of social media platforms has significantly reshaped marketing communication strategies, particularly in countries with highly active digital audiences such as Indonesia. Among these platforms, TikTok has become a dominant channel for promotional messaging, where short textual captions play a crucial role in attracting user attention. Within this context, the phonological structure of language may influence how promotional messages are perceived, remembered, and shared. This study investigates the phonological characteristics of Indonesian marketing captions on social media using a Natural Language Processing (NLP) framework. A dataset consisting of user-generated promotional captions from TikTok was collected and processed through several stages, including text normalization, tokenization, and noise removal. The cleaned text was then transformed into phoneme sequences using a hybrid grapheme-to-phoneme (G2P) conversion approach. From these phoneme representations, several phonological features were extracted, including vowel–consonant ratios, syllable length distribution, phoneme entropy, and phonotactic surprisal. The analysis reveals several recurring sound patterns in Indonesian promotional language. In particular, captions tend to favor open syllable structures, vowel-rich word formations, and repetitive phoneme sequences that contribute to rhythmic flow and memorability. Words containing dominant vowels such as /a/ and /i/ appear frequently in marketing expressions, suggesting that phonological aesthetics may play a role in shaping audience engagement. Overall, the findings suggest that phonological structure is not merely a linguistic artifact but may also function as a subtle persuasive mechanism in digital marketing discourse. By combining computational phonology with marketing analysis, this study offers new insights into how sound patterns embedded in written promotional texts can influence communication effectiveness in the Indonesian social media landscape.

Copyrights © 2026






Journal Info

Abbrev

visualization

Publisher

Subject

Automotive Engineering Computer Science & IT

Description

The "Journal of Data Science" is a real journal that focuses on the field of data science. It covers a wide range of topics related to data analysis, machine learning, statistics, data mining, and related areas. The journal aims to publish high-quality research papers, reviews, and technical notes ...