The expansion of social media platforms has significantly reshaped marketing communication strategies, particularly in countries with highly active digital audiences such as Indonesia. Among these platforms, TikTok has become a dominant channel for promotional messaging, where short textual captions play a crucial role in attracting user attention. Within this context, the phonological structure of language may influence how promotional messages are perceived, remembered, and shared. This study investigates the phonological characteristics of Indonesian marketing captions on social media using a Natural Language Processing (NLP) framework. A dataset consisting of user-generated promotional captions from TikTok was collected and processed through several stages, including text normalization, tokenization, and noise removal. The cleaned text was then transformed into phoneme sequences using a hybrid grapheme-to-phoneme (G2P) conversion approach. From these phoneme representations, several phonological features were extracted, including vowel–consonant ratios, syllable length distribution, phoneme entropy, and phonotactic surprisal. The analysis reveals several recurring sound patterns in Indonesian promotional language. In particular, captions tend to favor open syllable structures, vowel-rich word formations, and repetitive phoneme sequences that contribute to rhythmic flow and memorability. Words containing dominant vowels such as /a/ and /i/ appear frequently in marketing expressions, suggesting that phonological aesthetics may play a role in shaping audience engagement. Overall, the findings suggest that phonological structure is not merely a linguistic artifact but may also function as a subtle persuasive mechanism in digital marketing discourse. By combining computational phonology with marketing analysis, this study offers new insights into how sound patterns embedded in written promotional texts can influence communication effectiveness in the Indonesian social media landscape.
Copyrights © 2026