Journal : JOIV : International Journal on Informatics Visualization

Implementation of Word Trends Using a Machine Learning Approach with TF-IDF and Latent Dirichlet Allocation
Authors : Rifaldi, Dianda; Fadlil, Abdul; Herman, -
JOIV : International Journal on Informatics Visualization Vol 8, No 4 (2024)
Publisher : Society of Visual Informatics

DOI: 10.62527/joiv.8.4.2452

Abstract

Social media is now ubiquitous and makes it easy to share information and communicate. As a result, users upload a wide range of content, including opinions on mental health, particularly in Indonesia. Mental health refers to an individual's emotional, psychological, and social well-being, and mental health problems commonly affect individuals from adolescence through adulthood. This research used Twitter data on mental health issues gathered from October to November 2022 and applied TF-IDF and Latent Dirichlet Allocation (LDA) to analyze word trends in user-generated content. Following a sentiment analysis approach, each tweet was labeled as either negative or positive. TF-IDF then weighted word frequencies in the documents (tweets), with the data grouped by the resulting sentiment labels. Labels were assigned manually to avoid the errors that sentiment libraries can introduce for Indonesian-language text. Conclusions were drawn separately for each of the two techniques, with the aim of identifying word trends in mental health discourse among Twitter users. The results showed that the word trends generated by LDA were consistent with the keyword 'mental health', while TF-IDF produced word trends for the positive and negative labels, revealing the terms Twitter users commonly use to express these concerns. Future research could compare topic modeling techniques such as Latent Semantic Analysis (LSA), Probabilistic Latent Semantic Analysis (pLSA), and Hierarchical Dirichlet Process (HDP), where LSA and pLSA are approaches closely related to LDA.
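The two techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which libraries, preprocessing steps, or parameters were used, so everything here is an assumption. The tweets and labels are hypothetical English stand-ins for the Indonesian data, `tokenize`, `top_tfidf_terms_by_label`, and `lda_top_words` are illustrative names, and the values of `alpha`, `beta`, and the topic count are arbitrary. TF-IDF is computed directly from its definition, and LDA is fitted with a small collapsed Gibbs sampler, so the block needs only the Python standard library.

```python
import math
import random
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing)."""
    return text.lower().split()


def top_tfidf_terms_by_label(docs, labels, k=3):
    """Return the k terms with the highest summed TF-IDF weight per label."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = defaultdict(Counter)
    for tokens, label in zip(tokenized, labels):
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            # The +1 keeps terms that occur in every document above zero.
            idf = math.log(n_docs / df[term]) + 1.0
            scores[label][term] += tf * idf
    return {lab: [t for t, _ in c.most_common(k)] for lab, c in scores.items()}


def lda_top_words(docs, n_topics=2, top_n=3, n_iter=200,
                  alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling; return top_n words per topic."""
    rng = random.Random(seed)
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for tokens in tokenized for t in tokens})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)
    doc_topic = [[0] * n_topics for _ in tokenized]       # tokens per (doc, topic)
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens per (topic, word)
    topic_total = [0] * n_topics                           # tokens per topic
    assignments = []
    # Start from a random topic assignment for every token.
    for d, tokens in enumerate(tokenized):
        z_d = []
        for t in tokens:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[t]] += 1
            topic_total[k] += 1
        assignments.append(z_d)
    for _ in range(n_iter):
        for d, tokens in enumerate(tokenized):
            for i, t in enumerate(tokens):
                w, k = word_id[t], assignments[d][i]
                # Remove this token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][j] + alpha)
                    * (topic_word[j][w] + beta)
                    / (topic_total[j] + n_words * beta)
                    for j in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    topics = []
    for row in topic_word:
        ranked = sorted(range(n_words), key=lambda w: -row[w])
        topics.append([vocab[w] for w in ranked[:top_n]])
    return topics


# Hypothetical, manually labeled example tweets (not from the study's data).
tweets = [
    "mental health support helps me heal",
    "therapy and support improve mental health",
    "anxiety and stress hurt my mental health",
    "stress and anxiety keep me awake",
]
sentiments = ["positive", "positive", "negative", "negative"]

print(top_tfidf_terms_by_label(tweets, sentiments))
print(lda_top_words(tweets))
```

With a real corpus, stop-word removal and stemming for Indonesian would normally precede both steps, and an established library implementation would usually replace the hand-written sampler.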