International Journal of Electrical and Computer Engineering
Vol 14, No 5: October 2024

Reddit social media text analysis for depression prediction: using logistic regression with enhanced term frequency-inverse document frequency features

Ayyalasomayajula, Madan Mohan Tito (Unknown)
Agarwal, Akshay (Unknown)
Khan, Shahnawaz (Unknown)



Article Info

Publish Date
01 Oct 2024

Abstract

Language provides significant insights into an individual’s emotional state, social status, and personality traits. This research aims to enhance depression detection through the analysis of linguistic features and various dataset attributes. The dataset, sourced from the social networking platform Reddit, comprises posts and comments from individuals diagnosed with depression. Logistic regression with term frequency-inverse document frequency (TF-IDF) is employed as the primary model for text classification. To improve model performance, a novel feature—the average time interval between consecutive posts or comments—is introduced, contributing to a marginal but noteworthy improvement in accuracy. The proposed model demonstrates superior F1 scores compared to other models applied to the same dataset. Given the increasing recognition of mental health’s significance, accurately diagnosing mental disorders is of paramount importance. This study underscores the potential of leveraging linguistic analysis and advanced machine learning techniques to identify depressive symptoms, thereby contributing to more effective mental health diagnostics and interventions.

Copyrights © 2024






Journal Info

Abbrev

IJECE

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708, a SCOPUS indexed Journal, SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR & CiteScore Q2 on both of the Electrical & Electronics Engineering, and Computer Science) is the official publication of the Institute of ...