The Indonesian Journal of Computer Science
Vol. 13 No. 3 (2024): The Indonesian Journal of Computer Science (IJCS)

A Review of Text Classification Based on ML & Data Mining Algorithms

Mustafa, Ashraf Atam
Mohsin Abdulazeez, Adnan

Article Info

Publish Date
15 Jun 2024

Abstract

In the digital era, text classification has been transformed by the application of Machine Learning (ML) and Data Mining (DM) algorithms. This review traces the evolution from traditional data mining methods to more sophisticated ML strategies that improve the analysis and categorization of textual data. We discuss pivotal technologies, including Bayesian classifiers and Support Vector Machines (SVMs), as well as contemporary advances such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). We highlight the integration of Natural Language Processing (NLP) techniques for its critical role in enriching semantic analysis, which effective text classification requires. The paper also addresses challenges such as handling high-dimensional data, dealing with imbalanced datasets, and confronting ethical issues such as bias and privacy in automated systems. By synthesizing the latest research, this review identifies current gaps, proposes practical solutions, and forecasts future trends in text classification to support ongoing research and application across various sectors.

Copyrights © 2024

Journal Info

Abbrev

ijcs

Publisher

Subject

Computer Science & IT; Electrical & Electronics Engineering; Engineering

Description

The Indonesian Journal of Computer Science (IJCS) is a bimonthly peer-reviewed journal published by AI Society and STMIK Indonesia. IJCS editions will be published at the end of February, April, June, August, October and December. The scope of IJCS includes general computer science, information ...