Indonesian Journal of Applied Research (IJAR)
Vol. 4 No. 1 (2023): Indonesian Journal of Applied Research (IJAR)

Comparison of Feature Selection Based on Computation Time and Classification Accuracy Using Support Vector Machine

Salmun K Nasib (Statistics Study Program, Gorontalo State University, Indonesia)
Fadilah Istiqomah Pammus (Statistics Study Program, Gorontalo State University, Indonesia)
Nurwan (Mathematics Study Program, Gorontalo State University, Indonesia)
La Ode Nashar (Statistics Study Program, Gorontalo State University, Indonesia)



Article Info

Publish Date
18 Apr 2023

Abstract

This research compares Chi-Square feature selection with Mutual Information feature selection in terms of computation time and classification accuracy. Public comments on Twitter are classified into positive, negative, and neutral sentiments using the Support Vector Machine method. A drawback of sentiment classification is the large number of features involved, so feature selection is needed to optimize classification performance. Chi-Square and Mutual Information are both feature selection methods that can improve the accuracy of sentiment classification. The data were collected from Twitter using a Python IDE application. The results indicate that sentiment classification with Chi-Square feature selection takes 0.4375 seconds of computation time and achieves an accuracy of 78%, while sentiment classification with Mutual Information feature selection achieves an accuracy of 80% and requires 252.75 seconds of computation time. It is concluded that, in terms of computation time, Chi-Square feature selection is superior to Mutual Information feature selection, whereas in terms of classification accuracy, Mutual Information feature selection is more accurate than Chi-Square feature selection. For further research, Mutual Information feature selection is recommended for obtaining high accuracy in sentiment classification.
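
To make the two selection criteria concrete, the following is a minimal sketch of how a Chi-Square score and a Mutual Information score can be computed for a single term against the sentiment labels, and how the top-scoring terms can be picked and timed. It is not the authors' implementation: the toy tweets in `DOCS`, the function names, and the `timed` helper are all hypothetical, terms are treated as simple present/absent indicators, and the Support Vector Machine step is left out.

```python
import math
import time
from collections import Counter

# Hypothetical toy data: (tokens, sentiment label). Not the study's Twitter data.
DOCS = [
    (["good", "service"], "positive"),
    (["good", "fast"], "positive"),
    (["bad", "service"], "negative"),
    (["bad", "slow"], "negative"),
    (["service", "today"], "neutral"),
    (["update", "news"], "neutral"),
]


def chi_square_score(term_present, labels):
    """Chi-square statistic between a present/absent term indicator and labels."""
    n = len(labels)
    observed = Counter(zip(term_present, labels))
    term_totals = Counter(term_present)
    class_totals = Counter(labels)
    score = 0.0
    for t in term_totals:
        for c in class_totals:
            expected = term_totals[t] * class_totals[c] / n
            score += (observed[(t, c)] - expected) ** 2 / expected
    return score


def mutual_information_score(term_present, labels):
    """Mutual information (in nats) between a term indicator and labels."""
    n = len(labels)
    joint = Counter(zip(term_present, labels))
    term_totals = Counter(term_present)
    class_totals = Counter(labels)
    score = 0.0
    for (t, c), count in joint.items():
        # p(t, c) * log( p(t, c) / (p(t) * p(c)) )
        ratio = count * n / (term_totals[t] * class_totals[c])
        score += (count / n) * math.log(ratio)
    return score


def select_top_k(docs, score_fn, k):
    """Rank every vocabulary term by score_fn and keep the k best."""
    labels = [label for _, label in docs]
    vocab = sorted({tok for toks, _ in docs for tok in toks})
    scores = {
        term: score_fn([term in toks for toks, _ in docs], labels)
        for term in vocab
    }
    return sorted(vocab, key=lambda term: (-scores[term], term))[:k]


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    for name, fn in [("chi-square", chi_square_score),
                     ("mutual information", mutual_information_score)]:
        selected, seconds = timed(select_top_k, DOCS, fn, 2)
        print(f"{name}: {selected} in {seconds:.6f} s")
```

In this toy data, a term such as "service" that appears once in every class scores zero under both criteria, while a term confined to one class scores high. In a full pipeline the selected terms would then be passed to a classifier such as a Support Vector Machine; timings on a toy example say nothing about the computation times reported in the abstract.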

Copyrights © 2023






Journal Info

Abbrev

IJAR

Publisher

Subject

Agriculture, Biological Sciences & Forestry; Biochemistry, Genetics & Molecular Biology; Computer Science & IT

Description

Indonesian Journal of Applied Research (IJAR), e-ISSN 2722-6395, is a high-quality, open-access, peer-reviewed research journal published by Universitas Djuanda (UNIDA). IJAR is dedicated to publishing significant research findings in the fields of Applied Sciences, Engineering & Technology. We welcome ...