JURNAL MEDIA INFORMATIKA BUDIDARMA
Vol 8, No 3 (2024): Juli 2024

Optimization of Sentiment Analysis Classification of ChatGPT on Big Data Twitter in Indonesia using BERT

Sinaga, Frans Mikael (Unknown)
Purba, Ronsen (Unknown)
Pipin, Sio Jurnalis (Unknown)
Lestari, Wulan Sri (Unknown)
Winardi, Sunaryo (Unknown)



Article Info

Publish Date
27 Jul 2024

Abstract

This research is grounded in the emergence of ChatGPT technology, supported by prior and similar studies. The urgency of the issue is highlighted by previous research indicating non-convergent classification outcomes in LSTM (Long Short-Term Memory) methods due to suboptimal hyperparameter settings and limitations in understanding text data within Big Data. The presence of ChatGPT technology brings both benefits and potential misuse, such as copyright infringement, unauthorized news extraction, and violations of accountability principles. Understanding public sentiment towards the presence of ChatGPT technology is crucial. The research aims to implement the BERT (Bidirectional Encoder Representations from Transformers) method to achieve accurate and convergent sentiment analysis classification. This study involves data preprocessing stages using Natural Language Processing (NLP) techniques. Text data, already vectorized, is classified using BERT to determine public sentiment (positive, negative, neutral) towards ChatGPT technology, ensuring greater accuracy, convergence, and contextual relevance. Performance testing of the BERT model is conducted using a Confusion Matrix. With parameters set to Max Sequence Length = 128 and Batch Size = 16, the highest classification accuracy achieved is 93.4%.

Copyrights © 2024






Journal Info

Abbrev

mib

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering

Description

Decission Support System, Expert System, Informatics tecnique, Information System, Cryptography, Networking, Security, Computer Science, Image Processing, Artificial Inteligence, Steganography etc (related to informatics and computer ...