Claim Missing Document
Check
Articles

Found 6 Documents
Search
Journal : IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Aspect-Based Sentiment Analysis of Online Marketplace Reviews Using Convolutional Neural Network MHD Theo Ari Bangsa; Sigit Priyanta; Yohanes Suyanto
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 14, No 2 (2020): April
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.51646

Abstract

Most online stores provide product review facilities that contain responses to a product. The number of reviews makes it difficult for potential customers to make conclusions, so that sentiment analysis is needed to extract information from these reviews. Most sentiment analysis is done at the document level, so the results were still lacking in detail because the classification is based on the entire sentence or document and does not identify the specific aspect discussed. This research aims to classify aspect-based sentiments from online store reviews using the convolutional neural network (CNN) method with the extraction of features using Word2Vec. The dataset used is Indonesian review data from the site bukalapak.com. The test results on the built system showed that CNN's method of Word2Vec feature extraction has a better score than the naive bayes method with an accuracy value of 85.54%, 96.12% precision, 88.39% recall, and f-measure 92.02%. Classification without using stemming preprocessing on the dataset increases the accuracy by 2.77%.
Bidirectional Long Short Term Memory Method and Word2vec Extraction Approach for Hate Speech Detection Auliya Rahman Isnain; Agus Sihabuddin; Yohanes Suyanto
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 14, No 2 (2020): April
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.51743

Abstract

Currently, the discussion about hate speech in Indonesia is warm, primarily through social media. Hate speech is communication that disparages a person or group based on characteristics such as (race, ethnicity, gender, citizenship, religion and organization). Twitter is one of the social media that someone uses to express their feelings and opinions through tweets, including tweets that contain expressions of hatred because Twitter has a significant influence on the success or destruction of one's image.This study aims to detect hate speech or not hate Indonesian speech tweets by using the Bidirectional Long Short Term Memory method and the word2vec feature extraction method with Continuous bag-of-word (CBOW) architecture. For testing the BiLSTM purpose with the calculation of the value of accuracy, precision, recall, and F-measure.The use of word2vec and the Bidirectional Long Short Term Memory method with CBOW architecture, with epoch 10, learning rate 0.001 and the number of neurons 200 on the hidden layer, produce an accuracy rate of 94.66%, with each precision value of 99.08%, recall 93, 74% and F-measure 96.29%. In contrast, the Bidirectional Long Short Term Memory with three layers has an accuracy of 96.93%. The addition of one layer to BiLSTM increased by 2.27%.
Reccomendations on Selecting The Topic of Student Thesis Concentration using Case Based Reasoning Annisaa Utami; Yohanes Suyanto; Agus Sihabuddin
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 15, No 1 (2021): January
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.58919

Abstract

Case Based Reasoning (CBR) is a method that aims to resolve a new case by adapting the solutions contained in previous cases that are similar to the new case. The system built in this study is the CBR system to make recommendations on the topic of student thesis concentration.               This study used data from undergraduate students of Informatics Engineering IST AKPRIND Yogyakarta with a total of 115 data consisting of 80 training data and 35 test data. This study aims to design and build a Case Based Reasoning system using the Nearest Neighbor and Manhattan Distance Similarity Methods, and to compare the results of the accuracy value using the Nearest Neighbor Similarity and Manhattan Distance Similarity methods.               The recommendation process is carried out by calculating the value of closeness or similarity between new cases and old cases stored on a case basis using the Nearest Neighbor Method and Manhattan Distance.  The features used in this study consisted of GPA and course grades. The case taken is the case with the highest similarity value. If a case doesnt get a topic recommendation or is less than the trashold value of 0.8, a case revision will be carried out by an expert. Successfully revised cases are stored in the system to be made new knowledge. The test results using the Nearest Neighbor Method get an accuracy value of 97.14% and Manhattan Distance Method 94.29%.
Attention-Based BiLSTM for Negation Handling in Sentimen Analysis Riszki Wijayatun Pratiwi; Yunita Sari; Yohanes Suyanto
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 14, No 4 (2020): October
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.60733

Abstract

Research on sentiment analysis in recent years has increased. However, in sentiment analysis research there are still few ideas about the handling of negation, one of which is in the Indonesian sentence. This results in sentences that contain elements of the word negation have not found the exact polarity.The purpose of this research is to analyze the effect of the negation word in Indonesian. Based on positive, neutral and negative classes, using attention-based Long Short Term Memory and word2vec feature extraction method with continuous bag-of-word (CBOW) architecture. The dataset used is data from Twitter. Model performance is seen in the accuracy value.The use of word2vec with CBOW architecture and the addition of layer attention to the Long Short Term Memory (LSTM) and Bidirectional Long Short Term Memory (BiLSTM) methods obtained an accuracy of 78.16% and for BiLSTM resulted in an accuracy of 79.68%. whereas in the FSW algorithm is 73.50% and FWL 73.79%. It can be concluded that attention based BiLSTM has the highest accuracy, but the addition of layer attention in the Long Short Term Memory method is not too significant for negation handling. because the addition of the attention layer cannot determine the words that you want to pay attention to.
The Effect of Text Summarization in Essay Scoring (Case Study: Teach on E-Learning) Sensa Gudya Sauma Syahra; Yunita Sari; Yohanes Suyanto
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 16, No 1 (2022): January
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.69906

Abstract

The development of automated essay scoring (AES) in the neural network (NN) approach has eliminated feature engineering. However, feature engineering is still needed, moreover, data with labels in the form of rubric scores, which are complementary to AES holistic scores, are still rarely found. In general, data without labels/scores is found more. However, unsupervised AES research has not progressed with the more common use of publicly labeled data. Based on the case studies adopted in the research, automatic text summarization (ATS) was used as a feature engineering model of AES and readability index as the definition of rubric values for data without labels.This research focuses on developing AES by implementing ATS results on SOM and HDBSCAN. The data used in this research are 403 documents of TEACH ON E-learning essays. Data is represented in the form of a combination of word vectors and a readability index. Based on the tests and measurements carried out, it was concluded that AES with ATS implementation had no good potential for the assessment of TEACH ON essays in increasing the silhouette score. The model produces the best silhouette score of 0.727286113 with original essay data.
Offensive Language and Hate Speech Detection using BERT Model Amalia, Fadila Shely; Suyanto, Yohanes
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Vol 18, No 4 (2024): October
Publisher : IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.22146/ijccs.99841

Abstract

Hate speech detection is an important issue in sentiment analysis and natural language processing. This study aims to improve the effectiveness of hate speech detection in English text using the BERT model, along with modified preprocessing techniques to enhance the F1-score. The dataset, sourced from Kaggle, contains English text with hate speech content. Evaluation results show a significant improvement in the model's accuracy and overall text classification performance. The BERT model achieved 89.11% accuracy on test data, correctly predicting 85 out of 95 samples. While the model excels at classifying offensive text with around 95% accuracy, it struggles to distinguish between hate and offensive text, with some confusion between neither and offensive categories. The classification report shows F1-scores of 0.43 for the hate class, 0.94 for the offensive class, and 0.84 for the neither class, with a weighted average F1-score of 0.89 and a macro average of 0.73. These results indicate that the BERT model delivers solid performance in detecting hate speech, though there is room for improvement, particularly in distinguishing certain classes.