Claim Missing Document
Check
Articles

Hate Speech Detection on Twitter in Indonesia with Feature Expansion Using GloVe Febiana Anistya; Erwin Budi Setiawan
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 5 No 6 (2021): Desember 2021
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (400.201 KB) | DOI: 10.29207/resti.v5i6.3521

Abstract

Twitter is one of the popular social media to channel opinions in the form of criticism and suggestions. Criticism could be a form of hate speech if the criticism implies attacking something (an individual, race, or group). With the limit of 280 characters in a tweet, there is often a vocabulary mismatch due to abbreviations which can be solved with word embedding. This study utilizes feature expansion to reduce vocabulary mismatches in hate speech on Twitter containing Indonesian language by using Global Vectors (GloVe). Feature selection related to the best model is carried out using the Logistic Regression (LR), Random Forest (RF), and Artificial Neural Network (ANN) algorithms. The results show that the Random Forest model with 5.000 features and a combination of TF-IDF and Tweet corpus built with GloVe produce the best accuracy rate between the other models with an average of 88,59% accuracy score, which is 1,25% higher than the predetermined Baseline. The number of features used is proven to improve the performance of the system.
Feature Expansion Word2Vec Untuk Analisis Sentimen Kebijakan Publik di Twitter Alvi Rahmy Royyan; Erwin Budi Setiawan
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 6 No 1 (2022): Februari 2022
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (363.774 KB) | DOI: 10.29207/resti.v6i1.3525

Abstract

Social media users, especially on Twitter, can freely express opinions or other information in the form of tweets about anything, including responding to a public policy. In a written tweet, there is a limit of 280 characters per tweet and this allows for problems such as vocabulary mismatches. Therefore, in this study, the feature expansion Word2vec method was applied to overcome when the vocabulary mismatches occur. This study develops and compares the Twitter sentiment analysis system using the feature expansion Word2vec method with the Logistic Regression (LR) and Support Vector Machine (SVM) classification algorithms and the system without the feature expansion Word2Vec method. The results of this study, the feature expansion Word2Vec method on the SVM classification algorithm succeeded in increasing the system accuracy up to 0,99% with an accuracy value of 78,99%.
Optimization Prediction of Big Five Personality in Twitter Users Gita Safitri; Erwin Budi Setiawan
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 6 No 1 (2022): Februari 2022
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (380.58 KB) | DOI: 10.29207/resti.v6i1.3529

Abstract

Various kinds of information can be acquired from social media platforms; one of them is on Twitter. User biographical information and tweets are the essential assets for research that can describe the Big Five Personality, including openness, conscientiousness, extraversion, agreeableness, and neuroticism. Several previous studies have tried the prediction of Big Five Personality. However, the authors found problems in how to optimize the work of the personality prediction system. So, in this study, Big Five Personality predictions were carried out on users of Twitter and improved the performance of the personality prediction system. We implement optimization techniques such as sampling, feature selection, and hyperparameter tuning to enhance the performance. This study also applies linguistic feature extraction, such as LIWC and TF-IDF. By using 287 Twitter users that have permitted their data to be crawled acquired from an online survey using Big Five Inventory (BFI), and applying all optimization techniques, the average accuracy result is 84.22% which is a 74.44% gain over the specified baseline.
Word2Vec on Sentiment Analysis with Synthetic Minority Oversampling Technique and Boosting Algorithm Rayhan Rahmanda; Erwin Budi Setiawan
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 6 No 4 (2022): Agustus 2022
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (386.752 KB) | DOI: 10.29207/resti.v6i4.4186

Abstract

Customer opinion is an important aspect in determining the success of a company or service provider. By determining the sentiment of the existing opinion, the company can use it as an evaluation material to improve the quality of the service or product provided. Sentiment analysis can be used as a measure of opinion sentiment with input data in the form of a corpus which will be classified into positive or negative classes to obtain the level of customer satisfaction with a product or service. Aspect-based sentiment analysis can be used by companies to analyze more specifically and find out what aspects need to be improved. In this research, an aspect-based sentiment analysis was conducted on Telkomsel users on Twitter. The data used is 16,992 tweets from users who discuss several aspects such as Telkomsel's services and signals in Twitter. In this research Word2Vec was used for feature expansion to minimize vocabulary mismatch caused by limited words in tweets. The results showed that Word2Vec, Synthetic Minority Oversampling Technique (SMOTE), and Boosting algorithm combination with Logistic Regression classifier achieve highest accuracy of 95.10% for signal aspect and using hyperparameters makes the service aspect get the highest accuracy of 93.34%.
Aspect Based Sentiment Analysis with FastText Feature Expansion and Support Vector Machine Method on Twitter Muhammad Afif Raihan; Erwin Budi Setiawan
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 6 No 4 (2022): Agustus 2022
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (393.449 KB) | DOI: 10.29207/resti.v6i4.4187

Abstract

Social media such as Twitter has now become very close to society. Twitter users can express current issues, their opinions, product reviews, and many other things both positive and negative. Twitter is also used by companies to monitor the assessment of their products among the public as insight that will be used to evaluate what aspects of their products need to be further developed. Twitter with its limitation of only allowing users to post a maximum tweet of 280 characters will make a lot of abbreviated and difficult to understand words used, so it will allow vocabulary mismatch problems to occur. Therefore, in this paper, research conducted on aspect-based sentiment analysis of Telkomsel’s products from the aspects of signal and service by applying feature expansion using Fasttext word embedding to overcome vocabulary mismatch problem and classification with the Support Vector Machine (SVM) method. Sampling technique with Synthetic Minority Oversampling Technique (SMOTE) used to overcome data imbalance. The experimental results show that feature expansion can increase the performance of model. The final results obtained F1-Score value of the model for the signal aspect increased by 27.91% with F1-Score 95.93%, and for the service aspect increased by 42.36% with F1-Score 94.53%.
Recommender System Based on Tweets with Singular Value Decomposition and Support Vector Machine Classification Rafi Anandita Wicaksono; Erwin Budi Setiawan
Journal of Computer System and Informatics (JoSYC) Vol 3 No 4 (2022): August 2022
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/josyc.v3i4.2072

Abstract

In modern times, the movie industry is growing rapidly. Netflix is one of the platforms that can be used to watch movies and provides many types of genres and movie titles. With so many genres and movie titles sometimes making it difficult for people to choose a movie to watch, one solution to the problem is a recommendation system that can recommend movies based on user ratings. One method in the recommendation system is collaborative filtering. One of the algorithms contained in collaborative filtering is singular value decomposition. Twitter is one of the places where people often write their opinions about the movies they have watched, from people's tweets on Twitter will be processed into rating value data. In this system, tweets become input that is processed into data that has a rating. This research implements a user-based recommendation system based on ratings from tweets using collaborative filtering combined with the Singular Value Decomposition (SVD) algorithm and Support Vector Machine (SVM) classification and implemented it on user-based and item-based. This research aims to implement a system that combines collaborative filtering techniques with the Singular Value Decomposition (SVD) algorithm and Support Vector Machine (SVM) classification. With the hope of producing a good movie recommendation model and providing accurate predictions for recommended and non-recommended movies. The test results in this study show that Collaborative Filtering gets the best RMSE value of 0.8162 on user-based and 0.5911 on item-based. The combination of Singular Value Decomposition (SVD) algorithm and Support Vector Machine (SVM) classification using hyperparameter tuning resulted in 81% precision and 81% recall for user-based while 80% precision and 80% recall for item-based.
Sentiment Analysis Based on Aspects Using FastText Feature Expansion and NBSVM Classification Method Sukmawati Dwi Lestari; Erwin Budi Setiawan
Journal of Computer System and Informatics (JoSYC) Vol 3 No 4 (2022): August 2022
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/josyc.v3i4.2202

Abstract

Telkomsel is a service that the people of Indonesia widely use. Complaints from users referring to Telkomsel's service and signal aspects are often made in Twitter tweets with harsh or good language. This is done because users continue to demand to get better service. Therefore, an aspect-based sentiment analysis technique is needed to determine a person's view of each aspect, such as Telkomsel's service and signal aspects. Aspect-based sentiment analysis is a solution to find out the opinions of Telkomsel users based on their aspects. In its implementation, the NBSVM method is used as a classification model that is proven to work well compared to other methods, namely MNB and SVM. The implementation of the expansion of the FastText feature can affect the level of performance model, and the best results are obtained in the Top 1 feature on the signal aspect and Top 5 on the service aspect with a combination of Twitter corpus and news. In this study, the data used is unbalanced and has been handled by applying SMOTE and AdaBoost techniques to the FastText feature expansion model. Based on the results of the tests that have been carried out, SMOTE can handle data imbalances compared to AdaBoost. The performance results of the FastText feature expansion model after SMOTE are applied to get F1-Score 91.24% in the signal aspect and F1-Score 88.75% in the service aspect.
Recommender System with User-Based and Item-Based Collaborative Filtering on Twitter using K-Nearest Neighbors Classification Muhammad Shiba Kabul; Erwin Budi Setiawan
Journal of Computer System and Informatics (JoSYC) Vol 3 No 4 (2022): August 2022
Publisher : Forum Kerjasama Pendidikan Tinggi (FKPT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/josyc.v3i4.2204

Abstract

Netflix is one of the most widely used applications for watching movies online. There are various movie titles that can be watched by users, so a recommendation system is needed to help users who feel confused in choosing movie titles. Twitter is a social media used to express ideas, thoughts, and feelings. Not a few Twitter users who conduct movie discussions, with the movie discussion can be converted into a rating that can be used in the recommendation system. Collaborative Filtering is one of the methods of the recommendation system, by recommending based on the similarity between users (user-based) and based on items that have similarities with user-selected items (item-based). In this research, the Collaborative Filtering method is combined with K-Nearest Neighbors classification which obtains an RMSE value for user-based 1.8244 and item-based 0.5449. K-Nearest Neighbors gets 91.22% precision and 91.07% recall for user-based, while item-based gets 89.44% precision and 91.22% recall with the optimal K as a parameter is 3.
Netflix Movie Recommendation System Using Collaborative Filtering With K-Means Clustering Method on Twitter Muhammad Tsaqif Muhadzdzib Ramadhan; Erwin Budi Setiawan
JURNAL MEDIA INFORMATIKA BUDIDARMA Vol 6, No 4 (2022): Oktober 2022
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/mib.v6i4.4571

Abstract

Nowadays, the development of technology is very rapid, so watching movies at home has become a means of entertainment. Netflix is one of the platforms for watching movies and provides various movie titles. However, because of the many movie titles, it makes it difficult for users to determine the movie they want to watch. The solution to this problem is to provide a recommendation system that can provide movie recommendations to watch. Collaborative filtering is a method that exists in the recommendation system by providing recommendations based on the ratings given by other users. Collaborative filtering is divided into two, namely based on items (item-based) and based on users (user-based). Twitter is a social media used to write posts called tweets. For this system, tweets serve as data that will be processed into ratings. This research was conducted using k-means clustering with collaborative filtering and collaborative filtering only. By using a dataset obtained from Twitter by crawling data and added with ratings from IMDb, Rotten Tomatoes, and Metacritic. Which resulted in a dataset with 35 users, 785 movie titles, and 6184 reviews. Then preprocessing the data with text processing, polarity, and labeling. And get the dataset that will be used for this experiment. The results of this research test show that k-means clustering with collaborative filtering gets the best results with the best prediction of 2.8466, getting an MAE value of 0.5029, and an RMSE value of 0.6354
Recommender System Movie Netflix using Collaborative Filtering with Weighted Slope One Algorithm in Twitter Rakhmat Rifaldy; Erwin Budi Setiawan
Building of Informatics, Technology and Science (BITS) Vol 4 No 2 (2022): September 2022
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v4i2.1959

Abstract

Movies are entertainment that many people enjoy filling their spare time. After watching a movie, people usually write reviews about the movie on social media such as Twitter. As the number of movies grows, a recommendation system is created, which is useful for finding movies they might like based on the movies they have seen. This study developed a movie recommendation system using Collaborative Filtering (CF) with the Weighted Slope One (WSO) algorithm. The dataset used is taken from tweet data on Twitter. Then the tweet dataset is converted into a rating value which will later be used in the recommendation system. This study uses Mean Absolute Error (MAE) to measure accuracy. In Collaborative Filtering, the system gets the best MAE of 0.924. Then for Weighted Slope One, the system gets the best MAE of 0.568.
Co-Authors Abdullah, Athallah Zacky Adriana, Kaysa Azzahra Agung Toto Wibowo Ahmad Zahri Ruhban Adam Aji Reksanegara Aji, Hilman Bayu Alvi Rahmy Royyan Anang Furkon RIfai Anindika Riska Intan Fauzy Annisa Aditsania Annisa Cahya Anggraeni Annisa Cahya Anggraeni Annisa Rahmaniar Dwi Pratiwi Arie Ardiyanti Arki Rifazka Arsytania, Ihsani Hawa Athirah Rifdha Aryani Aufa Ab'dil Mustofa Aydin, Raditya Bagas Teguh Imani Bayu Muhammad Iqbal Bayu Surya Dharma Sanjaya Billy Anthony Christian Martani Brenda Irena Brigita Tenggehi Crisanadenta Wintang Kencana Damarsari Cahyo Wilogo Daniar Dwi Pratiwi Daniar Dwi Pratiwi Dea Alfatihah Nindya Erlani Dede Tarwidi Dedy Handriyadi Dery Anjas Ramadhan Dhinta Darmantoro Diaz Tiyasya Putra Dion Pratama Putra, Dion Pratama Diyas Puspandari Dwi Hariyansyah Riaji Faidh Ilzam Nur Haq Famardi Putra Muhammad Raffly Raffly Fathurahman Alhikmah Fathurahman Alhikmah Fazira Ansshory, Azrina Febiana Anistya Feby Ali Dzuhri Fhina Nhita Fhina Nhita Fida Nurmala Nugraha Fikri Maulana, Fikri Firdaus, Dzaki Afin Fitria, Mahrunissa Azmima Fitria Gde Bagus Janardana Abasan, I Ghina Dwi Salsabila Gita Safitri Grace Yohana Grace Yohana Hanif Reangga Alhakiem Hildan Fawwaz Naufal Husnul Khotimah Farid I Gusti Ayu Putu Sintha Deviya Yuliani I Kadek Candradinata I Made Darma Cahya Adyatma Ibnu Sina, Muhammad Noer Ilyana Fadhilah Inggit Restu Illahi Irma Palupi Isabella Vichita Kacaribu Isep Mumu Mubaroq Isman Kurniawan Kamil, Ghani Kartika Prameswari Kemas Muslim Lhaksmana Kevin Usmayadhy Wijaya Khoirunnisa, Sanabila Luthfi Firmansah M. Arif Bijaksana Mahmud Imrona Mansel Lorenzo Nugraha Marissa Aflah Syahran Marissa Aflah Syahran Maulina Gustiani Tambunan Mela Mai Anggraini Moh Adi Ikfini M Moh. Hilman Fariz Muhammad Afif Raihan Muhammad Faiq Ardyanto Putro Muhammad Khiyarus Syiam Muhammad Kiko Aulia Reiki Muhammad Nur Ilyas Muhammad Shiba Kabul Muhammad Tsaqif Muhadzdzib Ramadhan Nabilla Kamil Naufal Adi Nugroho Naufal Razzak , Robith Nilla, Arliyanna Nisa Maulia Azahra Nur Ihsan Putra Munggaran Nuril Adlan , Muhammad Prahasto, Girindra Syukran Rafi Anandita Wicaksono Raisa Sianipar Rakhmat Rifaldy Ramadhan, Ananta Ihza Ramadhan, Helmi Sunjaya Rayhan Rahmanda Refka Muhammad Furqon Regina Anatasya Rudiyanto Rendo Zenico Ridho Maulana Cahyudi Rifaldy Bintang Ramadhan Rizki Annas Sholehat Roji Ellandi Salsabil, Adinda Arwa Sari Ernawati Saut Sihol Ritonga Septian Nugraha Kudrat Septian Nugraha Kudrat Setiawan, Rizki Tri Shakina Rizkia Siti Inayah Putri Sri Suryani Sri Suryani Sukmawati Dwi Lestari Syafa Fahreza Syahdan Naufal Nur Ihsan Valentino, Nico Wida Sofiya Widiarta, I Wayan Abi Widjayanto, Leonardus Adi Widyanto, Jammie Reyhan Windy Ramadhanti Yoan Maria Vianny Yuliant Sibaroni Zahwa Dewi Artika Zakaria, Aditya Mahendra ZK Abdurahman Baizal