Twitter is a social media that allows users to share information with others in real time. Information that is shared on Twitter is usually referred to as a tweet. Sentiment analysis is a branch of research in the text mining domain where the process of identifying and extracting sentiment data will usually be categorized based on its polarity, whether it is positive, negative or neutral. We can process data from opinions on Twitter using data mining techniques, namely classification. The algorithm that will be used in this research is the Naïve Bayes Algorithm. This research will also use the RStudio application. It is a computer programming language that allows users to program algorithms and use tools that have been developed through R by other users. R is a high-level programming language and is also an environment for data and graph analysis. Based on the experimental results, using a comparison of training data and test data of 20%: 80%, 40%: 60%, 60%: 40%, 80%: 20% and 90%:10%, the results of sentiment classification using the Naïve Bayes method are obtained. and using 10-fold cross validation obtained an average value of 85.00% accuracy and The decrease in machine learning performance occurs in the ratio of 80:20 or 1440 training data: 360 data testing, while the ratio of 20%:80% and 90%:10% has the same accuracy value, namely 85.41%.
Copyrights © 2024