Journal of Dinda : Data Science, Information Technology, and Data Analytics
Journal of Dinda : Data Science, Information Technology, and Data Analytics as a publication media for research results in the fields of Data Science, Information Technology, and Data Analytics, but not implicitly limited. Published 2 times a year in February and August. The journal is managed by the Data Engineering Research Group, Faculty of Informatics, Telkom Purwokerto Institute of Technology. Journal of Dinda is a medium for scientific studies resulting from research, thinking, and critical-analytic studies regarding Data Science, Informatics, and Information Technology. This journal is expected to be a place to foster enthusiasm in education, research, and community service which continues to develop into supporting references for academics. FOCUS AND SCOPE Journal of Dinda : Data Science, Information Technology, and Data Analytics receive scientific articles with the scope of research on: Machine Learning, Deep Learning, Artificial Intelligence, Databases, Statistics, Optimization, Natural Language Processing, Big Data and Cloud Computing, Bioinformatics, Computer Vision, Speech Processing, Information Theory and Models, Data Mining, Mathematical, Probabilistic and Statical Theories, Machine Learning Theories, Models and Systems, Social Science, Information Technology
Articles
15 Documents
Search results for
, issue
"Vol 4 No 2 (2024): August"
:
15 Documents
clear
Development of Sentiment Analysis System of Simple Pol Application on Google Play Store Using Naive Bayes Classifier Method and BERT Prediction
Muhammad Dhito Maulidan;
Sri Sumarlinda;
Sopingi Sopingi
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 4 No 2 (2024): August
Publisher : Research Group of Data Engineering, Faculty of Informatics
Show Abstract
|
Download Original
|
Original Source
|
Check in Google Scholar
|
DOI: 10.20895/dinda.v4i2.1577
Digitalization in public services raises various sentiments that are very dynamic, one example is the Simpel Pol Health Test application made by PT Cipta Sari Arsonia (CSA). The research objective is to obtain useful information from accurate community review sentiments for service improvement and feedback for service providers and application developers. The method used is Naïve Bayes Classifier with Tf-idf weighting, Multinomial Naïve Bayes with review value indicators and review sentences predicted by the BERT method as a determinant of sentiment value whether positive or negative. Sentiment towards the application shows quite encouraging results, from 3000 data analyzed with 1772 positive reviews and 263 negative reviews with 80% training data and 20% test data, the naïve bayes classification model is able to provide a high level of accuracy, which is 88.7% with a precision of 88.5%, recall of 100% and f1-score of 93.9%. The data showed that most people gave a positive response to this application, with the dominant word being 'easy'. This system was developed using the local-based streamlit framework and proved to be quite reliable in developing systems for data processing and web-based data analysis even though the scraping process is slightly longer than the google colab service. Future research is expected to be able to predict data that is positive or negative with several parameters and several sentiment analysis methods at once and their comparison.
Post-Election Sentiment Analysis 2024 via Twitter (X) Using the Naive Bayes Classifier Algorithm
Yessi Mayasari;
Yusuf Ramadhan Nasution
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 4 No 2 (2024): August
Publisher : Research Group of Data Engineering, Faculty of Informatics
Show Abstract
|
Download Original
|
Original Source
|
Check in Google Scholar
|
DOI: 10.20895/dinda.v4i2.1582
This study examines sentiment related to the topic of Twitter after the 2024 election, where the topic focused on the 2024 presidential election. Where there are a lot of public opinions and comments after the 2024 presidential election. One of them is the phenomenon when Anies-Muhaimin and Ganjar-Mahfud filed a lawsuit with the Constitutional Court (MK) to appeal over suspicions of fraud over the victory of the elected pair Prabowo-Gibran. By applying the Naïve Bayes Classifier algorithm to analyze public sentiment. Through data crawling, preprocessing, feature extraction, and sentiment classification, the study identified the dominant sentiment and its intensity among social media users. This methodology utilizes quantitative data analysis, using Twitter data linked to specific election-related hashtags. The findings reveal a mix of negative and positive sentiments, reflecting diverse public opinion about election results and related political developments. The accuracy of Naïve Bayes Classifier is highlighted, demonstrating its effectiveness in sentiment classification in the context of social media. This research contributes to understanding public sentiment in the political realm and improving methodological approaches in sentiment analysis using machine learning.
Comparison of Fisher-Yates Shuffle and Linear Congruent Algorithms for Question Randomization
Nugroho Dwi Saputro
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 4 No 2 (2024): August
Publisher : Research Group of Data Engineering, Faculty of Informatics
Show Abstract
|
Download Original
|
Original Source
|
Check in Google Scholar
|
DOI: 10.20895/dinda.v4i2.1584
This research aims to compare Fisher-Yates algorithm and Linear Congruent algorithm in generating random numbers or permutations. The test is conducted using the Chi-Square method to evaluate the quality of randomness generated by both algorithms. The Chi-Square value of the shuffling results is calculated and compared with the critical value of Chi-Square at a significance level of 0.05 with a degree of freedom (df) of 4, which is 9.488. The results show that the Chi-Square value for the Fisher-Yates algorithm is 3.8 and for the Linear Congruent algorithm is 4.3, both of which are below the critical value. This indicates that there is not enough evidence to reject the Null Hypothesis (H₀), implying that the difference in randomness quality between the two algorithms is not statistically significant. Therefore, both algorithms are considered to have equivalent performance. The decision to choose one of the algorithms can be based on other considerations such as complexity and efficiency. Further research is recommended to explore the performance of the algorithms under different conditions.
Document Similarity Using Term Frequency-Inverse Document Frequency Representation and Cosine Similarity
Adi Widianto;
Eka Pebriyanto;
Fitriyanti Fitriyanti;
Marna Marna
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 4 No 2 (2024): August
Publisher : Research Group of Data Engineering, Faculty of Informatics
Show Abstract
|
Download Original
|
Original Source
|
Check in Google Scholar
|
DOI: 10.20895/dinda.v4i2.1589
Document similarity is a fundamental task in natural language processing and information retrieval, with applications ranging from plagiarism detection to recommendation systems. In this study, we leverage the term frequency-inverse document frequency (TF-IDF) to represent documents in a high-dimensional vector space, capturing their unique content while mitigating the influence of common terms. Subsequently, we employ the cosine similarity metric to measure the similarity between pairs of documents, which assesses the angle between their respective TF-IDF vectors. To evaluate the effectiveness of our approach, we conducted experiments on the Document Similarity Triplets Dataset, a benchmark dataset specifically designed for assessing document similarity techniques. Our experimental results demonstrate a significant performance with an accuracy score of 93.6% using bigram-only representation. However, we observed instances where false predictions occurred due to paired documents having similar terms but differing semantics, revealing a weakness in the TF-IDF approach. To address this limitation, future research could focus on augmenting document representations with semantic features. Incorporating semantic information, such as word embeddings or contextual embeddings, could enhance the model's ability to capture nuanced semantic relationships between documents, thereby improving accuracy in scenarios where term overlap does not adequately signify similarity.
Implementation of Android-Based Flutter Framework and Waterfall Method in Building Marketplace Applications (MariUmroh)
Mardiah Ramadhani;
Ilka Zufria;
Ali Ikhwan
Indonesian Journal of Data Science, IoT, Machine Learning and Informatics Vol 4 No 2 (2024): August
Publisher : Research Group of Data Engineering, Faculty of Informatics
Show Abstract
|
Download Original
|
Original Source
|
Check in Google Scholar
|
DOI: 10.20895/dinda.v4i2.1594
The MariUmroh application is an Android-based Umrah and Hajj travel marketplace application designed specifically to bring together sellers and buyers in one digital platform. The MariUmroh marketplace application aims to make it easier for prospective pilgrims and Umrah and Hajj travel companies to interact, transact, and promote their products online. In its business processes, the MariUmroh application uses a B2C (Business to Customer) business model. Where business activities are carried out by travel admins to customers using electronic media directly. The development of the Android-based MariUmroh application is built with the Flutter Framework which uses the Dart programming language. In building the MariUmroh marketplace application, the development system uses the waterfall method, in its creation it begins with needs analysis, system design, program code writing, program testing, program implementation and maintenance. In the MariUmroh application which offers various Umrah and Hajj travel packages to the wider community easily and more efficiently, and sellers easily promote their products and don't have to worry about losing consumers, the features provided by the application are also very useful for pilgrims during their worship at holy land. In building this application, MySQL is used as a database for data storage. Keywords: Marketplace, Umrah and Hajj travel, flutter framework, Waterlfall