cover
Contact Name
Huzain
Contact Email
huzain.azis@umi.ac.id
Phone
+628114484875
Journal Mail Official
ijodas.journal@gmail.com
Editorial Address
Jln. Paccerakkang, Kel. Berua, Kec.Biringkanaya, Kota Makassar, Propinsi Sulawesi Selatan, 90241
Location
Unknown,
Unknown
INDONESIA
Indonesian Journal of Data and Science
Published by yocto brain
ISSN : -     EISSN : 27159930     DOI : -
Core Subject : Science, Education,
IJODAS provides online media to publish scientific articles from research in the field of Data Science, Data Mining, Data Communication, Data Security and Data Representation
Articles 171 Documents
Fine-Tuning a Large Language Model on Vertex AI for a New Student Registration Chatbot at Universitas Muhammadiyah Makassar Desi Anggreani; Muhyiddin A M Hayat; Lukman; Ahmad Faisal; Khadijah; Darniati
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.341

Abstract

This study addresses the limitations of manual admission services at Universitas Muhammadiyah Makassar, which often result in delayed and inconsistent information delivery. To overcome these challenges, an institution-specific chatbot was developed by fine-tuning the Gemini 2.5 Flash model on the Google Cloud Vertex AI platform. The model was trained using a curated domain-specific dataset of 1,430 question–answer pairs derived from official documents and frequently asked questions. The fine-tuning process employed supervised learning to enhance contextual relevance and response accuracy. System performance was evaluated using automated text quality metrics, achieving an average BLEU score of 0.23526 and a ROUGE-L Recall score of 0.53424, indicating satisfactory lexical and semantic similarity. Furthermore, a user acceptance evaluation involving 52 respondents yielded a Customer Satisfaction Score (CSAT) of 84.2%, reflecting high user satisfaction. These results demonstrate that fine-tuning a Large Language Model (LLM) for specific institutional needs effectively improves both response quality and service reliability. Ultimately, this approach offers a practical and scalable solution for modernizing student admission services in higher education, ensuring that prospective students receive accurate information in a timely and efficient manner.
Vehicle Detection Using YOLOv8 on Low-Resolution Images Nifal; Farniwati Fattah; Andi Widya Mufila Gaffar
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.371

Abstract

Vehicle detection in low-resolution images remains a significant challenge in computer vision, particularly for embedded devices such as ESP32-CAM with limited computational resources and simple image resolution. This study evaluates the performance of YOLOv8 on low-resolution QVGA (320 × 240 pixels) images for vehicle detection and classification. The dataset was independently collected in a controlled laboratory environment using miniature vehicles, covering four vehicle classes (motorcycle, car, bus, and truck) with a total of 4,000 images and a 70:20:10 data split. A pretrained YOLOv8 model was fine tuned for 100 epochs and tested on an ESP32-CAM prototype. The evaluation results demonstrate excellent performance, achieving precision of 0.999, recall of 1.000, mAP@0.5 of 0.995, and mAP@0.5-0.95 of 0.995 on the validation data, as well as real-time detection accuracy of 97% for motorcycles and cars, and 99% for buses and trucks. These findings indicate that YOLOv8 can deliver reliable vehicle detection performance on low-resolution images and is suitable for implementation in embedded device-based systems
Artificial Intelligence (AI) using Long Short-Term Memory (LSTM) for Sales Prediction in Campus Minimarkets Harlinda L; Abdul Rachman Manga; Ramdan Satra
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.373

Abstract

This study applies Artificial Intelligence (AI) using the Long Short-Term Memory (LSTM) algorithm to predict daily sales at the FIKOM-UMI Minimarket. Sales data from 2023 to 2024 involving 82 items were used and processed into a time series format. Five LSTM architectural scenarios were tested, including baseline, bigger model, lightweight, bidirectional LSTM, and single-layer medium, to identify the most effective model in capturing sales patterns. The data underwent preprocessing stages, including daily aggregation, reindexing to fill missing dates, and normalization using MinMaxScaler before being transformed into sequences with a 30-day time step. Model performance was evaluated using MSE, RMSE, MAPE, and accuracy metrics. The results show that the Bidirectional LSTM (Scenario 4) achieved the best performance, with the lowest MAPE of 19.43% and the highest accuracy of 80.57%. The model successfully generated stable predictions for 7-day and 30-day forecasting with a range of 153–155 units per day, indicating consistent sales patterns. Testing on the top 10 best-selling items showed significant performance variation, with GARUDA ROSTA BWNG 100 Gram achieving the highest accuracy (46.97%), while aoka rasa pandan showed the lowest performance (-76.05%). These findings demonstrate that the LSTM model can be effectively applied for sales prediction in campus minimarkets; however, a hybrid approach with product segmentation is recommended to optimize inventory management across product categories with varying levels of predictability
Sentiment Classification and Influential Actor Detection on Twitter (Case Study: The Raja Ampat Mining Conflict) Micguel Arter Imbiri; Lorna Yertas Baisa; Josua Josen A. Limbong
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.376

Abstract

The nickel mining conflict in Raja Ampat has attracted extensive public attention due to the region’s global ecological significance and the potential environmental risks posed by extractive activities. Social media platforms, particularly Twitter, have become important spaces for public discussion and opinion exchange regarding this issue. This study aims to analyze public sentiment and identify influential actors in online discussions of the Raja Ampat mining conflict by integrating sentiment analysis and Social Network Analysis (SNA). This study adopts a cross-sectional design using Indonesian-language tweets collected between 15-27 November 2025. A total of 11,671 tweets were obtained through keyword-based crawling, and after preprocessing and duplicate removal, 8,909 tweets were retained for analysis. Sentiment labeling was performed using a lexicon-based approach, categorizing tweets into positive, neutral, and negative classes. The dataset was divided using an 80:20 train–test split. Sentiment classification was conducted using Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Naive Bayes algorithms. Model performance was evaluated using confusion matrix–based metrics, including accuracy, precision, recall, and F1-score. Social Network Analysis was carried out by constructing a directed interaction network based on mentions, replies, and retweets, with influential actors identified using degree and betweenness centrality measures. The results indicate that neutral sentiment dominates the discourse (51.58%), followed by negative and positive sentiments. SVM and Naive Bayes demonstrate more stable classification performance than KNN, while network analysis shows that influence is concentrated among a limited number of central actors
Comparing Sentiment Labeling with RoBERTa and IndoBERTweet on Public Opinion of Program Makan Bergizi Gratis Putri Nur Rezky; Dolly Indra; Herdianti
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.381

Abstract

The Program Makan Bergizi Gratis (MBG) is a flagship program of the Prabowo Subianto administration launched in 2024, triggering diverse public responses on social media. Sentiment analysis using deep learning models offers an effective approach to understanding public opinion at scale. However, selecting the appropriate model for Indonesian social media text remains challenging. This study aims to compare the performance of two pretrained transformer models, RoBERTa Base and IndoBERTweet Base, in conducting automatic sentiment labeling on Indonesian tweets related to the MBG program using a zero-shot labeling approach without human-annotated ground truth. A total of 1,831 tweets were collected from platform X and preprocessed using case folding, normalization, and stopword removal. Both models were applied in parallel to label each tweet with sentiment categories (positive, neutral, negative) along with confidence scores. The comparison was evaluated using agreement rate, Cohen's Kappa, and confidence score analysis. RoBERTa Base exhibits a conservative tendency with 75.20% neutral labels, while IndoBERTweet Base produces a more balanced distribution (68.16% neutral). The comparison shows 77.28% agreement with Cohen's Kappa of 0.490 (Moderate Agreement). RoBERTa Base achieves higher confidence (mean: 0.9559, 83.01% above 0.95) compared to IndoBERTweet Base (mean: 0.9236, 68.65% above 0.95). IndoBERTweet Base is more effective in detecting negative sentiment, identifying nearly twice as many negative tweets (13.54% vs. 7.48%). This study recommends IndoBERTweet Base for exploratory research requiring sensitive sentiment detection and RoBERTa Base for precision-critical applications. An ensemble approach combining both models is recommended for production-critical applications
Comparison of Naïve Bayes and SVM in Sentiment Analysis of ChatGPT for Learning on X and YouTube Ni Putu Eka Swari; Ni Wayan Jeri Kusuma Dewi; Ni Ketut Utami Nilawati; Aniek Suryanti Kusuma; Ni Luh Wiwik Sri Rahayu Ginantra
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.382

Abstract

The rapid development of artificial intelligence technology has encouraged users to actively express opinions on social media platforms such as X and YouTube, including discussions on the use of ChatGPT as a learning support tool. This study aims to analyze public sentiment toward the use of ChatGPT in learning contexts by comparing the performance of the Naïve Bayes and Support Vector Machine (SVM) classification methods. A total of 5,500 comments from platform X and 5,543 comments from YouTube were collected through a crawling process using relevant keywords during the period from January 2023 to December 2025. The data were preprocessed and labeled into three sentiment classes (positive, negative, and neutral) using a lexicon-based approach with the INSET Lexicon. Feature extraction was conducted using the Term Frequency–Inverse Document Frequency (TF-IDF) method, and the dataset was divided into training and testing sets with an 80:20 ratio. Model performance was evaluated using accuracy, precision, recall, and F1-score. The results show that the SVM classifier consistently outperformed the Naïve Bayes method on both platforms. On platform X, SVM achieved an accuracy of 76.67%, while Naïve Bayes obtained 74.60%. On YouTube, SVM achieved an accuracy of 73.10%, significantly higher than Naïve Bayes at 62.04%. These findings indicate that SVM is more effective for sentiment analysis of social media data related to the use of ChatGPT in learning environments
Implementation of an AI Agent Chatbot with a Dynamic Knowledge Base from Google Drive for Journal Information Service Huzain Azis; Randika sakha Ramadani Putra Ahmad; Irawati Syahriawan
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.384

Abstract

This study presents the implementation of an AI agent chatbot to support journal information services on the Open Journal Systems (OJS) platform and the Telegram messaging application using a dynamic knowledge base sourced from Google Drive. The chatbot provides automated responses to user inquiries related to journal scope, publication fees, submission procedures, and review timelines, while allowing journal administrators to update information content without modifying the system. Functional testing results indicate that the chatbot delivers accurate and consistent information with acceptable response times across both platforms. The implementation demonstrates that integrating an AI agent chatbot with a dynamic knowledge base can enhance information accessibility, reduce administrative workload, and improve service efficiency in academic journal management
Application of the DeepSurv Model to Predict Survival in Patients with Kidney Failure Undergoing Hemodialysis Rizki Amanda; Aviolla Terza Damaliana; Muhammad Idhom; Muhamad Liswansyah Pratama
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.389

Abstract

This study aims to improve survival prediction in patients with kidney failure undergoing hemodialysis, given their high mortality risk. Traditional models such as Cox Proportional Hazards (Cox PH) have limitations in capturing complex and nonlinear relationships in clinical data. Therefore, this study applies DeepSurv, a deep learning–based survival model, and compares its performance with Cox PH and Cox PH Spline. A total of 300 patients were included, with 165 events and 135 censored observations. The data were split into training and testing sets. DeepSurv was implemented using two hidden layers (64 and 32 neurons), a dropout rate of 0.2, and a learning rate of 1e-3. The model was trained for up to 1000 epochs with early stopping at epoch 435. Performance was evaluated using the concordance index (C-index) and time-dependent AUC at 365, 544, and 730 days. Patients were stratified into low-, medium-, and high-risk groups based on predicted scores. Results showed that Cox PH achieved a C-index of 0.913 and average AUC of 0.964, while Cox PH Spline reached 0.917 and 0.971. DeepSurv achieved a C-index of 0.920 and average AUC of 0.969. Performance differences were small, but DeepSurv provided consistent individual risk estimates. In conclusion, DeepSurv is a flexible approach with performance comparable to Cox-based models. Further external validation and clinical evaluation are needed before wider application
Kodály Hand Sign Recognition from Hand Landmarks Using XGBoost Achmad Zulfikar; Farniwati Fattah; Andi Widya Mufila Gaffar Gaffar
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.391

Abstract

Introduction: Angklung is a traditional Indonesian musical instrument that continues to evolve through digital technology. However, computer vision–based gesture recognition for controlling physical angklung instruments remains limited. This study investigates landmark-based recognition of Kodály hand signs and evaluates its application for real-time angklung interaction. Method: Hand landmarks were extracted using MediaPipe Hands from RGB camera input. Each gesture was represented by 63 normalized numerical features derived from 21 landmarks. The dataset consists of 8,000 images representing eight Kodály gesture classes (Do–Do'). Gesture classification was performed using the Extreme Gradient Boosting (XGBoost) algorithm. Model evaluation applied a subject-independent two-fold scheme using accuracy, precision, recall, F1-score, and confusion matrix analysis. Real-time system trials were conducted under different lighting conditions and capture distances, and TCP communication with an ESP32 controller was evaluated. Results: The model achieved 96.63% accuracy in Fold 1 and 96.40% in Fold 2. Misclassifications were mainly observed between visually similar gestures, particularly La and Mi. Separate real-time system trials showed consistent recognition under bright lighting, while accuracy decreased under dim lighting, especially for Do (90%) and Mi (86.7%). Gesture recognition remained reliable up to approximately 1.5 m. TCP testing over 200 command events recorded 0% failed acknowledgments with a mean round-trip time of 87.36 ms. Conclusion: These indicate that landmark-based Kodály gesture classification using MediaPipe Hands and XGBoost can support real-time angklung interaction under controlled conditions, although improvements are needed for low-light environments and visually similar gestures
Drug Recommendation Using Multilabel Classification with Decision Tree Based on Patient Complaints and Diagnoses Muh Aristsyah Malik; Harlinda; Herdianti Darwis
Indonesian Journal of Data and Science Vol. 7 No. 1 (2026): Indonesian Journal of Data and Science
Publisher : yocto brain

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.56705/ijodas.v7i1.397

Abstract

This study develops a drug recommendation system using multilabel classification with the Decision Tree algorithm based on patient complaint and diagnosis data from electronic medical records. The dataset consists of patient visit records from a community health center in Pangkajene and Kepulauan Regency and is transformed using multi-hot encoding. Model performance is evaluated under three dataset scenarios (N=500, N=800, and N=1000) using multilabel metrics, including Micro-F1, Samples-F1, Hamming Loss, Jaccard Similarity, Hit@5, Precision@K, and Recall@K. The best Decision Tree model achieved a Micro-F1 score of 0.292, Samples-F1 of 0.281, and Hit@5 of 0.690 on the N=1000 dataset scenario. Bootstrap validation with 1000 iterations indicates relatively stable performance, with narrow confidence intervals across evaluation metrics. These results show that the multilabel Decision Tree model is capable of capturing relationships between patient complaints, diagnoses, and drug therapies while maintaining an interpretable decision structure