cover
Contact Name
Husni Teja Sukmana
Contact Email
husni@bright-journal.org
Phone
+62895422720524
Journal Mail Official
jads@bright-journal.org
Editorial Address
Gedung FST UIN Jakarta, Jl. Lkr. Kampus UIN, Cemp. Putih, Kec. Ciputat Tim., Kota Tangerang Selatan, Banten 15412
Location
Kota adm. jakarta pusat,
Dki jakarta
INDONESIA
Journal of Applied Data Sciences
Published by Bright Publisher
ISSN : -     EISSN : 27236471     DOI : doi.org/10.47738/jads
One of the current hot topics in science is data: how can datasets be used in scientific and scholarly research in a more reliable, citable and accountable way? Data is of paramount importance to scientific progress, yet most research data remains private. Enhancing the transparency of the processes applied to collect, treat and analyze data will help to render scientific research results reproducible and thus more accountable. The datasets itself should also be accessible to other researchers, so that research publications, dataset descriptions, and the actual datasets can be linked. The journal Data provides a forum to publish methodical papers on processes applied to data collection, treatment and analysis, as well as for data descriptors publishing descriptions of a linked dataset.
Articles 51 Documents
Search results for , issue "Vol 6, No 1: JANUARY 2025" : 51 Documents clear
Fake vs Real Image Detection Using Deep Learning Algorithm Fatoni, Fatoni; Kurniawan, Tri Basuki; Dewi, Deshinta Arrova; Zakaria, Mohd Zaki; Muhayeddin, Abdul Muniif Mohd
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.490

Abstract

The purpose of this research project is to address the growing issues presented by modified visual information by developing a deep learning model for identifying between real and fake images. To enhance accuracy, this project evaluates the effectiveness of deep learning algorithms such as Residual Neural Network (ResNet), Visual Geometry Group 16 (VGG16), and Convolutional Neural Network (CNN) together with Error Level Analysis (ELA) as preprocessing the dataset. The CASIA dataset contains 7,492 real images and 5,124 fake images. The images included are from a wide range of random subjects, including buildings, fruits, animals, and more, providing a comprehensive dataset for model training and validation. This research examined models' effectiveness through experiments, measuring their training and validation accuracies. It comes out with the best accuracy of each model, which is for Convolutional Neural Network (CNN), 94% for training accuracy, and validation accuracy of 92%. For VGG16, with both training and validation accuracy reaching 94%. Lastly, Residual Neural Network (ResNet) demonstrated optimal performance with 95% training accuracy and 93% validation accuracy. This project also constructs a system prototype for practical applications, offering an interface for real-world testing. When integrating into the system prototype, only Residual Neural Network (ResNet) shows consistency and effectiveness when predicting both fake and real images, and this led to the decision to choose ResNet for integration into the system. Furthermore, the project identified several areas for improvement. Firstly, expanding the model comparison for discovering more successful algorithms. Next, improving the dataset preprocessing phase by incorporating filtering or denoising techniques. Lastly, refining the system prototype for greater appeal and user-friendliness has the potential to attract a larger audience.
Using Machine Learning Approach to Cluster Marine Environmental Features of Lesser Sunda Island Lusiana, Evellin Dewi; Astutik, Suci; Nurjannah, Nurjannah; Sambah, Abu Bakar
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.478

Abstract

Mapping marine ecosystems is acknowledged as a vital tool for implementing ecosystem services in practical situations. It provides a framework for effective marine spatial planning, enabling the designation of marine protected areas (MPAs) that consider ecological connectivity and habitat requirements. It also helps pinpoint areas of high biodiversity or ecological significance, allowing conservationists to prioritize these regions for protection and management. Numerous studies over decades have produced a vast amount of data that illustrates the features of the marine ecosystem. Therefore, the unsupervised learning is a promising technique to map marine ecosystem based on its environmental features. This study aims to compare unsupervised learning techniques to analyze marine environmental features in order to map marine ecosystem in Lesser Sunda waters. Eleven global environmental variables were accessed from global databases. The Lesser Sunda waters were delineated into groups according to their environmental characteristics using four unsupervised learning techniques: k-mean, fuzzy c-mean, self-organizing map (SOM), and density-based spatial clustering of applications with noise (DBSCAN). According to the findings, the Lesser Sunda waters can be divided into five to nine clusters, each with distinct environmental features. Moreover, the fuzzy c-mean method's clustering result outperformed the others based on the highest Silhouette (0.2204478) and Calinski-Harabasz (1741.099) Index. As an unsupervised learning technique, fuzzy c-mean clustering offered good performance in delineating Lesser Sunda Island marine waters with five clusters. The clustering results mostly consistent with existing conservation programs, even though there are several areas which needed international and multinational organization collaboration to effectively accomplish marine conservation objectives.
Object-Level Sentiment Analysis Use a Language Model Le, Thuy Thi; Phan, Tuoi Thi
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.448

Abstract

Sentiment analysis remains a prominent area of research in the natural language processing (NLP) community and holds significant practical value in domains such as commerce and education. Most existing approaches evaluate sentiments for a single object or product, typically categorizing them as positive or negative. However, when a text involves comparisons between multiple objects, it can be challenging to identify which sentiment or emotion is associated with which object. Few studies have addressed this issue, often stopping at evaluating emotions at the sentence level or for individual words related to aspects or objects. This study proposes an object-level sentiment analysis problem that produces a set of pairs or triples consisting of an object, aspect, and sentiment. Additionally, in texts expressing opinions or comments on a specific aspect, the aspect may be implied through references to the object without being explicitly mentioned. Identifying such implicit aspects is crucial, as it ensures no loss of information and enhances the efficiency of extraction of information in object-level sentiment analysis. The integration of implicit aspect identification and object-level sentiment analysis is the primary focus of this research. In recent years, many language models have been developed and effectively applied to various NLP tasks. Therefore, to address the proposed challenges, this study utilizes deep learning that incorporates language models combined with NLP methods such as parsing and dependency analysis, to achieve the desired output. Using language model and NLP techniques automatically generate training data for the learning model. The proposed method achieves an accuracy of 90%, making a substantial contribution to the field of NLP.
Fuzzy TOPSIS-Based Group Decision Model for Selecting IT Employees Vania, Abigail; Utama, Ditdit Nugeraha
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.511

Abstract

In the era of digitalization, the demand for competent IT employees is growing rapidly. However, the IT employee selection process often faces various challenges, such as biased selection criteria, many applicants, and difficulty in objective assessment. These challenges can lead to inaccurate selection decisions and have a negative impact on company performance. This research aims to develop a Group Decision Support Model (GDSM) for IT Employee Selection using the Fuzzy TOPSIS method to enhance objectivity and reliability in decision-making. This GDSM combines assessments from HRD and User IT groups by considering the weight of each criterion. The proposed model overcomes bias, uncertainty, and subjectivity in judgments from both groups. The GDSM is constructed with 8 parameters/sub-criteria (2 criteria) from the HRD group and 12 parameters (5 criteria) from the User IT group from interviews and research. Thus, the total is 20 assessment parameters, consisting of coding test, education, certification, computer literacy, openness to experience, conscientiousness, extroversion, agreeableness, neuroticism, verbal, numerical, ability to learn, appearance attitude, work experience, communication skills, time management, job knowledge, motivation to apply, decision making, and service orientation. The methodology involves determining parameters, weights, fuzzification and this GDSM was tested through a limited simulation of IT employee selection using 11 respondents from Computer Science students for evaluation of the model. The result of this model is a ranking of the candidates. The best candidate is Cand. 8, with a closeness coefficient (CC) value of 0.896. The worst candidate is Cand. 3, with CC 0.241. The model is acceptable because it has no difference value between coding and manual for all candidates. This study contributes to increasing objectivity in IT employee selection and offers an implementation model for companies that want to improve the effectiveness of the recruitment process.
Improving Classification Accuracy of Local Coconut Fruits with Image Augmentation and Deep Learning Algorithm Convolutional Neural Networks (CNN) Usman, Usman; Yunita, Fitri; Ridha, Muh. Rasyid
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.389

Abstract

Local coconut varieties must be classified to maintain the quality and genetic diversity of coconuts as the main commodity in Indonesia's largest coconut-producing region. This study introduces a deep learning module for improved classification of coconuts, using color jitter as part of a data augmentation strategy to supplement the existing dataset and utilizing well-known CNN-based models like VGG16 for image analysis, with a focus on the needs of future research. The goal is to improve the classification accuracy of local coconut varieties through deep learning. We investigate both data augmentations and EDA, and we use VGG-16-based CNN models to enhance the classification performance. We used a confusion matrix for the model evaluation, containing metrics like accuracy, precision, recall, and f1-score. Results reveal that a color jitter augmentation model attained a training accuracy of 99.12%, testing accuracy of 97.33%, and validation accuracy of 97.33%. Model exploration using VGG16, on the other hand, improved all three: training accuracy—99.87%, testing accuracy—98.77%, and validation accuracy—98.97% average F1-score: 99%. Our research contributes massively to providing the best automatic classification method that will benefit and help farmers shorten their jobs while promoting economic growth in trading effectively across Indonesian regions. Its novelty is in combining image augmentation and CNNs, concerning the VGG16 model, showing better.
Polarization of Religious Issues in Indonesia’s Social Media Society and Its Impact on Social Conflict Faizin, Barzan; Fitri, Susanti Ainul; AS, Enjang; Maylawati, Dian Sa'adillah; Rizqullah, Naufal; Ramdhani, Muhammad Ali
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.447

Abstract

In this new era, people use social media to share information and discuss political, social, and religious issues, leading to pros and cons arguments. In Twitter’s hashtags and tweets, religious issues frequently trigger a hot conversation that causes disputes among citizens and even street movements. This study is intended to reveal the religious issues that often trigger polarization among Twitter users and how they influence horizontal conflict in society as what happened during the election period in 2019. This research applied mixed methods with social media analytics to conceal religious issues in Indonesia's social media society. The data collection was done by crawling data from the Indonesian Twitter users’ tweets regarding religious issues hashtags, which is a reference for further analysis. The research findings show that the top eight religious issues widely discussed based on 23,433 Twitter users’ tweets are the hashtags (#) salafi, wahabi, intoleransi (intolerance), taliban, anti-Pancasila, politisasi agama (politicization of religion), politik identitas (identity politics), and radikalisme (radicalism). In social conversation networks, these issues are related to each other and other issues such as political figures, the three presidential candidates, the general election, and the Republic of Indonesia presidential election in 2024. Concerning these issues, Twitter users believe that the issues, positive or negative, do not influence their religious and political stance. However, to a certain extent, they believe that religious issues impact social discourses regarding horizontal conflicts. 44% opinions prove this indicated that the debate over religious matters had little influence on their opinion of these topics, and 64.5% agreed that religious concerns can cause social strife. Finally, it is hoped that further studies will elaborate on how religious issues on Twitter and other social media directly impact social harmony in everyday life.
Current and Future Trends for Sustainable Software Development: Software Security in Agile and Hybrid Agile through Bibliometric Analysis Maidin, Siti Sarah; Yahya, Norzariyah; Fauzi, Muhammad Ashraf bin Fauri; Bakar, Normi Sham Awang Abu
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.473

Abstract

The industrial growth of digitalized era has given rise to a growing concern in software development. The present research investigates the prevailing and projected patterns in sustainable software development, especially those related to process innovation, with a particular emphasis on software security within Agile and Hybrid Agile approaches, employing bibliometric analysis. However, a comprehensive understanding of the security concerns of both agile and hybrid agile is crucial and needs further garnered. However, it is expected that a thorough comprehension of the hybrid agile model landscape would uncover various themes encompassing its implementation. The analysis aims to provide a comprehensive overview of the current, present, and future state of software security for agile and hybrid agile. The study employed a bibliometric approach to gather a total of 1593 journals from the Web of Science (WOS) database. This study utilizes co-citation and co-word analysis techniques to identify the most significant articles, delineate the fundamentals framework, and provide a prognosis for future development. The present investigation has successfully discovered four distinct co-citation and three distinct co-word clusters. This study offers valuable insights regarding the software security in agile and hybrid agile. The increasing evolution of the software ecosystem necessitates the prioritization of bridging the gap between agility and security. This paper provides a detailed roadmap for scholars and practitioners who are navigating this intersection
Gamification Effect of Team Games Tournament in Game-Based Learning on Student Motivation Wijaya, Anugerah Bagus; Nida, Faridatun; Zettira, Salsa Billa Zulmi; Suliswaningsih, Suliswaningsih; Afiana, Fiby Nur; Rifai, Zanuar
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.450

Abstract

This study examines the impact of gamification elements, specifically the duration of use and level of collaboration, on student motivation in online learning environments. Using the Team Games Tournament model, which combines elements of both competition and collaboration, a web-based Game-Based Learning application was developed to enhance student motivation. The study employed a motivation survey based on the model Attention, Relevance, Confidence, Satisfaction, which was administered to participants before and after using the application. In addition to the survey, interaction data, such as the duration of application use, frequency of participation, points earned, and the level of collaboration, were collected to assess the relationship between these factors and student motivation. The study involved 20 fifth-semester students (12 male, 8 female) enrolled in a digital games course, many of whom had prior gaming experience, which could influence their response to the gamified learning experience. The data collected was analyzed using Decision Tree algorithms, Pearson correlation, and simple linear regression to understand the impact of various gamification elements on motivation. The results showed that both the duration of application use and the level of collaboration were significant factors in increasing student motivation. Specifically, motivation increased by an average of 0.72 points for every 10 minutes of application use, as measured by the difference between pre-test and post-test survey scores. These findings underscore the importance of balancing competitive and collaborative elements within game-based learning environments. By incorporating features that promote collaboration and encouraging sustained application use, educators can significantly enhance student engagement and motivation. The study provides valuable insights for the development of future game-based learning applications, highlighting the need for optimal design in terms of collaboration and duration to create an effective and engaging digital learning experience.
The Development of Stacking Techniques in Machine Learning for Breast Cancer Detection Van FC, Lucky Lhaura; Anam, M. Khairul; Bukhori, Saiful; Mahamad, Abd Kadir; Saon, Sharifah; Nyoto, Rebecca La Volla
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.416

Abstract

This study addresses the challenges of accurately detecting breast cancer using machine learning (ML) models, particularly when handling imbalanced datasets that often cause model bias toward the majority class. To tackle this, the Synthetic Minority Over-sampling Technique (SMOTE) was applied not only to balance the class distribution but also to improve the model's sensitivity in detecting malignant tumors, which are underrepresented in the dataset. SMOTE was effective in generating synthetic samples for the minority class without introducing overfitting, enhancing the model's generalization on unseen data. Additionally, AdaBoost was employed as the meta model in the stacking framework, chosen for its ability to focus on misclassified instances during training, thereby boosting the overall performance of the combined base models. The study evaluates several models and combinations, with K-Nearest Neighbors (KNN) + SMOTE achieving an accuracy of 97%, precision, recall, and F1-score of 97%. Similarly, C4.5 + Hyperparameter Tuning + SMOTE reached 95% in all metrics. The stacking model with Logistic Regression (LR) as the meta model and SMOTE achieved a strong performance with 97% accuracy, precision, recall, and F1-score all at 97%. The best result was obtained using the combination of Stacking AdaBoost + Hyperparameter Tuning + SMOTE, reaching an accuracy of 98%. These findings highlight the effectiveness of combining SMOTE with stacking techniques to develop robust predictive models for medical applications. The novelty of this study lies in the integration of SMOTE and advanced stacking methods, particularly using AdaBoost and Logistic Regression, to address the issue of class imbalance in medical datasets. Future work will explore deploying this model in clinical settings for accurate and timely breast cancer detection.
Machine Learning Models for Predicting Flood Events Using Weather Data: An Evaluation of Logistic Regression, LightGBM, and XGBoost Maharina, Maharina; Paryono, Tukino; Fauzi, Ahmad; Indra, Jamaludin; Sihabudin, Sihabudin; Harahap, Muhammad Khoiruddin; Rizki, Lutfi Trisandi
Journal of Applied Data Sciences Vol 6, No 1: JANUARY 2025
Publisher : Bright Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47738/jads.v6i1.503

Abstract

This study examines flood prediction in Jakarta, Indonesia, a pressing concern due to its significant implications for public safety and urban management. Machine Learning (ML) presents promising methodologies for accurately forecasting floods by leveraging weather data. However, flood prediction in Jakarta remains challenging due to the city’s highly variable weather patterns, including fluctuations in rainfall, humidity, temperature, and wind characteristics. Existing methods often struggle with these complexities, as they rely on traditional ML models such as K-Nearest Neighbors (KNN), which may not capture certain patterns or provide high accuracy and robustness. Therefore, this study proposes three ML methods—Logistic Regression (LR), LightGBM, and XGBoost—to predict floods accurately. Five performance metrics (i.e., accuracy, area under the curve (AUC), precision, recall, and F1-score) were used to measure and compare the accuracy of the algorithms. The proposed method consists of three main processes. The first process involves data preprocessing and evaluation using 14 different ML models. In the second process, additional feature engineering is applied to improve the quality of the data. Finally, the third process combines the previous steps with oversampling techniques and cross-validation methods. This structured approach aims to enhance the overall performance of the analysis. The experimental results show that Process 3 significantly improves performance compared to Processes 1 and 2. The model predicts floods with an accuracy score of 93.82% for LR, 96.67% for XGBoost, and 96.81% for LightGBM, respectively. Thus, the proposed model offers a solution for operational decision-making in flood risk management, including flood mitigation planning.