Garuda - Garba Rujukan Digital

Jurnal ilmiah teknologi informasi Asia

Vol 20 No 1 (2026): Volume 20 Issue 1 2026 (8)

Riska, Suastika Yulia
Widiyaningtyas, Triyanna
Elmunsyah, Hakkun
Sendari, Siti

Publish Date
23 Jan 2026

Rapid advances in Artificial Intelligence and Machine Learning have made datasets essential, yet their growing use has raised complex ethical dilemmas that are frequently neglected. This study identifies and highlights the ethical concerns associated with collecting primary data and reusing secondary datasets in computer science research. We conducted a Systematic Literature Review (SLR) in accordance with the PRISMA 2020 guidelines, examining 72 publications from 2021 to 2025 sourced from five academic databases (Scopus, Web of Science, IEEE Xplore, ACM Digital Library, Google Scholar). The results indicate that ethical challenges arise consistently in both primary and secondary datasets. Primary datasets mainly face challenges related to privacy threats, anonymization, and informed consent, whereas secondary datasets are more susceptible to licensing infringements, dataset repurposing, and insufficient transparency about data preparation. The three domains that most often encountered these challenges were Machine Learning, Computer Vision, and Natural Language Processing. Moreover, data manipulation practices, including cherry-picking and concealed preparation, were identified as detrimental to scientific integrity. These findings underscore the need for stronger ethical standards for datasets and greater transparency in documenting data preparation to ensure the repeatability of data-driven research.

Journal Info

Jurnal ilmiah teknologi informasi Asia

Abbrev

jitika

Publisher

Institut Teknologi dan Bisnis Asia Malang

Subject

Computer Science & IT

Description

Published by the Institute for Research, Development and Community Service (Lembaga Penelitian, Pengembangan dan Pengabdian Masyarakat / LP2M) of the High School of Information & Computer Management (Institut Teknologi dan Bisnis Asia Malang) as a periodical publication that provides information and analysis ...

Article Info

Abstract

Ethical Challenges in Primary vs. Secondary Datasets: A Systematic Review of Manipulation and Transparency
