Garuda - Garba Rujukan Digital

Jurnal Ilmiah Giga

Vol. 28 No. 2 (2025): Volume 28 Edisi 2 Tahun 2025

Sabila, Fadlilah Izzatus (Unknown)
Nugroho, Catur Adi (Unknown)
Fauziah, Fauziah (Unknown)

Publish Date
19 Jan 2026

Personal data leakage is becoming an increasingly serious issue, especially when the leaked data has been partially modified to avoid direct matching with the original source. This study develops a fuzzy approach based on algorithmic mapping of each attribute (field-algorithm pairing) as well as a weighting scheme based on relevance, to support a many-to-one data match between the leaked data and the original database. Four algorithms are used: Levenshtein, Jaro-Winkler, Token Sort Ratio, and Cosine Similarity, selected based on the semantic characteristics of the attributes. Experiments were conducted on 10,000 synthetic data with various modification scenarios, including clean data, light modification, and weight modification Results showed high performance in both clean data and light modification (F1-score 0.90–1.00), but significantly decreased in heavy modification (F1-score 0.10–0.45). This approach offers a lightweight yet effective solution for the early stages of identity verification in data leak investigations, as well as opening up opportunities for further development through a combination of algorithms and adaptive adjustment of matching thresholds.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Jurnal Ilmiah Giga

Website

Abbrev

giga

Publisher

Universitas Nasional Jakarta

Subject

Education

Description

GIGA Scientific Journals is a scientific publication from research and literature study which are conducted by undergraduate, graduate, doctoral student, researchers and lecturers to be published widely and can be utilized as widely as possible for the advancement of science and technology at ...

Article Info

Abstract

Strategi Pencocokan Fuzzy Berdasarkan Specific-Field Untuk Validasi Kebocoran Data Termodifikasi: Studi Eksperimen Many-To-One Matching

Article Info

Abstract