Jurnal Ilmiah Giga
Vol. 28 No. 2 (2025): Volume 28 Edisi 2 Tahun 2025

Strategi Pencocokan Fuzzy Berdasarkan Specific-Field Untuk Validasi Kebocoran Data Termodifikasi: Studi Eksperimen Many-To-One Matching

Sabila, Fadlilah Izzatus (Unknown)
Nugroho, Catur Adi (Unknown)
Fauziah, Fauziah (Unknown)



Article Info

Publish Date
19 Jan 2026

Abstract

Personal data leakage is becoming an increasingly serious issue, especially when the leaked data has been partially modified to avoid direct matching with the original source. This study develops a fuzzy approach based on algorithmic mapping of each attribute (field-algorithm pairing) as well as a weighting scheme based on relevance, to support a many-to-one data match between the leaked data and the original database. Four algorithms are used: Levenshtein, Jaro-Winkler, Token Sort Ratio, and Cosine Similarity, selected based on the semantic characteristics of the attributes. Experiments were conducted on 10,000 synthetic data with various modification scenarios, including clean data, light modification, and weight modification Results showed high performance in both clean data and light modification (F1-score 0.90–1.00), but significantly decreased in heavy modification (F1-score 0.10–0.45). This approach offers a lightweight yet effective solution for the early stages of identity verification in data leak investigations, as well as opening up opportunities for further development through a combination of algorithms and adaptive adjustment of matching thresholds.

Copyrights © 2025






Journal Info

Abbrev

giga

Publisher

Subject

Education

Description

GIGA Scientific Journals is a scientific publication from research and literature study which are conducted by undergraduate, graduate, doctoral student, researchers and lecturers to be published widely and can be utilized as widely as possible for the advancement of science and technology at ...