Claim Missing Document
Check
Articles

Found 2 Documents
Search

Evaluation of Synthetic Data Effectiveness using Generative Adversarial Networks (GAN) in Improving Javanese Script Recognition on Ancient Manuscript Faizin, Muhammad 'Arif; Suciati, Nanik; Fatichah, Chastine
JUTI: Jurnal Ilmiah Teknologi Informasi Vol. 23, No. 1, January 2025
Publisher : Department of Informatics, Institut Teknologi Sepuluh Nopember

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12962/j24068535.v23i1.a1256

Abstract

The imbalance of Javanese script data in ancient manuscript recognition poses a challenge due to the limited availability of data. A potential approach to addressing this issue is the use of Generative Adversarial Networks (GAN). This study evaluates the effectiveness of synthetic data generated using Enhanced Balancing GAN (EBGAN) in mitigating data imbalance. Various evaluation scenarios are conducted, including: (i) assessing the impact of syn-thetic data as augmentation, (ii) evaluating the sufficiency of synthetic data for recognition models, (iii) analyzing minority class oversampling with different selection strategies, and (iv) evaluating model generalization through cross-validation. Quantitative analysis of the generated synthetic data, based on Fréchet Inception Distance (FID) and Structural Similarity Index (SSIM), as well as visual assessment, indicates that the quality of synthetic data closely resembles real data. Additionally, experimental results demonstrate that combining real and synthetic data improves accuracy, precision, recall, and F1-score. The oversampling strategy for synthetic data has proven effective in meeting the data sufficiency requirements for training recognition models. Meanwhile, selecting minority classes and determining threshold values based on percentage, distribution, and model performance in oversampling can serve as guidelines for enhancing script recognition performance. Compared to previous methods, the use of EBGAN has been shown to produce more diverse synthetic data with better visual quality. However, further research is still needed to optimize GAN performance in supporting script recognition.
Handling Imbalance in Javanese Manuscript Character Dataset using Skeleton-based Balancing Generative Adversarial Networks Faizin, Muhammad 'Arif; Suciati, Nanik; Fatichah, Chastine
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 9 No 4 (2025): August 2025
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v9i4.6572

Abstract

Javanese script is an important part of Indonesia’s cultural heritage, representing cultural values from the past. However, recognizing and classifying Javanese characters within manuscripts is challenging due to the limited availability of data and uneven distribution of character classes. The decline in formal use of Javanese script has drastically reduced the pool of manuscript samples, causing certain characters to appear rarely and skewing class frequencies. Existing methods that utilize Generative Adversarial Networks (GANs) attempt to address this problem. However, they often struggle to generate characters that are both consistent and visually accurate in terms of structural details. To address these issues, this study introduces a skeleton-based balancing GAN (SkelBAGAN), which improves the structural details of the previous method for generating characters. The proposed method introduces three main enhancements: (i) a layer for extracting the character skeleton structure, (ii) an optimized pretrained network using an autoencoder for learning the skeleton distribution, and (iii) refinement of the evaluation function, preserving both the distribution and structural fidelity in the adversarial process. The performance of the proposed model is evaluated against previous methods using the Fréchet Inception Distance (FID) to assess distribution quality and the Structural Similarity Index Measure (SSIM) to evaluate structural fidelity. The results indicate that the proposed methods outperform previous methods in balancing the FID and SSIM metrics. The integration of all enhancements in SkelBAGAN achieves the lowest FID, indicating improved generative quality while maintaining competitive SSIM values. The qualitative study indicates that SkelBAGAN outperforms previous methods in character generation. These results highlight how the skeleton-based improvement of the quality of generated characters enhances the recognition performance for underrepresented Javanese characters in imbalanced datasets. Ultimately, this work contributes to the broader effort to preserve the Javanese script as a vital element of Indonesia’s cultural identity.