International Journal of Electrical and Computer Engineering
Vol 15, No 3: June 2025

Enhancing training performance for small models using data-centric approaches

El-Khoribi, Reda A. (Unknown)
Emary, Eid (Unknown)
Hassan, Amr Essam (Unknown)



Article Info

Publish Date
01 Jun 2025

Abstract

In this work, we propose a new system to improve the performance of classification models by applying data-centric principles. The system optimizes datasets by removing poor-quality samples and generating high-quality synthetic data. We tested the system on various classification models and datasets, measuring its performance with accuracy, precision, recall, and F1-score. The results showed significant improvements in classification performance, highlighting the effectiveness of this data-centric approach. While the scalability to large-scale datasets is still an open question, it offers great potential for future research. This approach could be valuable in critical areas like healthcare, finance, and autonomous systems, where high-quality data is crucial. Future work could explore advanced data augmentation, adapting the system for different data types like text and time-series, and extending it to semi-supervised and unsupervised learning. Our findings emphasize the importance of data quality in achieving better model performance, often overlooked in favor of model architecture. By advancing data-centric artificial intelligence (AI), this work offers a practical framework for researchers and practitioners to optimize datasets and improve machine learning systems.

Copyrights © 2025






Journal Info

Abbrev

IJECE

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708, a SCOPUS indexed Journal, SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR & CiteScore Q2 on both of the Electrical & Electronics Engineering, and Computer Science) is the official publication of the Institute of ...