JURNAL MEDIA INFORMATIKA BUDIDARMA
Vol 7, No 3 (2023): Juli 2023

Handling Unbalanced Data Sets Using DBMUTE and NearMiss Methods to Improve Classification Performance of Yeast Data Sets

Bima Mahardika Wirawan (Telkom University, Bandung)
Mahendra Dwifebri Purbolaksono (Telkom University, Bandung)
Fhira Nhita (Telkom University, Bandung)



Article Info

Publish Date
23 Jul 2023

Abstract

Yeast vacuole biogenesis was chosen as a model system for organelle assembly because most vacuole functions can be used for vegetative cell growth. Therefore it is possible to generate an extensive collection of mutants with defects in unbalanced vacuole assembly. With this in mind, we must find the structural balance of data in yeast. Imbalanced data is when there is an unbalanced distribution of data classes and the number of data classes is either more or lower than the number of other data classes. Our method uses the f1score performance matrix method and the balanced accuracy on DBMUTE and NearMiss undersampling. Previously, only a few studies explained the results of using a performance matrix and balanced accuracy. Then, find out the performance results of the f1 score and balanced accuracy and get the best score from the yeast datasets. In the study, a comparison between the imbalanced datasets using the undersampling method. Furthermore, to obtain the performance matrix results, use the f1 score and balance accuracy. After testing five yeast datasets, we performed an average f1 score and balance accuracy with the highest average NearMiss f1 score of 62.23% and the highest average balanced accuracy of 78.59%.

Copyrights © 2023






Journal Info

Abbrev

mib

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering

Description

Decission Support System, Expert System, Informatics tecnique, Information System, Cryptography, Networking, Security, Computer Science, Image Processing, Artificial Inteligence, Steganography etc (related to informatics and computer ...