Ling, Yeong Tyng
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Malware detection using Gini, Simpson diversity, and Shannon-Wiener indexes Ling, Yeong Tyng; Chiew, Kang Leng; Phang, Piau; Zhang, Xiaowei
International Journal of Informatics and Communication Technology (IJ-ICT) Vol 14, No 2: August 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijict.v14i2.pp737-750

Abstract

The increasing number of malware attacks poses a significant challenge to cyber security. This paper proposes a methodology for static malware analysis using biodiveristy-inspired metrics that is Gini coefficient, Simpson diversity, and Shannon-Wiener index for malware detection. These metrics are used to build the structural feature representation on the raw binary file as the feature space. The effectiveness of these metrics are evaluated using multilayer perceptron (MLP) neural network and extreme gradient boosting (XGBoost) models. A deterministic algorithm is used to generate these features that represent the feature signature of the executable file. Additionally, we investigated the effectiveness of different byte sizes as the input feature for these two classifiers. According to the results, Gini coefficient with on chunk size of 128 has successfully achieved average F1 score of more than 98.7% by using XGBoost model.