JURNAL POLI-TEKNOLOGI
Vol. 23 No. 1 (2024)

Designing A Pdf Malware Detection System Using Machine Learning

Salman Abdul Jabbaar Wiharja (Universitas Pendidikan Indonesia)
Deden Pradeka (Universitas Pendidikan Indonesia)
Wirmanto Suteddy (Universitas Pendidikan Indonesia)



Article Info

Publish Date
22 Feb 2024

Abstract

This research proposes an approach to build malicious PDF detection system using random forest algorithm, focusing the Evasive-PDFMal2022 dataset which is updated and extended with the addition of new datasets. This dataset includes malicious PDF files from CVE and Exploit-DB, non-malicious PDF files, as well as files from private collections and Technically-oriented PDF Collection. Features were extracted using the PDFID tool, resulting in 29 structural features that formed the basis for the Random Forest classification algorithm. Experiments showed that the model trained with the new dataset provided accuracy equivalent to the Evasive-PDFMal2022 model, at 98%, albeit with a small decrease in recall for the benign class. In addition, this research involved the creation of a website for metadata extraction and malicious PDF detection. Recognition goes to the dataset contributors, tool developers, and dataset providers from NIST and Exploit-DB. Overall, this research successfully increased the representation and diversity of the dataset, provided good model training results, improved detection from 3 malicious PDF variants to 13 variants, and created a practical tool for malicious PDF extraction and detection. Nonetheless, further development may be required to improve detection performance in more complex scenarios

Copyrights © 2024






Journal Info

Abbrev

politeknologi

Publisher

Subject

Aerospace Engineering Automotive Engineering Chemical Engineering, Chemistry & Bioengineering Civil Engineering, Building, Construction & Architecture Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Energy Engineering Industrial & Manufacturing Engineering Materials Science & Nanotechnology Mechanical Engineering Transportation

Description

Poli-Teknologi Journal is a journal, which began publication in 2002, published by the Research and community service Unit of Politeknik Negeri Jakarta. It starts from Volume 1 Number 1 in January 2022 for printed version; ISSN (print) 1412-2782 and ISSN (online) 2407-9103. Poli-Teknologi Journal is ...