Jurnal Informatika Universitas Pamulang
Vol 9 No 2 (2024): JURNAL INFORMATIKA UNIVERSITAS PAMULANG

A Hybrid Model for Human DNA Sequence Classification Using Convolutional Neural Networks and Random Forests

Gregorius Airlangga (Universitas Katolik Indonesia Atma Jaya)



Article Info

Publish Date
30 Jul 2024

Abstract

Human DNA sequence classification is a fundamental task in genomics, essential for understanding genetic variations and their implications for disease susceptibility, personalized medicine, and evolutionary biology. This study proposes a novel hybrid model that combines Convolutional Neural Networks (CNN) for feature extraction with a Random Forest classifier for final classification. The model was evaluated on a dataset of human DNA sequences and achieved an accuracy of 75.34%. Performance metrics, including precision, recall, and F1-scores across multiple classes, showed significant improvements over traditional models. The CNN component effectively captures local dependencies and patterns within the sequences, while the Random Forest classifier handles complex decision boundaries, resulting in enhanced classification accuracy. Comparative analysis demonstrated the superiority of the hybrid approach: the CNN-LSTM model achieved only 59.47% accuracy, and other RNN-based models such as CNN-GRU and CNN-BiLSTM reached similarly low accuracy. These results suggest that hybrid models can leverage the strengths of both deep learning and traditional machine learning techniques, offering a more effective tool for DNA sequence classification. Future work will optimize the model architecture and explore larger, more diverse datasets to validate the approach's generalizability and robustness.
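The feature-extraction half of such a hybrid can be illustrated with a minimal sketch. It is not the authors' implementation: in the study the convolutional filters are learned by a CNN, whereas here two hand-written motif kernels (`MOTIF_KERNELS`, a hypothetical name) stand in for them, and the A/C/G/T channel order and all function names are assumptions made for illustration. Each sequence is one-hot encoded, scanned with every kernel, passed through a ReLU, and reduced by global max pooling to one feature per kernel. The resulting fixed-length vectors are the kind of input a Random Forest classifier would then be trained on; that step is not shown.

```python
from typing import List

# Channel order for the one-hot encoding (an assumption for this sketch).
BASES = "ACGT"

Matrix = List[List[float]]


def one_hot(seq: str) -> Matrix:
    """Encode a DNA string as one 4-dimensional one-hot row per position.

    Symbols outside A/C/G/T (for example N) become all-zero rows.
    """
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]


def conv1d_global_max(encoded: Matrix, kernel: Matrix) -> float:
    """Slide one kernel (width x 4) along the sequence, apply ReLU,
    and keep the maximum activation (global max pooling)."""
    width = len(kernel)
    best = 0.0  # ReLU floor; also the result when the sequence is too short
    for start in range(len(encoded) - width + 1):
        score = sum(
            encoded[start + i][c] * kernel[i][c]
            for i in range(width)
            for c in range(len(BASES))
        )
        best = max(best, score)
    return best


def extract_features(seq: str, kernels: List[Matrix]) -> List[float]:
    """Return one pooled activation per kernel: a fixed-length feature vector."""
    encoded = one_hot(seq)
    return [conv1d_global_max(encoded, kernel) for kernel in kernels]


# Hypothetical fixed kernels standing in for learned CNN filters.
MOTIF_KERNELS = [one_hot("TATA"), one_hot("GCGC")]

if __name__ == "__main__":
    # The first feature is high because the sequence contains TATA exactly.
    print(extract_features("ccTATAgg", MOTIF_KERNELS))
```

Because pooling collapses the position axis, sequences of different lengths map to vectors of the same length, which is what allows a tabular classifier such as a Random Forest to consume them.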

Copyrights © 2024






Journal Info

Abbrev

informatika

Publisher

Subject

Computer Science & IT

Description

Jurnal Informatika Universitas Pamulang is a periodical scientific journal that contains research results in the field of computer science from all aspects of theory, practice and application. Papers can be in the form of technical papers or surveys of recent developments research ...