Indonesian Journal of Electrical Engineering and Computer Science
Vol 24, No 3: December 2021

Classifying clinically actionable genetic mutations using KNN and SVM

Rohit Chivukula (University of Huddersfield)
T. Jaya Lakshmi (SRM University)
Sanku Satya Uday (SRM University)
Satti Thanuja Pavani (SRM University)



Article Info

Publish Date
01 Dec 2021

Abstract

Cancer is one of the major causes of death in humans. Early diagnosis of genetic mutations that cause cancer tumor growth leads to personalized medicine to the decease and can save the life of majority of patients. With this aim, Kaggle has conducted a competition to classify clinically actionable gene mutations based on clinical evidence and some other features related to gene mutations. The dataset contains 3321 training data points that can be classified into 9 classes. In this work, an attempt is made to classify these data points using K-nearest neighbors (KNN) and linear support vector machines (SVM) in a multi class environment. As the features are categorical, one hot encoding as well as response coding are applied to make them suitable to the classifiers. The prediction performance is evaluated using log loss and KNN has performed better with a log loss value of 1.10 compared to that of SVM 1.24.

Copyrights © 2021