Indonesian Journal of Electrical Engineering and Computer Science
Vol 33, No 1: January 2024

A hybrid model for data visualization using linear algebra methods and machine learning algorithm

Mohsin Ali (Medi-Caps University Indore)
jitendra Choudhary (Medi-Caps University Indore)
Tanmay Kasbe (Shri Vaishnav Vidyapeeth Vishwavidyalaya)



Article Info

Publish Date
01 Jan 2024

Abstract

The t-distributed stochastic neighbor embedding (t-SNE) is a powerful technique for visualizing high-dimensional datasets. By reducing the dimensionality of the data, t-SNE transforms it into a format that can be more easily understood and analyzed. The existing approach is to visualize high-dimensional data but not deeply visualize. This paper proposes a model that enhances visualization and improves the accuracy. The proposed model combines the non-linear embedding technique t-SNE, the linear dimensionality reduction method principal component analysis (PCA), and the QR decomposition algorithm for discovering eigenvalues and eigenvectors. In Addition, we quantitatively compare the proposed model QRPCA-t-SNE with PCA-t-SNE using the following criteria: data visualization with different perplexity and different principal components, confusion matrix, model score, mean square error (MSE), training, testing accuracy, receiver operating characteristic curve (ROC) score, and AUC score.

Copyrights © 2024