JAMBURA JOURNAL OF PROBABILITY AND STATISTICS
Vol 4, No 1 (2023): Jambura Journal Of Probability and Statistics

Perbandingan Kinerja Metode Regresi K-Nearest Neighbor dan Metode Regresi Linear Berganda pada Data Boston Housing

Lutfi Sivana Ihzaniah (Prodi Matematika, Fakultas Sains dan Matematika, Universitas Kristen Satya Wacana Salatiga)
Adi Setiawan (Prodi Matematika, Fakultas Sains dan Matematika, Universitas Kristen Satya Wacana Salatiga)
Rachel Wulan N. Wijaya (Prodi Matematika, Fakultas Sains dan Matematika, Universitas Kristen Satya Wacana Salatiga)



Article Info

Publish Date
31 May 2023

Abstract

This research was made in order to see which method  performance is better between the KNN (K-Nearest Neighbor) regression method and the multiple linear regression method on Boston Housing data. The method performace referred here is MAE, RMSE, MAPE, and R2. The KNN method is a method to predict something based on the closest training examples of an object. Meanwhile, multiple linear regression is a forecasting technique involving more than one independent variable. The comparison of the two methods is based on the results of the Mean Absolute Percent Error (MAPE). In this research the definitions of distance used are Euclidean distance and Minkowski distance. The K value in the KNN method defines the number of nearest neighbors to be examined to determine the value of a dependent variable, in this research we use K values from 1 to 10 for each test data and definition of distance. In this research, the percentage of test data used was 20%, 30%, and 40% for both methods. The best MAPE value obtained by the KNN regression method was 12,89% at K = 3 for Euclidean distance and 13,22% at K = 3 for Minkowski distance. Meanwhile the best MAPE value for the multiple linear regression method is 17,17%. The best method between the two methods is the KNN regression method as seen from the MAPE value of the KNN regression method which is smaller than the MAPE value of the multiple linear regression method.

Copyrights © 2023






Journal Info

Abbrev

jps

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Environmental Science Social Sciences

Description

Probability Theory Mathematical Statistics Computational Statistics Stochastic Processes Financial Statistics Bayesian Analysis Survival Analysis Time Series Analysis Neural Network Another field which is related to statistics and the applications Another field which is related to Probability and ...