Jurnal INFOTEL
Vol 14 No 4 (2022): November 2022

KNN Imputation of Missing Values in Regression-Based Rain Duration Prediction on BMKG Data

Ikke Dian Oktaviani (School of Computing, Telkom University)
Aji Gautama Putrada (Advanced and Creative Networks Research Center, Telkom University)



Article Info

Publish Date
01 Nov 2022

Abstract

Predicting rain duration from Meteorology, Climatology, and Geophysics Agency (BMKG) data is important but remains an open problem. In addition, several studies have shown that missing values degrade a model's predictive performance. This study proposes k-nearest neighbors (KNN) imputation to handle missing values in rain duration prediction, using a dataset sourced from BMKG. For the regression model, we compare gradient boosting regression (GBR), adaptive boosting regression (ABR), and linear regression (LR). We benchmark KNN imputation against zero imputation, mean imputation, and iterative imputation. The coefficient of determination (r2), mean squared error (MSE), and mean bias error (MBE) measure the performance of these imputation methods. The results show that GBR is the best-performing regression model for rain duration prediction, with r2 = 0.915 on the training data and r2 = 0.776 on the test data. The proposed KNN imputation outperforms the benchmark imputation methods: at a missing percentage of 90%, prediction with KNN imputation achieves r2 = 0.71 and MSE = 0.36.
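
The idea behind KNN imputation and the three reported metrics can be sketched in a few lines of standard-library Python. This is only an illustrative sketch, not the authors' implementation: the abstract does not state the distance measure, the value of k, or the MBE sign convention, so the choices below are assumptions (a distance computed over the features two rows share and rescaled for the missing ones, k = 2, and MBE taken as prediction minus observation). The function names and the toy rows are hypothetical and are not BMKG data.

```python
import math


def shared_distance(a, b):
    """Euclidean distance over features present in both rows.

    Assumption: the squared distance is rescaled by (total features /
    shared features) so rows with many gaps are not unfairly "close".
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf  # nothing to compare on
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))


def knn_impute(rows, k=2):
    """Replace each None with the mean of that feature over the k nearest
    rows in which the feature is observed. Gaps with no usable neighbor
    are left as None."""
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors: other rows that actually observed feature j.
            donors = sorted(
                (shared_distance(row, other), other[j])
                for m, other in enumerate(rows)
                if m != i and other[j] is not None
            )
            nearest = [v for d, v in donors[:k] if d < math.inf]
            if nearest:
                filled[i][j] = sum(nearest) / len(nearest)
    return filled


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mbe(y_true, y_pred):
    """Mean bias error; assumed sign convention: prediction minus observation."""
    return sum(p - t for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Toy feature rows (hypothetical values); None marks a missing value.
rows = [
    [1.0, 2.0, 3.0],
    [1.0, 2.0, None],
    [8.0, 8.0, 9.0],
    [1.0, 3.0, 5.0],
]
filled = knn_impute(rows, k=2)
print(filled[1])  # the gap is filled from the two closest rows (rows 0 and 3)
```

In a comparison like the one the abstract describes, the imputed matrix would then be passed to a regression model and scored with `r2`, `mse`, and `mbe`; zero or mean imputation would replace only the `knn_impute` step.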

Copyrights © 2022






Journal Info

Abbrev

infotel

Subject

Computer Science & IT; Electrical & Electronics Engineering

Description

Jurnal INFOTEL is a scientific journal published by Lembaga Penelitian dan Pengabdian Masyarakat (LPPM) of Institut Teknologi Telkom Purwokerto, Indonesia. Jurnal INFOTEL covers the field of informatics, telecommunication, and electronics. First published in 2009 for a printed version and published ...