Ratih Prasetya
Badan Meteorologi Klimatologi dan Geofisika


Data Mining Application on Weather Prediction Using Classification Tree, Naïve Bayes and K-Nearest Neighbor Algorithm With Model Testing of Supervised Learning Probabilistic Brier Score, Confusion Matrix and ROC
Ratih Prasetya
JAICT Vol 4, No 2 (2019)
Publisher : Politeknik Negeri Semarang

DOI: 10.32497/jaict.v4i2.1690

Abstract

Classification is a data mining technique used to predict relationships between data in a dataset. Prediction is performed by assigning data to several different classes based on certain factors. Classification is an application of supervised learning, in which the training data already carry labels when entered as input. It is an empirical approach that can be used for short-term weather prediction. The most widely used classification algorithms are Classification Tree, Naïve Bayes and K-Nearest Neighbors. In this study, the author used these three algorithms to predict rain, with the Brier score, confusion matrix and ROC curve as validation measures. The input data are synoptic observations from Kemayoran Meteorological Station, Jakarta (96745), covering 10 years (2006-2015) and comprising 3528 records with 8 attributes. After data processing, selection and model testing, the Naïve Bayes algorithm showed the best accuracy, 77.1%, which falls in the fair classification category, so it has potential for operational use. The dominant weather attributes in rain formation are relative humidity (RHavg), minimum temperature (Tmin), maximum temperature (Tmax), average temperature (Tavg) and wind direction (ddd).
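
The three validation measures named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the author's code: the function names are hypothetical, the 0.5 decision threshold is an assumption, and the six forecasts are made-up toy values rather than Kemayoran station data. The Brier score is the mean squared difference between forecast probability and observed outcome, the confusion matrix counts hits and misses after thresholding, and the area under the ROC curve is computed here through its pairwise-ranking interpretation.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between forecast probability and outcome (0 or 1)."""
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(outcomes)


def confusion_matrix(probs, outcomes, threshold=0.5):
    """Return (tp, fp, fn, tn) after turning probabilities into rain / no-rain calls."""
    tp = fp = fn = tn = 0
    for p, o in zip(probs, outcomes):
        predicted_rain = p >= threshold  # assumed decision threshold
        if predicted_rain and o == 1:
            tp += 1
        elif predicted_rain and o == 0:
            fp += 1
        elif not predicted_rain and o == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def accuracy(tp, fp, fn, tn):
    """Fraction of all cases that were classified correctly."""
    return (tp + tn) / (tp + fp + fn + tn)


def roc_auc(probs, outcomes):
    """Area under the ROC curve: the chance that a randomly chosen rain day
    received a higher forecast probability than a randomly chosen dry day
    (ties count as one half)."""
    rain = [p for p, o in zip(probs, outcomes) if o == 1]
    dry = [p for p, o in zip(probs, outcomes) if o == 0]
    wins = sum(1.0 if r > d else 0.5 if r == d else 0.0 for r in rain for d in dry)
    return wins / (len(rain) * len(dry))


# Made-up toy data: forecast rain probabilities and observed outcomes (1 = rain).
probs = [0.9, 0.8, 0.3, 0.6, 0.2, 0.1]
outcomes = [1, 1, 1, 0, 0, 0]

bs = brier_score(probs, outcomes)
cm = confusion_matrix(probs, outcomes)
acc = accuracy(*cm)
auc = roc_auc(probs, outcomes)
print(bs, cm, acc, auc)
```

A lower Brier score indicates better-calibrated probabilistic forecasts, while accuracy and the ROC area summarize how well the thresholded and ranked forecasts separate rain days from dry days.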