S.M. Shaharudin
Universiti Pendidikan Sultan Idris

Published : 3 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 3 Documents
Search

Identification of Rainfall Patterns on Hydrological Simulation Using Robust Principal Component Analysis S.M. Shaharudin; N. Ahmad; N.H. Zainuddin; N.S. Mohamed
Indonesian Journal of Electrical Engineering and Computer Science Vol 11, No 3: September 2018
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v11.i3.pp1162-1167

Abstract

A robust dimension reduction method in Principal Component Analysis (PCA) was used to rectify the issue of unbalanced clusters in rainfall patterns due to the skewed nature of rainfall data. A robust measure in PCA using Tukey’s biweight correlation to downweigh observations was introduced and the optimum breakdown point to extract the number of components in PCA using this approach is proposed. A set of simulated data matrix that mimicked the real data set was used to determine an appropriate breakdown point for robust PCA and  compare the performance of the both approaches. The simulated data indicated a breakdown point of 70% cumulative percentage of variance gave a good balance in extracting the number of components .The results showed a  more significant and substantial improvement with the robust PCA than the PCA based Pearson correlation in terms of the average number of clusters obtained and its cluster quality.
Modified singular spectrum analysis in identifying rainfall trend over peninsular Malaysia S.M. Shaharudin; N. Ahmad; N.H. Zainuddin
Indonesian Journal of Electrical Engineering and Computer Science Vol 15, No 1: July 2019
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v15.i1.pp283-293

Abstract

Identifying the local time scale of the torrential rainfall pattern through Singular Spectrum Analysis (SSA) is useful to separate the trend and noise components. However, SSA poses two main issues which are torrential rainfall time series data have coinciding singular values and the leading components from eigenvector obtained from the decomposing time series matrix are usually assesed by graphical inference lacking in a specific statistical measure. In consequences to both issues, the extracted trend from SSA tended to flatten out and did not show any distinct pattern.  This problem was approached in two ways. First, an Iterative Oblique SSA (Iterative O-SSA) was presented to make adjustment to the singular values data. Second, a measure was introduced to group the decomposed eigenvector based on Robust Sparse K-means (RSK-Means). As the results, the extracted trend using modification of SSA appeared to fit the original time series and looked more flexible compared to SSA.
Fitting statistical distribution of extreme rainfall data for the purpose of simulation S.M. Shaharudin; N Ahmad; N.S. Mohamed; Hairulnizam Mahdin
Indonesian Journal of Electrical Engineering and Computer Science Vol 18, No 3: June 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v18.i3.pp1367-1374

Abstract

In this study, several types of probability distributions were used to fit the daily torrential rainfall data from 15 monitoring stations of Peninsular Malaysia from the period of 1975 to 2007. The study of fitting statistical distribution is important to find the most suitable model that could anticipate extreme events of certain natural phenomena such as flood and tsunamis. The aim of the study is to determine which distribution fits well with the daily torrential Malaysian rainfall data. Generalized Pareto, Lognormal and Gamma distributions were the distributions that had been tested to fit the daily torrential rainfall amount in Peninsular Malaysia. First, the appropriate distribution of the daily torrential rainfall was identified within the selected distributions for rainfall stations. Then, data sets were generated based on probability distributions that mimic a daily torrential rainfall data. Graphical representation and goodness of fit tests were used in finding the best fit model. The Generalized Pareto was found to be the most appropriate distribution in describing the daily torrential rainfall amounts of Peninsular Malaysia. The outputs can be beneficial for the purpose of generating several sets of simulated data matrices that mimic the same characteristics of rainfall data in order to assess the performance of the modification method compared to classical method.