Suraya Masrom
Faculty of Computer and Mathematical Sciences, Universiti Teknologi Mara

Published : 3 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 3 Documents
Search

An application of machine learning on corporate tax avoidance detection model Rahayu Abdul Rahman; Suraya Masrom; Normah Omar; Maheran Zakaria
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 9, No 4: December 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v9.i4.pp721-725

Abstract

Corporate tax avoidance reduces government revenues which could limit country development plans. Thus, the main objectives of this study is to establish a rigorous and effective model to detect corporate tax avoidance to assist government to prevent such practice. This paper presents the fundamental knowledge on the design and implementation of machine learning model based on five selected algorithms tested on the real dataset of 3,365 Malaysian companies listed on bursa Malaysia from 2005 to 2015. The performance of each machine learning algorithms on the tested dataset has been observed based on two approaches of training. The accuracy score for each algorithm is better with the cross-validation training approach. Additionationally, with the cross-validation training approach, the performances of each machine learning algorithm were tested on different group of features selection namely industry, governance, year and firm characteristics. The findings indicated that the machine learning models present better reliability with industry, governance and firm characteristics features rather than single year determinant mainly with the Random Forest and Logistic Regression algorithms.
Different mutation and crossover set of genetic programming in an automated machine learning Suraya Masrom; Masurah Mohamad; Shahirah Mohamed Hatim; Norhayati Baharun; Nasiroh Omar; Abdullah Sani Abd. Rahman
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 9, No 3: September 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (657.112 KB) | DOI: 10.11591/ijai.v9.i3.pp402-408

Abstract

Automated machine learning is a promising approach widely used to solve classification and prediction problems, which currently receives much attention for modification and improvement. One of the progressing works for automated machine learning improvement is the inclusion of evolutionary algorithm such as Genetic Programming. The function of Genetic Programming is to optimize the best combination of solutions from the possible pipelines of machine learning modelling, including selection of algorithms and parameters optimization of the selected algorithm.  As a family of evolutionary based algorithm, the effectiveness of Genetic Programming in providing the best machine learning pipelines for a given problem or dataset is substantially depending on the algorithm parameterizations including the mutation and crossover rates. This paper presents the effect of different pairs of mutation and crossover rates on the automated machine learning performances that tested on different types of datasets. The finding can be used to support the theory that higher crossover rates used to improve the algorithm accuracy score while lower crossover rates may cause the algorithm to converge at earlier stage.
Machine learning building price prediction with green building determinant Thuraiya Mohd; Syafiqah Jamil; Suraya Masrom
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 9, No 3: September 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (412.66 KB) | DOI: 10.11591/ijai.v9.i3.pp379-386

Abstract

In the era of Industrial 4.0, many urgent issues in the industries can be effectively solved with artificial intelligence techniques, including machine learning. Designing an effective machine learning model for prediction and classification problems is an ongoing endeavor. Besides that, time and expertise are important factors that are needed to tailor the model to a specific issue, such as the green building housing issue. Green building is known as a potential approach to increase the efficiency of the building. To the best of our knowledge, there is still no implementation of machine learning model on GB valuation factors for building price prediction compared to conventional building development. This paper provides a report of an empirical study that model building price prediction based on green building and other common determinants. The experiments used five common machine learning algorithms namely Linear Regression, Decision Tree, Random Forest, Ridge and Lasso tested on a set of real building datasets that covered Kuala Lumpur District, Malaysia. The result showed that the Random Forest algorithm outperforms the other four algorithms on the tested dataset and the green building determinant has contributed some promising effects to the model.