Bau, Yoon-Teck
Unknown Affiliation

Published : 3 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 3 Documents
Search
Journal : JOIV : International Journal on Informatics Visualization

Predicting the Next Day's Closing Price of Stock Indices Using Machine Learning and Deep Learning Algorithms Cayzer, Ahmad Firdaus; Bau, Yoon-Teck
JOIV : International Journal on Informatics Visualization Vol 9, No 2 (2025)
Publisher : Society of Visual Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62527/joiv.9.2.3501

Abstract

Share prices are a critical factor in a stock index’s worth but are never constant. Thus, an effective method of predicting share prices is needed. This is where machine learning comes in. This research discusses the applicability of machine learning algorithms, precisely long short-term memory, artificial neural networks, and linear regression in predicting share prices. Additionally, this research goes in-depth, explaining how each algorithm functions. These three algorithms were implemented using the financial dataset of the S&P 500, one of the more known stock indices out there. Data was collected from Yahoo Finance for 34 years, from 1990 to 2023. Then, the algorithms mentioned were used to train a model using the collected dataset. All three algorithms were measured using three performance metrics: mean absolute error, R-squared score, and mean absolute percentage error. The final implementation involved training them by only using 1-day lagged features to create a model that can predict the next day's closing price. All the algorithms performed considerably well, with linear regression being the best, followed by artificial neural networks and long short-term memory being the worst. Finally, the implemented algorithms were used to predict the closing prices of other stock indices, NASDAQ and Hang Seng Index. All algorithms performed well and followed the same trend, wherein linear regression performed the best and long- and short-term memory the worst. Future research should be conducted to explore the possibilities of utilizing lagged features along with external features like GDP growth rate, political trends, etc.
Comparative Analysis of Machine Learning Algorithms for Health Insurance Pricing Bau, Yoon-Teck; Md Hanif, Shuhail Azri
JOIV : International Journal on Informatics Visualization Vol 8, No 1 (2024)
Publisher : Society of Visual Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62527/joiv.8.1.2282

Abstract

Insurance is an effective way to guard against potential loss. Risk management is primarily employed to protect against the risk of a financial loss. Risk and uncertainty are inevitable parts of life, and the pace of life has led to a rise in these risks and uncertainties. Health insurance pricing has emerged as one of the essential fields of this study following the coronavirus pandemic. The anticipated outcomes from this study will be applied to guarantee that an insurance company's goal for its health insurance packages is within the range of profitability so that the insurance company will also choose the most price-effective course of action. The US Health Insurance dataset was utilized for this study. This health insurance pricing prediction aims to examine four different types of regression-based machine learning algorithms: multiple linear regression, ridge regression, XGBoost regression, and random forest regression. The implemented model's performance is assessed using four evaluation metrics: MAE, MSE, RMSE, and R2 score. Random forest regression outperforms all other algorithms in terms of all four evaluation metrics. The best machine learning algorithm, random forest, is further enhanced with hyperparameter tuning. Random forest with hyperparameter tuning performs better for three evaluation metrics except for MAE. To gain further insights, data visualizations are also implemented to showcase the importance of features and the differences between actual and predicted prices for all the data points.
Developing and Comparing Machine Learning Algorithms for Music Recommendation Bau, Yoon-Teck; Mohd Reza, Puteri Ainna Ezzurin; Lee, Kian-Chin
JOIV : International Journal on Informatics Visualization Vol 8, No 3-2 (2024): IT for Global Goals: Building a Sustainable Tomorrow
Publisher : Society of Visual Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62527/joiv.8.3-2.2947

Abstract

The increasing prevalence of song skipping in music streaming applications negatively impacts user satisfaction and subscription retention. Dissatisfaction often arises when users encounter songs they actively dislike, highlighting a gap between user expectations and the value offered by these services. To address this, music recommendation algorithms were researched and developed. Initially, data collection is started. Data collection is through the Spotify application programming interface. This initiation step sets the stage for subsequent exploratory data analysis. Exploratory data analysis examined the collected data to plot a bar chart for total songs released over the years, plot a bar chart for the popularity of songs based on the year it is released, visualize word cloud on frequently mentioned music genres, chart count plot for explicit songs, and chart count plot for song modalities. Data preprocessing involved cleaning the data, handling missing values, and checking for null values to prepare the application of machine learning algorithms. Four machine learning algorithms were applied, k-means, mini-batch k-means, Gaussian mixture, and density-based spatial clustering of applications with noise (DBSCAN), to analyze musical features like rhythm, tempo, and other relevant music attributes. The results showed that the k-means outperforms all other algorithms evaluated regarding recommendation quality, as measured by the Calinski-Harabasz score. Based on the evaluation, the best machine learning will then be applied to identify similarities between songs and be used to generate music recommendation results.