Journal of Future Artificial Intelligence and Technologies
Vol. 1 No. 2 (2024): September 2024

Pilot Study on Enhanced Detection of Cues over Malicious Sites Using Data Balancing on the Random Forest Ensemble

Okpor, Margaret Dumebi (Unknown)
Aghware, Fidelis Obukohwo (Unknown)
Akazue, Maureen Ifeanyi (Unknown)
Eboka, Andrew Okonji (Unknown)
Ako, Rita Erhovwo (Unknown)
Ojugo, Arnold Adimabua (Unknown)
Odiakaose, Christopher Chukwufunaya (Unknown)
Binitie, Amaka Patience (Unknown)
Geteloma, Victor Ochuko (Unknown)
Ejeh, Patrick Ogholuwarami (Unknown)



Article Info

Publish Date
07 Sep 2024

Abstract

The digital revolution frontiers have rippled across society today – with various web content shared online for users as they seek to promote monetization and asset exchange, with clients constantly seeking improved alternatives at lowered costs to meet their value demands. From item upgrades to their replacement, businesses are poised with retention strategies to help curb the challenge of customer attrition. The birth of smartphones has proliferated feats such as mobility, ease of accessibility, and portability – which, in turn, have continued to ease their rise in adoption, exposing user device vulnerability as they are quite susceptible to phishing. With users classified as more susceptible than others due to online presence and personality traits, studies have sought to reveal lures/cues as exploited by adversaries to enhance phishing success and classify web content as genuine and malicious. Our study explores the tree-based Random Forest to effectively identify phishing cues via sentiment analysis on phishing website datasets as scrapped from user accounts on social network sites. The dataset is scrapped via Python Google Scrapper and divided into train/test subsets to effectively classify contents as genuine or malicious with data balancing and feature selection techniques. With Random Forest as the machine learning of choice, the result shows the ensemble yields a prediction accuracy of 97 percent with an F1-score of 98.19% that effectively correctly classified 2089 instances with 85 incorrectly classified instances for the test-dataset.

Copyrights © 2024






Journal Info

Abbrev

FAITH

Publisher

Subject

Computer Science & IT

Description

Journal of Future Artificial Intelligence and Technologies E-ISSN: 3048-3719 is an international journal that delves into the comprehensive spectrum of artificial intelligence, focusing on its foundations, advanced theories, and applications. All accepted articles will be published online, receive a ...