Claim Missing Document
Check
Articles

Found 9 Documents
Search

A Survey of Machine Learning Techniques for Self-tuning Hadoop Performance Md. Armanur Rahman; J. Hossen; Venkataseshaiah C; CK Ho; Tan Kim Geok; Aziza Sultana; Jesmeen M. Z. H.; Ferdous Hossain
International Journal of Electrical and Computer Engineering (IJECE) Vol 8, No 3: June 2018
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (475.262 KB) | DOI: 10.11591/ijece.v8i3.pp1854-1862

Abstract

The Apache Hadoop framework is an open source implementation of MapReduce for processing and storing big data. However, to get the best performance from this is a big challenge because of its large number configuration parameters. In this paper, the concept of critical issues of Hadoop system, big data and machine learning have been highlighted and an analysis of some machine learning techniques applied so far, for improving the Hadoop performance is presented. Then, a promising machine learning technique using deep learning algorithm is proposed for Hadoop system performance improvement.
An Efficient Microcontroller Based Sun Tracker Control for Solar Cell Systems E.M.H. Arif; J. Hossen; G. Ramana Murthy; Jesmeen M. Z. H.; J. Emerson Raja
International Journal of Electrical and Computer Engineering (IJECE) Vol 9, No 4: August 2019
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (25.088 KB) | DOI: 10.11591/ijece.v9i4.pp2743-2750

Abstract

The solar energy is fast becoming a different means of electricity resource. Now in world Fossil fuels are seriously depleting thus the need for another energy source is a necessity. To create effective utilization of its solar, energy efficiency must be maximized. An attainable way to deal with amplifying the power output of sun-powered exhibit is by sun tracking. This paper presents the control system for a solar cell orientation device which follows the sun in real time during daytime.
An efficient encode-decode deep learning network for lane markings instant segmentation A. Al Mamun; P. P. Em; J. Hossen
International Journal of Electrical and Computer Engineering (IJECE) Vol 11, No 6: December 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v11i6.pp4982-4990

Abstract

Nowadays, advanced driver assistance systems (ADAS) has been incorporated with a distinct type of progressive and essential features. One of the most preliminary and significant features of the ADAS is lane marking detection, which permits the vehicle to keep in a particular road lane itself. It has been detected by utilizing high-specialized, handcrafted features and distinct post-processing approaches lead to less accurate, less efficient, and high computational framework under different environmental conditions. Hence, this research proposed a simple encode-decode deep learning approach under distinguishing environmental effects like different daytime, multiple lanes, different traffic condition, good and medium weather conditions for detecting the lane markings more accurately and efficiently. The proposed model is emphasized on the simple encode-decode Seg-Net framework incorporated with VGG16 architecture that has been trained by using the inequity and cross-entropy losses to obtain more accurate instant segmentation result of lane markings. The framework has been trained and tested on a vast public dataset named Tusimple, which includes around 3.6K training and 2.7 k testing image frames of different environmental conditions. The model has noted the highest accuracy, 96.61%, F1 score 96.34%, precision 98.91%, and recall 93.89%. Also, it has also obtained the lowest 3.125% false positive and 1.259% false-negative value, which transcended some of the previous researches. It is expected to assist significantly in the field of lane markings detection applying deep neural networks.
Lane marking detection using simple encode decode deep learning technique: SegNet A. Al Mamun; P. P. Em; J. Hossen
International Journal of Electrical and Computer Engineering (IJECE) Vol 11, No 4: August 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v11i4.pp3032-3039

Abstract

In recent times, many innocent people are suffering from sudden death for the sake of unwanted road accidents, which also riveting a lot of financial properties. The researchers have deployed advanced driver assistance systems (ADAS) in which a large number of automated features have been incorporated in the modern vehicles to overcome human mortality as well as financial loss, and lane markings detection is one of them. Many computer vision techniques and intricate image processing approaches have been used for detecting the lane markings by utilizing the handcrafted with highly specialized features. However, the systems have become more challenging due to the computational complexity, overfitting, less accuracy, and incapability to cope up with the intricate environmental conditions. Therefore, this research paper proposed a simple encode-decode deep learning model to detect lane markings under the distinct environmental condition with lower computational complexity. The model is based on SegNet architecture for improving the performance of the existing researches, which is trained by the lane marking dataset containing different complex environment conditions like rain, cloud, low light, curve roads. The model has successfully achieved 96.38% accuracy, 0.0311 false positive, 0.0201 false negative, 0.960 F1 score with a loss of only 1.45%, less overfitting and 428 ms per step that outstripped some of the existing researches. It is expected that this research will bring a significant contribution to the field lane marking detection.
A smart method for spark using neural network for big data Md. Armanur Rahman; J. Hossen; Aziza Sultana; Abdullah Al Mamun; Nor Azlina Ab. Aziz
International Journal of Electrical and Computer Engineering (IJECE) Vol 11, No 3: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v11i3.pp2525-2534

Abstract

Apache spark, famously known for big data handling ability, is a distributed open-source framework that utilizes the idea of distributed memory to process big data. As the performance of the spark is mostly being affected by the spark predominant configuration parameters, it is challenging to achieve the optimal result from spark. The current practice of tuning the parameters is ineffective, as it is performed manually. Manual tuning is challenging for large space of parameters and complex interactions with and among the parameters. This paper proposes a more effective, self-tuning approach subject to a neural network called Smart method for spark using neural network for big data (SSNNB) to avoid the disadvantages of manual tuning of the parameters. The paper has selected five predominant parameters with five different sizes of data to test the approach. The proposed approach has increased the speed of around 30% compared with the default parameter configuration.
AUTO-CDD: automatic cleaning dirty data using machine learning techniques Jesmeen M. Z. H.; Abid Hossen; J. Hossen; J. Emerson Raja; Bhuvaneswari Thangavel; S. Sayeed; Tawsif K.
TELKOMNIKA (Telecommunication Computing Electronics and Control) Vol 17, No 4: August 2019
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12928/telkomnika.v17i4.12780

Abstract

Cleaning the dirty data has become very critical significance for many years, especially in medical sectors. This is the reason behind widening research in this sector. To initiate the research, a comparison between currently used functions of handling missing values and Auto-CDD is presented. The developed system will guarantee to overcome processing unwanted outcomes in data Analytical process; second, it will improve overall data processing. Our motivation is to create an intelligent tool that will automatically predict the missing data. Starting with feature selection using Random Forest Gini Index values. Then by using three Machine Learning Paradigm trained model was developed and evaluated by two datasets from UCI (i.e. Diabetics and Student Performance). Evaluated outcomes of accuracy proved Random Forest Classifier and Logistic Regression gives constant accuracy at around 90%. Finally, it concludes that this process will help to get clean data for further analytical process.
Internet of things (IoT) based smart garbage monitoring system Thangavel Bhuvaneswari; J. Hossen; NurAsyiqinbt. Amir Hamzah; P. Velrajkumar; Oo Hong Jack
Indonesian Journal of Electrical Engineering and Computer Science Vol 20, No 2: November 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v20.i2.pp736-743

Abstract

Garbage waste monitoring, collection and management is one of the primary concerns of the present era due to its detrimental effects on environment. The traditional way of manually monitoring and collecting the garbage is a cumbersome process as it requires considerable human effort and time leading to higher cost. In this paper, an IoT based garbage monitoring system using Thingspeak, an open IoT platform is presented. The system consists of an Arduino microcontroller, an ultrasonic sensor, a load cell and a Wi-Fi module. The Arduino microcontroller receives data from the ultrasonic sensor and load cell. The depth of the garbage in the bin is measured using ultrasonic sensor and the weight of the bin with garbage is measured from the load cell. The LCD screen is used to display the data. The Wi-Fi module transmits the above data to the internet. An open IoT platform Thingspeak is used to monitor the garbage system. With this system, the administrator can monitor and schedule garbage collection more efficiently. A prototype has been developed and tested. It has been found to work satisfactorily. The details are presented in this paper.
Towards machine learning-based self-tuning of Hadoop-Spark system Md. Armanur Rahman; Abid Hossen; J. Hossen; Venkataseshaiah C; Thangavel Bhuvaneswari; Aziza Sultana
Indonesian Journal of Electrical Engineering and Computer Science Vol 15, No 2: August 2019
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v15.i2.pp1076-1085

Abstract

Apache Spark is an open source distributed platform which uses the concept of distributed memory for processing big data. Spark has more than 180 predominant configuration parameter. Configuration settings directly control the efficiency of Apache spark while processing big data, to get the best outcome yet a challenging task as it has many configuration parameters.  Currently, these predominant parameters are tuned manually by trial and error. To overcome this manual tuning problem in this paper proposed and developed a self-tuning approach using machine learning. This approach can tune the parameter value when it’s required. The approach was implemented on Dell server and experiment was done on five different sizes of the dataset and parameter. A comparison is provided to highlight the experimented result of the proposed approach with default Spark configuration system. The results demonstrate that the execution is speeded-up by about 33% (on an average) compared to the default configuration.
A Survey on Cleaning Dirty Data Using Machine Learning Paradigm for Big Data Analytics Jesmeen M. Z. H; J. Hossen; S. Sayeed; CK Ho; Tawsif K; Armanur Rahman; E.M.H. Arif
Indonesian Journal of Electrical Engineering and Computer Science Vol 10, No 3: June 2018
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v10.i3.pp1234-1243

Abstract

Recently Big Data has become one of the important new factors in the business field. This needs to have strategies to manage large volumes of structured, unstructured and semi-structured data. It’s challenging to analyze such large scale of data to extract data meaning and handling uncertain outcomes. Almost all big data sets are dirty, i.e. the set may contain inaccuracies, missing data, miscoding and other issues that influence the strength of big data analytics. One of the biggest challenges in big data analytics is to discover and repair dirty data; failure to do this can lead to inaccurate analytics and unpredictable conclusions. Data cleaning is an essential part of managing and analyzing data. In this survey paper, data quality troubles which may occur in big data processing to understand clearly why an organization requires data cleaning are examined, followed by data quality criteria (dimensions used to indicate data quality). Then, cleaning tools available in market are summarized. Also challenges faced in cleaning big data due to nature of data are discussed. Machine learning algorithms can be used to analyze data and make predictions and finally clean data automatically.