Journal of Applied Information, Communication and Technology
Vol. 4 No. 1 (2017)

Hadoop Configuration Tuning for Performance Optimization

Christian, Christian (Unknown)
I Eng, Kho (Unknown)
Ipung, Heru Purnomo (Unknown)



Article Info

Publish Date
25 Apr 2017

Abstract

Configuration parameter tuning is an essential part of the implementation of Hadoop clusters. Each parameter in a configuration plays a role that impacts the ov erall performance of the cluster. Therefore, we need to learn the characteristics of said parameter and understand the impact in hardware utilization in order to achieve optimal configuration. In this paper, we conducted experiments that includes modifying configuration and performed benchmark to find out if there is any performance gain. TeraSort is the program that runs the benchmark, we measure the time needed to complete the sort of the set of data and the CPU utilization during the benchmark. We conclu de that from our experiments we can see significant performance improvements by tuning with the configurations. However, the results may vary between different cluster configuration.

Copyrights © 2017






Journal Info

Abbrev

EJAICT

Publisher

Subject

Computer Science & IT

Description

Journal of Applied Information, Communication and (eJAICT) welcomes full research papers in the area of Information and Communication Technology (ICT). The journal publishes review and research result on the frontier research, development, and application in the scope of ICT. The scope of the ...