International Journal of Intelligent Systems and Applications in Engineering
2016: Special Issue

Comparison of the effect of unsupervised and supervised discretization methods on classification process

HACIBEYOĞLU, MEHMET (Unknown)
IBRAHIM, Mohammed H. (Unknown)



Article Info

Publish Date
26 Dec 2016

Abstract

Most of the machine learning and data mining algorithms use discrete data for the classification process. But, most data in practice include continuous features. Therefore, a discretization pre-processing step is applied on these datasets before the classification. Discretization process converts continuous values to discrete values. In the literature, there are many methods used for discretization process. These methods are grouped as supervised and unsupervised methods according to whether a class information is used or not. In this paper, we used two unsupervised methods: Equal Width Interval (EW), Equal Frequency (EF) and one supervised method: Entropy Based (EB) discretization. In the experiments, a well-known 10 dataset from UCI (Machine Learning Repository) is used in order to compare the effect of the discretization methods on the classification. The results show that, Naive Bayes (NB), C4.5 and ID3 classification algorithms obtain higher accuracy with EB discretization method.

Copyrights © 2016






Journal Info

Abbrev

IJISAE

Publisher

Subject

Computer Science & IT

Description

International Journal of Intelligent Systems and Applications in Engineering (IJISAE) is an international and interdisciplinary journal for both invited and contributed peer reviewed articles that intelligent systems and applications in engineering at all levels. The journal publishes a broad range ...