Indonesian Journal of Electrical Engineering and Computer Science
Vol 12, No 7: July 2014

Sequence Clustering Algorithm Based on Weighed Sequential Pattern Similarity

Di Wu (Unknown)
Jiadong Ren (Unknown)



Article Info

Publish Date
01 Jul 2014

Abstract

Sequence clustering has become an active issue in the current scientific community. However, the clustering quality is affected heavily by selecting initial clustering centers randomly. In this paper, a new sequence similarity measurement based on weighed sequential patterns is defined. SCWSPS (Sequence Clustering Algorithm Based on Weighed Sequential Pattern Similarity) algorithm is proposed. Sequences with the largest weighted similarity are chosen as the merge objects. The last K-1 synthesis results are deleted from sequence database. Others sequences are divided into K clusters. Moreover, in each cluster, the sequence which has the largest sum of similarities with other sequences is viewed as the updated center. The experimental results and analysis show that the performance of SCWSPS is better than KSPAM and K-means in clustering quality. When the sequence scale is very large, the execution efficiency of SCWSPS is slightly worse than KSPAM and K-means.

Copyrights © 2014