eProceedings of Engineering
Vol 4, No 3 (2017): December, 2017

Indonesian Language Stemmer Algorithm Improvement By Rearrange Stemming Process Steps Sequence

Hari Widayanto (Telkom University)
Arief Huda (Telkom University)



Article Info

Publish Date
01 Dec 2017

Abstract

Stemming is the process of finding the root word of an affixed form by removing all affixes attached to it. Stemmers are applied in various text mining applications to improve performance. In information retrieval, a stemmer can improve performance by matching morphological variants of the searched terms and by reducing the size of the index [9]. In word-based text compression, a stemmer can simplify the dictionary, because various word forms can be represented by one word [6]. Besides reducing the size of the document index, a stemmer can increase text retrieval accuracy [10]. In text classification, a stemmer reduces the number of features [18]. The first Indonesian stemmer was developed by Nazief and Adriani; Jelita Asian later improved the algorithm, and the result is called the confix stripping (CS) stemmer. The CS stemmer introduced many improvements, making it the highest-accuracy stemmer algorithm. An experiment is performed to compare the accuracy of the Nazief-Adriani and CS stemmer algorithms when stemming words extracted from the online news site Republika.

Keywords: Stemming, Indonesian, Nazief-Adriani, CS stemmer
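To make the idea of affix removal and accuracy comparison concrete, the following is a minimal sketch of a dictionary-based affix stripper and an accuracy measure. It is a toy illustration only, not the Nazief-Adriani or CS algorithm: the root-word list, the affix lists, the names `toy_stem` and `accuracy`, and the sample words are all assumptions made for this example.

```python
# Toy illustration only: this is NOT the Nazief-Adriani or CS stemmer.
# The root-word dictionary and affix lists are small hypothetical samples.
ROOT_WORDS = {"makan", "baca", "main", "ajar"}
SUFFIXES = ("kan", "an", "i", "nya", "lah", "kah")
PREFIXES = ("di", "ke", "se", "ber", "ter", "me", "pe")


def toy_stem(word: str) -> str:
    """Strip at most one suffix and one prefix, accepting only dictionary roots."""
    word = word.lower()
    if word in ROOT_WORDS:
        return word

    # Candidate forms: the word itself plus every single-suffix removal.
    candidates = [word]
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            candidates.append(word[: -len(suffix)])

    for candidate in candidates:
        if candidate in ROOT_WORDS:
            return candidate
        for prefix in PREFIXES:
            if candidate.startswith(prefix):
                stripped = candidate[len(prefix):]
                if stripped in ROOT_WORDS:
                    return stripped

    # No dictionary root found: return the word unchanged.
    return word


def accuracy(stemmer, gold: dict) -> float:
    """Fraction of words whose stem matches the expected root."""
    correct = sum(1 for word, root in gold.items() if stemmer(word) == root)
    return correct / len(gold)


# Hypothetical gold standard: word -> expected root.
GOLD = {
    "makanan": "makan",
    "dimakan": "makan",
    "bermain": "main",
    "pelajaran": "ajar",  # needs the "pel-" prefix variant, which this toy lacks
}

if __name__ == "__main__":
    for w in GOLD:
        print(w, "->", toy_stem(w))
    print("accuracy:", accuracy(toy_stem, GOLD))
```

The last sample word is left unstemmed by this sketch because it requires a prefix variant the toy rule list does not contain. Handling such cases, and the order in which affix-removal steps are applied, is the kind of detail in which real stemmers differ.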

Copyrights © 2017






Journal Info

Abbrev

engineering

Publisher

Subject

Computer Science & IT; Control & Systems Engineering; Electrical & Electronics Engineering; Engineering; Industrial & Manufacturing Engineering

Description

A publication medium for the scientific works of Telkom University graduates, covering engineering studies. Uploaded scientific papers go through a review procedure (reviewer) and supervisor approval ...