Claim Missing Document
Check
Articles

Found 21 Documents
Search

Improving Computational Efficiency and Accuracy of Damerau-Levenshtein Distance for Indonesian Spelling Correction using Cosine Similarity husni husni; Yoga Dwitya Pramudita; Mohammad Syarief; Army Justitia; Ika Oktavia Suzanti
Journal of Innovation Information Technology and Application (JINITA) Vol 7 No 2 (2025): JINITA, December 2025
Publisher : Politeknik Negeri Cilacap

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35970/jinita.v7i2.2893

Abstract

Spelling correction is an automatic correction feature useful in detecting spelling errors and providing word suggestions if necessary. Spelling correction is one of the crucial preprocessing phases in text mining. The Damerau-Levenshtein Distance method is one of the spelling correction methods that has high accuracy. This method has four types of operations: insertion, deletion, substitution, and transposition. The basic approach in detecting spelling errors in the Indonesian language is to use a dictionary search. Despite its accuracy, the Damerau-Levenshtein Distance method has a slow computation time. Furthermore, when the dictionary contains several suggested words that have the same distance from the target word, it will be difficult to prioritize the most appropriate suggestions. To overcome this problem, we introduce a caching mechanism to store previously calculated corrections, thereby speeding up the computation process. In addition, we use the cosine similarity method to rank words in Damerau-Levenshtein Distance results. The results of our approach have a significant improvement in accuracy, increasing from 72.13% to 83.60% by integrating caching and cosine similarity for ranking, which shows a significant improvement in both efficiency and effectiveness