International Journal of Informatics and Communication Technology (IJ-ICT)
Vol 11, No 1: April 2022

Correcting optical character recognition result via a novel approach

Otman Maarouf (Sultan Moulay Slimane University)
Rachid El Ayachi (Sultan Moulay Slimane University)
Mohamed Biniz (Sultan Moulay Slimane University)



Article Info

Publish Date
01 Apr 2022

Abstract

Optical character recognition (OCR) is a recognition system used to recognize the substance of a checked picture. This system gives erroneous results, which necessitates a post-treatment, for the sentence correction. In this paper, we proposed a new method for syntactic and semantic correction of sentences it is based on the frequency of two correct words in the sentence and a recursive technique. This approach starts with the frequency calculation of each two words successive in the corpora, the words that have the greatest frequency build a correction center. We found 98% using our approach when we used the noisy channel. Further, we obtained 96% using the same corpus in the same conditions.

Copyrights © 2022






Journal Info

Abbrev

IJICT

Publisher

Subject

Computer Science & IT

Description

International Journal of Informatics and Communication Technology (IJ-ICT) is a common platform for publishing quality research paper as well as other intellectual outputs. This Journal is published by Institute of Advanced Engineering and Science (IAES) whose aims is to promote the dissemination of ...