Claim Missing Document
Check
Articles

Found 1 Documents
Search

Integrating OCR and NLP Techniques for Accurate Text Extraction and Plagiarism Detection in Image-Based Content Palvadi, Dr Srinivas Kumar
International Journal of Advanced Science and Computer Applications Vol. 4 No. 1 (2025): March 2025
Publisher : Utan Kayu Publishins

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47679/ijasca.v4i1.105

Abstract

In the digital age, images often contain valuable text-based information, including numbers, symbols, and other data. Efficient extraction and verification of this content is critical, particularly in academic and content-driven domains where originality is paramount. This paper presents a novel approach to detecting plagiarism in text embedded within images. The proposed method leverages Optical Character Recognition (OCR) to extract text from images and applies Natural Language Processing (NLP) techniques to evaluate the originality of the extracted content. By comparing the text against a comprehensive database of existing sources, the system is capable of identifying potential plagiarism while distinguishing between original and copied content. This approach ensures that not only text in conventional documents but also in images is scrutinized for authenticity, enhancing the reliability of plagiarism detection in diverse content formats. The proposed solution offers an efficient and automated pipeline for image-based text extraction and plagiarism detection, applicable in educational, legal, and content creation environments.