Contact Name
Heri Nurdiyanto
Contact Email
jurnal.ijasca@gmail.com
Phone
+6285766661199
Journal Mail Official
jurnal.ijasca@gmail.com
Editorial Address
Lucky Arya Residence 2 No. 18 Jalan HOS. Cokroaminoto Kab. Pringsewu 35373
Location
Kab. Pringsewu,
Lampung
INDONESIA
International Journal of Advanced Science and Computer Applications
Published by UK Institute
ISSN : 28097599     EISSN : 28097467     DOI : https://doi.org/10.47679/ijasca
International Journal of Advanced Science and Computer Applications (IJASCA) is a peer-reviewed open-access journal. The journal invites scientists and engineers throughout the world to exchange and disseminate theoretical and practice-oriented work across the whole spectrum of Advanced Science and Computer Applications. Submitted papers must be written in English and undergo an initial review by the editors, followed by a further review by a minimum of two international reviewers. Accepted papers are freely accessible on this website.
Articles 52 Documents
Integrating OCR and NLP Techniques for Accurate Text Extraction and Plagiarism Detection in Image-Based Content
Kumar, Palvadi Srinivas; Prasad, Krishna
International Journal of Advanced Science and Computer Applications Vol. 4 No. 1 (2025): March 2025
Publisher : Utan Kayu Publishing

DOI: 10.47679/ijasca.v4i1.105

Abstract

In the digital age, images often contain valuable text-based information, including numbers, symbols, and other data. Efficient extraction and verification of this content is critical, particularly in academic and content-driven domains where originality is paramount. This paper presents a novel approach to detecting plagiarism in text embedded within images. The proposed method leverages Optical Character Recognition (OCR) to extract text from images and applies Natural Language Processing (NLP) techniques to evaluate the originality of the extracted content. By comparing the text against a comprehensive database of existing sources, the system is capable of identifying potential plagiarism while distinguishing between original and copied content. This approach ensures that not only text in conventional documents but also in images is scrutinized for authenticity, enhancing the reliability of plagiarism detection in diverse content formats. The proposed solution offers an efficient and automated pipeline for image-based text extraction and plagiarism detection, applicable in educational, legal, and content creation environments.
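The comparison step described in this abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure the authors use, so the word n-gram Jaccard overlap below is an assumption chosen for simplicity. The OCR step is assumed to have already produced a plain string, and the names `shingles`, `jaccard`, `flag_matches`, and the 0.5 threshold are hypothetical.

```python
import re


def shingles(text: str, n: int = 3) -> set:
    """Return the set of word n-grams (shingles) in the lowercased text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a: str, b: str, n: int = 3) -> float:
    """Jaccard overlap of the two texts' shingle sets, in [0, 1]."""
    sa, sb = shingles(a, n), shingles(b, n)
    if not sa or not sb:
        # Too short to form a single n-gram: treat as no measurable overlap.
        return 0.0
    return len(sa & sb) / len(sa | sb)


def flag_matches(extracted: str, sources: dict, threshold: float = 0.5) -> list:
    """Return (source name, score) pairs at or above the threshold, best first.

    `extracted` stands in for text already produced by an OCR engine;
    `sources` maps a hypothetical source name to its reference text.
    """
    scored = [(name, jaccard(extracted, text)) for name, text in sources.items()]
    flagged = [(name, score) for name, score in scored if score >= threshold]
    return sorted(flagged, key=lambda pair: pair[1], reverse=True)
```

Shingle overlap only catches near-verbatim copying. Detecting paraphrased text would need a semantic measure, which this sketch does not attempt.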
Developing Semantic Textual Similarity for Guragigna Language Using Deep Learning Approach
Getnet Degemu
International Journal of Advanced Science and Computer Applications Vol. 5 No. 1 (2026): March 2026
Publisher : Utan Kayu Publishing

DOI: 10.47679/ijasca.v4i2.106

Abstract

Semantic similarity is one of the highest levels of NLP. Semantic textual similarity (STS) has significant advantages in NLP applications such as information retrieval, information extraction, text summarization, data mining, and machine translation. This research presents a deep learning approach for capturing STS in the Guragigna language. The methodology involves collecting a Guragigna language corpus and preprocessing the text data. Text representation is done using the Universal Sentence Encoder (USE), along with the word embedding techniques Word2Vec and GloVe, and Mean Squared Error (MSE) is used to measure performance. In the experimentation phase, LSTM, Bidirectional RNN, GRU, and Stacked RNN models are trained and evaluated using the different embedding techniques. The results demonstrate the efficacy of the developed models in capturing semantic textual similarity in the Guragigna language. Across the embedding techniques (Word2Vec, GloVe, and USE), the Bidirectional RNN model with USE embedding achieves the lowest MSE of 0.0950 and the highest accuracy of 0.9244. GloVe and Word2Vec embeddings also show competitive performance, with slightly higher MSE and lower accuracy. The Universal Sentence Encoder consistently emerges as the top-performing embedding across all RNN architectures. The research results demonstrate the effectiveness of LSTM, GRU, Bidirectional RNN, and Stacked RNN models in measuring semantic textual similarity in the Guragigna language.
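As a rough illustration of the evaluation metric named in this abstract, the sketch below computes the Mean Squared Error between predicted and reference similarity scores, together with a cosine similarity between two sentence vectors. It does not reproduce the authors' models: the function names are hypothetical, and the use of cosine similarity as a scoring function is an assumption, since the abstract only states that RNN-based models were trained on USE, Word2Vec, and GloVe representations.

```python
import math


def cosine_similarity(u, v) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # A zero vector has no direction; report no similarity.
        return 0.0
    return dot / (norm_u * norm_v)


def mean_squared_error(predicted, reference) -> float:
    """Average squared difference between predicted and reference scores."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum((p - r) ** 2 for p, r in zip(predicted, reference))
    return total / len(predicted)
```

A lower MSE means the predicted similarity scores sit closer to the human-annotated ones, which is the sense in which the abstract reports 0.0950 as its best result.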