INOVTEK Polbeng - Seri Informatika
Vol. 10 No. 2 (2025): July

Analysis of Differences Between AI and Human Texts Using the Natural Language Processing Method

Cahyana, Dinda (Unknown)
Sijabat, VitoReyLukito (Unknown)
Irfan Fahmi, Mohammad (Unknown)



Article Info

Publish Date
18 Jun 2025

Abstract

Artificial Intelligence has become increasingly proficient in generating text that mimics human writing, yet existing detection tools remain limited in accuracy and adaptability. Previous studies indicate that systems like Turnitin and GPTZero often perform below 80% accuracy and struggle with paraphrased or advanced AI-generated content. This study addresses that gap by analyzing linguistic differences between AI-generated and human-written texts using Natural Language Processing. A dataset of 487,235 texts (305,797 human-written and 181,438 AI-generated) was processed using TF-IDF vectorization and classified with the Multinomial Naive Bayes algorithm. The model achieved 99.35% accuracy and an F1-score of 0.9948, with balanced performance in detecting both text types. Results show that while AI-generated texts are structurally consistent, they often lack the emotional depth and cultural nuance found in human writing. These findings suggest NLP methods are highly effective in distinguishing between the two, and have practical implications for developing more reliable detection systems to ensure textual authenticity in education, journalism, and digital media monitoring.

Copyrights © 2025






Journal Info

Abbrev

ISI

Publisher

Subject

Computer Science & IT

Description

The Journal of Innovation and Technology (INOVTEK Polbeng—Seri Informatika) is a distinguished publication hosted by the State Polytechnic of Bengkalis. Dedicated to advancing the field of informatics, this scientific research journal serves as a vital platform for academics, researchers, and ...