False Positives in Artificial Intelligence Detectors in Academic Assignments: A Comparative Analysis of Digital Platforms in Educational Contexts

Authors

DOI:

https://doi.org/10.70577/asce.v5i2.861

Keywords:

Artificial Intelligence, false positives, AI detectors, academic evaluation, education, academic assignments, educational ethics, students

Abstract

This research aimed to analyze the reliability of different Artificial Intelligence detection tools applied to academic assignments in educational contexts. The study was conducted using a quantitative, descriptive, and comparative approach, involving five teachers who evaluated the same textual content corresponding to an old poem by Mario Benedetti through the platforms JustDone, Reprism, Grammarly, Sidekicker, and ZeroGPT.

The results revealed significant differences among the analyzed tools. JustDone, Reprism, and Sidekicker classified the human-written content with high AI detection percentages, reaching values of up to 99%, whereas ZeroGPT and Grammarly identified the text as entirely human, reporting 0% AI-generated content. Additionally, a survey was conducted with participating teachers, whose responses reflected distrust regarding the accuracy of these tools and concern about their potential negative impact on Basic Education and High School students. The study concludes that some AI detectors present false positives and important limitations in accuracy; therefore, they should not be used as definitive evidence in academic evaluation processes. It is recommended that these tools be complemented with human review, contextual analysis, and pedagogical criteria before making judgments about the authenticity of student academic work.

Downloads

Download data is not yet available.

References

OpenAI. (2023). New AI classifier for indicating AI-written text.

https://openai.com/index/new-ai-classifier-for-indicating-ai-written-text/

Dalalah, D., & Dalalah, O. M. A. (2023). The false positives and false negatives of generative AI detection tools in education and academic research: The case of ChatGPT. The International Journal of Management Education, 21(2), 100822.

https://doi.org/10.1016/j.ijme.2023.100822 DOI: https://doi.org/10.1016/j.ijme.2023.100822

Turnitin. (2024). Using the AI Writing Report.

https://guides.turnitin.com/hc/en-us/articles/22774058814093-Using-the-AI-Writing-Report

Deep, P. D. (2025). Evaluating the effectiveness and ethical implications of AI detection tools in academic settings. Information, 16(10), 905. DOI: https://doi.org/10.3390/info16100905

https://www.mdpi.com/2078-2489/16/10/905

Northern Illinois University. (2024). AI detectors: An ethical minefield.

https://citl.news.niu.edu/2024/12/12/ai-detectors-an-ethical-minefield/

Published

2026-05-26

How to Cite

Yanangómez Suárez , P. A., Díaz Díaz , E. R., Ojeda Cabrera, L. M., Yaguachi Quichimbo , C. B., & Carrillo Díaz , M. C. (2026). False Positives in Artificial Intelligence Detectors in Academic Assignments: A Comparative Analysis of Digital Platforms in Educational Contexts. ANNALS SCIENTIFIC EVOLUTION, 5(2), 1976–1999. https://doi.org/10.70577/asce.v5i2.861

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.