Empirical Study of AI-Generated Text Detection: Results from An Empirical Study of AI-Generated Text Detection Tools
Originality.ai exhibited outstanding AI detection capabilities, according to research in An Empirical Study of AI-Generated Text Detection Tools, 2023.
According to An Empirical Study of AI-Generated Text Detection Tools by Arslan Akram (Department of Computer Science, Faculty of Computer Science and Information Technology, The Superior University, Pakistan), Originality.ai outperformed the other detection tools at distinguishing AI-generated content from human-written content. In this evaluation it was the most reliable option for AI text identification.
Key Findings (TL;DR)
Originality.ai achieved the highest accuracy of all the tools in the study, at 97%.
Originality.ai demonstrated outstanding precision (98%), recall (96%), and F1-score (97%).
Originality.ai stood out in the confusion matrix with the highest true positives and lowest false negatives, meaning it correctly flagged the most AI-generated samples and missed the fewest.
Study Details
The study evaluated six AI text detection tools: Zylalab, GPTKIT, GPTZero, Sapling, Originality.ai, and Writer, with a particular emphasis on their accuracy, precision, recall, and F1-score. Although all the tools fared well in the evaluations, Originality.ai was the most effective at distinguishing AI-generated text from human-written text.
AI Text Detection Tools
Six AI text detection tools (Zylalab, GPTKIT, GPTZero, Sapling, Originality.ai, and Writer)
Dataset
The total number of samples in the dataset is 11,580.
The dataset, named AH&AITD (Arslan’s Human and AI Text Database), includes:
Human-written text samples from academic databases (Google Scholar and ResearchGate), content producer and blogger databases (Wikipedia), and other knowledge aggregators.
AI-generated text samples produced by models including ChatGPT, GPT-4, GPT-3.5, GPT-3, GPT-2, and others.
Evaluation Criteria
Accuracy, Precision, Recall, F1 score, ROC curve, and Confusion Matrix
Originality.ai’s AI Detector Results
Finding 1: Originality.ai achieved the highest accuracy rate of 97.0% among all evaluated tools
Finding 2: Originality.ai demonstrated outstanding precision, recall, and F1-score
Originality.ai showed:
Precision: 98%. Of the samples it flagged as AI-generated, 98% really were AI-generated.
Recall: 96%. It caught 96% of the AI-generated samples in the dataset.
F1-score: 97%. This is the harmonic mean of precision and recall, so a high value indicates a good balance between the two.
Finding 3: Originality.ai achieved the highest AUC among all tested tools
Originality.ai produced the strongest ROC curve, with an AUC score of 0.97, the highest of the tools compared. An AUC of 1.0 would indicate perfect separation and 0.5 would be no better than chance, so this score means Originality’s AI Checker distinguishes well between AI-generated and human-written text.
Finding 4: Originality.ai stood out in the confusion matrix results
The highest true positives = 5,547 (Correctly identified AI text).
The lowest false negatives = 243 (AI text incorrectly identified as human).
The second lowest false positives = 94 (Human text incorrectly identified as AI).
The second highest true negatives = 5,696 (Correctly identified human text).
On the last two measures (false positives and true negatives), Originality.ai ranked second, by a close margin of 17 samples compared to the other tools.
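As a rough cross-check, the headline metrics can be recomputed from the four confusion-matrix counts listed above. This is a minimal sketch rather than the study's own code: it assumes AI-generated text is the positive class (as the labels in the list suggest) and uses the standard textbook definitions. The variable names are illustrative.

```python
# Confusion-matrix counts reported for Originality.ai.
# Assumption: "positive" means AI-generated text.
tp = 5547  # AI text correctly identified as AI
fn = 243   # AI text incorrectly identified as human
fp = 94    # human text incorrectly identified as AI
tn = 5696  # human text correctly identified as human

total = tp + fn + fp + tn

# Standard definitions of the four summary metrics.
accuracy = (tp + tn) / total            # share of all samples labelled correctly
precision = tp / (tp + fp)              # share of "AI" verdicts that were right
recall = tp / (tp + fn)                 # share of AI samples that were caught
f1 = 2 * precision * recall / (precision + recall)  # harmonic mean

print(f"samples:   {total}")
print(f"accuracy:  {accuracy:.1%}")
print(f"precision: {precision:.1%}")
print(f"recall:    {recall:.1%}")
print(f"f1-score:  {f1:.1%}")
```

The counts sum to the 11,580 samples in the dataset, and the computed values round to the reported 97% accuracy, 98% precision, 96% recall, and 97% F1-score.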
Final Thoughts
In this study, Originality.ai was the most reliable and effective tool for AI-generated text detection, with high accuracy, precision, and recall. The author’s in-depth research and fair evaluation provide a clear comparison of the performance of AI text detection tools. The results make a strong case for Originality.ai as a tool for anyone who wants to ensure the authenticity of their work.
Founder / CEO of Originality.ai. I have been involved in the SEO and Content Marketing world for over a decade. My career started with a portfolio of content sites. More recently I sold 2 content marketing agencies, and I am the Co-Founder of MotionInvest.com, the leading place to buy and sell content websites. Through these experiences I understand what web publishers need when it comes to verifying that content is original. I am not For or Against AI content; I think it has a place in everyone's content strategy. However, I believe you as the publisher should be the one making the decision on when to use AI content. Our Originality checking tool has been built with serious web publishers in mind!