Rehan N's Senior Project Blog
Project Title: Adapting AI Detection Tools to Mitigate Biases for ESL Writers
BASIS Advisor: Ms. Ainslie
Internship Location: Remote
Onsite Mentor: N/A
Project Abstract
As AI-generated text from large language models becomes ever more prevalent, professionals in fields ranging from education to journalism have begun to rely on detection tools to assess content authenticity. Yet a growing body of research highlights potential biases in these systems, which is particularly concerning given the lack of transparency about the performance metrics of commercial tools like GPTZero or Turnitin. These systems, often trained on limited linguistic profiles, may disproportionately misclassify writing from ESL (English as a Second Language) speakers as AI-generated, raising ethical concerns about the equity of AI-assisted content evaluation. My project seeks to develop a solution that addresses these biases by analyzing text samples from native English speakers, non-native English speakers from diverse geographic locations and ethnic backgrounds, and AI-generated outputs. I begin by feeding these samples into a customizable open-source AI detector and examining key metrics like accuracy and false positive rate, with a focus on their impact on non-native English speakers. The resulting data quantifies the existing disparities and showcases systemic blind spots in current detection algorithms. Building on these findings, I modify the text-classification model by retraining it with a broader spectrum of linguistic styles to optimize for inclusive performance. Then, I rerun the tests and perform a comparative statistical analysis to evaluate improvements. The post-training results underscore the potential of thoughtful data curation to mitigate biases. Ultimately, my research advocates for greater awareness of how detection tools are developed and deployed and charts a path toward systems that more accurately and more fairly distinguish between human-authored and machine-generated text.
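To make the two metrics named in the abstract concrete, here is a minimal sketch of how accuracy and false positive rate could be computed per writer group. It is not the project's actual pipeline: the function name `rates_by_group`, the group labels, and the ten sample records are all hypothetical placeholders, and the numbers are invented purely for illustration.

```python
from collections import defaultdict


def rates_by_group(records):
    """Compute accuracy and false positive rate for each group.

    records: iterable of (group, is_ai, flagged_ai) tuples, where
    is_ai is the true label and flagged_ai is the detector's verdict.
    (Hypothetical record layout, assumed for this sketch.)
    """
    counts = defaultdict(lambda: {"correct": 0, "total": 0, "fp": 0, "human": 0})
    for group, is_ai, flagged_ai in records:
        c = counts[group]
        c["total"] += 1
        c["correct"] += int(is_ai == flagged_ai)
        if not is_ai:
            # A false positive is human-written text flagged as AI.
            c["human"] += 1
            c["fp"] += int(flagged_ai)
    return {
        group: {
            "accuracy": c["correct"] / c["total"],
            # FPR is undefined for a group with no human-written samples.
            "false_positive_rate": c["fp"] / c["human"] if c["human"] else None,
        }
        for group, c in counts.items()
    }


# Invented toy data, NOT results from the project.
sample = [
    ("native", False, False),
    ("native", False, False),
    ("native", False, True),
    ("native", False, False),
    ("esl", False, True),
    ("esl", False, True),
    ("esl", False, False),
    ("esl", False, False),
    ("ai", True, True),
    ("ai", True, False),
]

results = rates_by_group(sample)
for group, metrics in results.items():
    print(group, metrics)
```

A gap between the false positive rates of the native and ESL groups is the kind of disparity the project sets out to quantify. The false positive rate only applies to human-written samples, so the AI-generated control group reports `None` for it.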
Conclusion
For the last time, hey everyone! It’s hard to believe that this is the last blog post of my whole senior project journey. Over the past few months I have explored a question that initially seemed kind of niche, but turned out to have tons of real world relevance: how can AI detectors be more... Read More
Limitations
Hey guys! So with all the data collection and testing behind me, I’ve finally had a change of pace and begun to transition from the hustle of gathering results to the analysis and write-up portion of my research paper. A large part of this involves putting phase 1 (quantifying the extent... Read More
Results
Hey guys! So the big moment has finally arrived and I’ve wrapped up both the data collection and testing for phase 2 of my research project, which means all the data collection is done and fully synthesized. Let’s get right into the final numbers and what they say about how effective the retraining was. Chinese... Read More
Adaption
Hello again! After all the patience from last week, I’m thrilled to share that I’ve begun to fully dive into phase 2 of the process. The datasets that I have been refining to make the three models are now officially ready to actually be fed into the pipeline for retraining. So, let’s get right into... Read More
Scenes
Hey everyone! This week, I’ve finally officially moved on to the next phase of my research. Now that I’ve confirmed that there is a disproportionate false positive bias, a finding that aligns with existing literature but is more up to date, it’s time to see what adaptations can make a difference. Unfortunately, other... Read More
Analysis
Hey guys! This week, I’ve been wrapping up some of the initial statistical analysis for Phase 1 of my project. As you may recall from previous posts, I’ve been collecting samples for four datasets, which results in 200 text samples to test in total. So, let’s get right into the findings. For starters, the AI-generated... Read More
Testing
Hello everyone! After finalizing my three main datasets and the control one as well, which you might recall from my last post, I’ve finally got to begin the first round of testing how the AI detector performs on the data. This week has been an exciting step because I’m seeing for the first time how... Read More
Datasets
Hey guys! This week, I’ve been finishing up curating the rest of my datasets apart from my control. According to my schedule, this was supposed to have been done last week, but I’ve run into some unexpected delays in the data collection process. As a refresher, my research focuses on biases in AI detection tools... Read More
Distinctions
As I work towards finalizing the datasets that I will be using to test modern AI detection tools on writing from people of differing linguistic backgrounds, I’ve been reflecting on the subtle yet striking distinctions between human writing and AI-generated text. Large Language Models (LLMs) like ChatGPT or DeepSeek are trained on massive datasets... Read More
Ethics
Artificial intelligence is growing at an unprecedented rate in nearly every sector, from healthcare and finance to more creative industries. Naturally, with this growth, AI ethics has become even more relevant, which raises several questions. Who ensures that AI remains fair? Companies? The Government? How do they prevent AI from reinforcing inequalities that exist in... Read More
Control
Hey guys! This week I have been focusing on building the control dataset for my research. As a refresher, my project is on testing the equitable performance of AI detectors like GPTZero for ESL (English as a Second Language) speakers. The control dataset, which is a pool of AI-generated texts that will be inputted into... Read More
Introduction
Hello everyone! I’m Rehan Nagabandi and I’m a senior at BASIS Peoria. I’m excited to welcome you to the introductory post of my senior project blog. So, let’s get right into it! Lately, I’ve found myself drawn into artificial intelligence, particularly “generative AI,” tools like ChatGPT which can generate everything from prose to code with... Read More
