The trainer didn’t reply, and docked Ostovitz’s grade.
Ostovitz’s mother, Stephanie Rizk, says her daughter is a high-achieving pupil who cares about doing nicely in class and she or he was alarmed when the trainer jumped to conclusions about Ostovitz’s work so early within the faculty yr.
“Get to know their degree of ability, after which perhaps your AI detector is beneficial,” Rizk says.
Rizk advised NPR she met with the trainer in mid-November and the trainer mentioned they by no means noticed her daughter’s message.

The college district, Prince George’s County Public Faculties, made clear in a press release that Ostovitz’s trainer used an AI detection device on their very own and that the district doesn’t pay for this software program.
“Throughout employees coaching, we advise educators to not depend on such instruments, as a number of sources have documented their potential inaccuracies and inconsistencies,” the assertion mentioned.
PGCPS declined to make Ostovitz’s trainer accessible for an interview. Rizk advised NPR that after their assembly, the trainer not believed Ostovitz used AI.
However what occurred to Ostovitz isn’t stunning.
Greater than 40% of surveyed Sixth- to Twelfth-grade lecturers used AI detection instruments over the past faculty yr, in response to a nationally representative poll by the Middle for Democracy and Expertise, a nonprofit that advocates for civil rights and civil liberties within the digital age.
That’s regardless of numerous research studies displaying that AI detection instruments are removed from dependable.
“It’s now pretty nicely established within the tutorial integrity area that these instruments will not be match for objective,” says Mike Perkins, a number one researcher on tutorial integrity and AI at British College Vietnam.
Perkins discovered that a few of the hottest AI detectors — together with Turnitin, GPTZero and Copyleaks — flagged some issues as AI that weren’t, and vice versa. Their accuracy charges dropped even additional when AI textual content was manipulated to seem extra human.
“We noticed some actually regarding issues with a few of the most prolific AI textual content detection instruments,” he says.
Regardless of these issues, NPR discovered that college districts from Utah to Ohio to Alabama are spending hundreds of {dollars} on these instruments.
Why one of many nation’s largest districts makes use of AI detection software program
Close to Miami, Broward County Public Faculties is spending greater than $550,000 on a three-year contract with Turnitin. The long-standing ed-tech firm has traditionally supplied colleges with plagiarism detection software program; in 2023, it launched an AI detection function. When educators put pupil work by this device, it generates a share, which displays the quantity of textual content the software program determines was possible generated by AI. One caveat: According to the company, scores of 20% or decrease are much less dependable.
“The Turnitin device is one thing that helps us facilitate dialog and suggestions, not grading,” says Sherri Wilson, director of modern studying for the Broward faculty district, which enrolls greater than 230,000 college students and is among the largest faculty districts within the nation.
Wilson says the district is “completely conscious” of the analysis displaying AI detection instruments, together with Turnitin, aren’t 100% correct or dependable.
Turnitin additionally acknowledges this: On the company’s website, it says, “our AI writing detection could not at all times be correct … so it shouldn’t be used as the only foundation for opposed actions towards a pupil.”
Turnitin wrote in a press release to NPR that it’s extra essential to keep away from falsely accusing college students of dishonest than to catch all AI writing.
Wilson says the Turnitin device continues to be precious as a result of it saves lecturers time by rapidly scanning pupil work for suspected AI use.
Another excuse that Broward lecturers have entry to the device, Wilson says, is that the district participates in tutorial applications, resembling Worldwide Baccalaureate, or IB, by which pupil work have to be authenticated by lecturers earlier than it’s despatched out for exterior assessment.
Each of the applications Broward affords, IB and Worldwide Schooling at Cambridge, advised NPR that colleges will not be required to make use of AI detection software program as a part of the authentication course of. Nonetheless, Broward advised NPR in a press release, “now we have chosen to offer our lecturers with [Turnitin] as one of many instruments to satisfy the necessities.”
However Wilson says lecturers are the final word authority on whether or not a pupil’s work is their very own — not the AI detection device.
“They’re utilizing these instruments as suggestions to then have these teachable moments with college students,” she says.
Why one trainer makes use of AI detection instruments
Language and literature trainer John Grady says, for him, AI detection instruments present “a leaping off level” to begin a dialog with a pupil who could have used AI.

“It’s actually not foolproof,” he says. “But it surely offers you one thing to hold your hat on.”
Grady teaches at Shaker Heights Excessive Faculty, a part of the Shaker Heights Metropolis Faculty District outdoors Cleveland. The district serves roughly 4,400 college students, and is paying GPTZero, one other AI detection software program firm, about $5,600 this yr for annual licenses for 27 of the district’s lecturers. The device calculates a share chance {that a} pupil’s work is AI-generated.
Grady says he places all pupil essays by GPTZero; if the device exhibits greater than a 50% chance AI was used for the task, Grady digs deeper. That features utilizing revision historical past instruments to see how a lot time a pupil spent on an task, and what number of edits they made in the course of the writing course of. If it seems that a pupil made just a few edits and spent hardly any time writing, he’ll test in with that pupil.
“And I’ll say, ‘Hey, this flagged. Are you able to speak to me about why?’ I’d say the majority of the time, like 75%, if it was AI, they’d be like, ‘Yeah, I did.’ And I’m like, ‘OK, nicely now you’ve obtained to rewrite it with much less credit score,’” Grady says.
Edward Tian, co-founder and CEO of GPTZero, says that is how educators ought to be utilizing his firm’s device.
“We positively don’t consider it is a punishment device,” Tian says. “This must be a device within the toolkit and never the ultimate smoking gun.”
He says it’s essential to grasp {that a} GPTZero likelihood rating beneath 50% means it’s extra possible the textual content was human versus AI-generated. He says scores over 50% warrant nearer examination — like what Grady describes.
Tian doesn’t dispute the analysis that exhibits GPTZero isn’t at all times dependable. However he notes that there are educators, like Grady, who nonetheless discover it precious for the data it supplies.
He says that instruments like his provide a “sign on what’s taking place in your classroom” however that lecturers ought to at all times comply with up with college students if that sign exhibits one thing regarding.
The AI detection skeptics
Shaker Heights junior Zi Shi, whose first language is Mandarin, says his writing type can typically seem like AI “due to the repetition of phrases I exploit. I really feel prefer it’s due to how restricted my vocabulary is.”
Shi — who isn’t a pupil of Grady’s — says he’s nonetheless engaged on his writing expertise and he’s involved that AI detection software program is likely to be biased towards non-native English audio system like himself.
Some educators share this concern, although the analysis up to now is restricted and contradictory.
Shi says an task he accomplished for his English class earlier this fall was flagged by GPTZero as probably AI-generated. He says his trainer advised that his use of a web-based device known as Grammarly could have triggered the detection software program. Grammarly makes use of AI to appropriate grammar and, if prompted, generate textual content. (The trainer confirmed Shi’s account with NPR.)
Shi says he solely used Grammarly to scrub up his writing and that he wrote the task himself. “It was positively disappointing to see the remark of it being flagged as AI,” Shi says.
Shi thinks AI detectors needs to be considered a “smoke alarm, the place it’s an indication, or warning. However, you realize, typically it could possibly be like a false alarm.”
He questions whether or not the varsity district needs to be spending hundreds of {dollars} on AI detection software program. He says that cash could possibly be higher spent on skilled improvement for lecturers.
Carrie Cofer, a highschool English trainer within the Cleveland Metropolitan Faculty District — only a few miles from Shaker Heights — shares that view.
Final yr, as an experiment, she uploaded a chapter of her Ph.D. dissertation into GPTZero. “And it got here up with like 89% or 91% AI-written, and I’m like, ‘Oh, no, I don’t suppose that’s proper, as a result of it was all mine,’” Cofer says.

Cofer helps her district form its AI coverage and pointers; she says Cleveland colleges don’t presently pay for AI detection software program and she or he’d advocate towards it.
“I don’t suppose it’s an efficacious use of their cash,” Cofer says. “The youngsters are going to get round it by some means.”
Some workarounds that college students may flip to incorporate utilizing AI detection software program themselves, to workshop assignments so that they don’t get flagged, and utilizing “AI humanizer” programs, which declare to make AI-generated writing seem extra human.
In the end, she says, lecturers might want to adapt to AI by altering how they educate and assess pupil studying.
Again in Maryland, highschool junior Ailsa Ostovitz can be adapting. She now runs all her homework assignments by a number of AI detection instruments earlier than she turns them in.
The writing is her personal, she says, however she’ll rewrite sentences the software program identifies as probably AI-generated, an additional step that provides about half an hour to each task.
“I believe I’ve positively turn into extra vigilant about presenting my work as mine and never AI,” she explains.
She doesn’t need to take any possibilities.
This reporting was supported by a grant from the Tarbell Center for AI Journalism.


