The Witch's Mark
The AI Slop Police and the Unfalsifiable Accusation
In February 2026, the YouTuber Frankie’s Shelf published a video essay, more than two and a half hours long, about the horror novel Shy Girl by Mia Ballard. The YouTuber argued the book was likely created by artificial intelligence.
This video was not an isolated piece of content. It was one voice in a chorus that had been building since early 2026. Readers on Goodreads and Reddit had been forensically dissecting isolated passages of Ballard’s novel, identifying what they took to be the fingerprints of machine authorship: flat sentence rhythms, uniform vocabulary, overuse of em-dashes, and the particular brand of structural tidiness associated with generative text.
The stakes were substantial. Ballard’s self-published novel had been picked up by Orbit, a prestigious imprint of Hachette, after acquiring nearly five thousand ratings on Goodreads. It was released in the United Kingdom in November 2025 and was scheduled for a major American launch in April 2026.
By March 2026, after a coordinated campaign of denunciation amplified by social media, Hachette retreated. The American launch was canceled. The British edition was discontinued, and the book was pulled from Amazon globally. Ballard denied writing the novel with AI and attributed any algorithmic residue to an editor she had hired during the self-publishing phase. She told the New York Times that her mental health had reached an “all-time low” and that her professional name had been ruined “for something she didn’t even personally do.”
What is frustrating to me is that the countless readers who believed they could sense the hand of a large language model in Ballard’s prose were almost certainly wrong about their own ability to sense any such thing. But the accuracy of their detection ultimately did not matter. The accusation alone was sufficient to destroy a book’s commercial life. What we are seeing is a digital enforcement apparatus built on a perceptual claim that the empirical evidence does not support, and we are giving it the power of commercial excommunication.
Some call this enforcement apparatus the “AI Slop Police.”
The case for vigilance
I need to acknowledge that the underlying anxiety driving this movement is not irrational. Generative AI has introduced actual harm to creative ecosystems. Creative as well as scholarly integrity is under genuine strain. Artists have watched their labor compete against tools trained, in part, on their uncompensated work. And publishers have every reason to fear the flood of synthetic manuscripts hitting their inboxes.
This is not limited to written or visual content. Music streaming platform Deezer reported in late 2025 that roughly fifty thousand fully AI-generated tracks were being uploaded to its servers each day, up from ten thousand at the start of that year. This made up about 34 percent of their entire streaming catalog. Readers, viewers, and listeners have a legitimate interest in knowing whether the cultural artifacts they consume originated in a human mind. The desire for authenticity is not the problem. The problem is the mechanism we have improvised to enforce it.
The ghosts of plagiarism hunters past
The contemporary hunt for AI content repeats a historic pattern. In the early 1990s, two researchers at the National Institutes of Health, Walter Stewart and Ned Feder, built an automated plagiarism detection system and turned it on the work of established scholars. They later submitted their algorithmic findings to the American Historical Association, primarily targeting the historian Stephen B. Oates. They alleged that his 1977 biography of Abraham Lincoln, With Malice Toward None, had lifted portions from Benjamin P. Thomas’s 1952 work on the same subject.
The American Historical Association eventually cleared Oates, concluding that the overlapping phrases reflected standard historical practice, shared primary sources, and the limited vocabulary available for narrating specific events. Stewart and Feder were censured by their peers for substituting rigid algorithmic logic for nuanced judgment.
The parallel to our current moment is genuine. But the parallel breaks at one crucial point.
The plagiarism hunters were wrong about Oates, but the thing they were hunting was at least a definable object. Plagiarism, in its strong sense, involves the reproduction of specific strings of words that appeared first in a specific prior source. When the accusation is serious, a skeptic can compare the two documents directly. The evidence is public, verifiable, and falsifiable. A charge of plagiarism can be decisively refuted or decisively sustained.
AI authorship has no such evidentiary structure. There is no prior document to compare against. There is only a probability distribution over token sequences. Any given piece of scrutinized prose could have been generated by a human mimicking those probabilities, by a machine sampling from them, or by a human whose natural style happens to resemble them. The vigilantes of the 1990s at least had a falsifiable claim to make. Their AI-hunting descendants do not. The AI accusation is unfalsifiable.
The perceptual chasm
The empirical literature on human AI detection is sobering. In late 2025, Deezer and Ipsos conducted a blind listening study with nine thousand respondents across eight countries, asking participants to distinguish human-made music from fully AI-generated music. An astonishing ninety-seven percent failed.
In the 2024 study “The Great AI Witch Hunt,” Hilda Hadan and her colleagues ran the same experiment on academic gatekeepers. They asked seventeen experienced peer reviewers from top-tier human-computer interaction conferences to classify academic snippets as human-written, AI-paraphrased, or fully AI-generated. The reviewers performed, in the researchers’ words, at coin-flip levels of accuracy. A Penn State study led by Dongwon Lee similarly found that human evaluators correctly distinguish AI text roughly 53 percent of the time in a binary setting, a rate statistically indistinguishable from guessing.
This changes what we should make of the Frankie’s Shelf video. A reviewer who spends two and a half hours explaining how they can feel the machine behind a novel’s sentences is describing an ability humans do not possess. Whatever such feelings are, they are not dependable evidence of authorship.
The linguistic trap
None of this is to say AI text has no identifiable features. Linguistic analysis of large language model output has identified certain consistent traits. The prose tends to exhibit high formality and flawless structure. But many of these traits are also what centuries of composition instruction have taught human writers to produce: clear topic sentences, consistent tone, careful transitions, and grammatical polish. These are therefore not necessarily signs of machine authorship. They are often simply signs of an education in writing.
The consequences of this overlap fall hardest on writers whose English is not their first language. Stanford researchers James Zou and colleagues showed that commercial AI detectors flag legitimate TOEFL essays written by non-native English speakers at alarming rates. The reason is straightforward. Non-native speakers tend to rely on smaller, more predictable vocabularies. The detectors therefore cannot distinguish such a student from a language model, because at the level of surface statistics there is very little to distinguish.
The same pattern penalizes any writer whose prose depends on structural scaffolding. A neurodivergent student using formulaic sentence patterns to organize thought will be flagged. So will a self-taught author who has internalized the conventions of genre fiction, or a professional copywriter trained to produce clear, uncluttered prose. The judge who accepts a detector’s verdict on such work is not, in most cases, identifying AI use. They are automating suspicion against the writers least equipped to defend themselves.
The mark
In early modern Europe, examiners looked for a physical feature on the body of those accused of forming a pact with the devil. The witch’s mark was whatever the examiner decided it was, whether a birthmark, a skin tag, or something else. Whoever was accused could not remove the mark and its interpretation was not theirs to control. The witch’s mark was a surface feature treated as proof of a hidden condition that could not itself be directly observed.
The AI slop police are looking for something remarkably similar. The “AI mark” is a prose cadence, a painterly polish, or a clean grammatical structure. As with the historical mark, the accused cannot easily remove it, because in most cases it is simply the shape their work has always taken.
But the AI mark, unlike its historical predecessor, will not sit still. The witch’s mark was a fixed feature on a human body. The AI mark is a statistical signature produced by software, and software can be retrained. As vigilantes develop sharper heuristics for what algorithmic prose might look like, the technology itself is being continuously redesigned to evade them.
Nous Research, an open-source laboratory, has recently introduced an autonomous pipeline named autonovel. It can generate full book manuscripts without human intervention. The pipeline is explicitly equipped with what its documentation calls anti-slop and anti-pattern modules, designed to inject the very irregularities that vigilantes treat as signs of humanity. Similar developments are visible wherever generative tools compete against detection tools. The heuristic of detection is already obsolete.
From detection to authentication
None of this is an argument against caring about human authorship. I care about it a great deal. Students who submit AI-generated work without doing the cognitive labor of writing are short-changing their own development. I have argued elsewhere that the preservation of that labor is the central pedagogical stake of the generative era. Publishers who market synthetic manuscripts as human-authored are misleading their customers. These are real problems.
But focusing on detection is the wrong approach. Detection proceeds from the product and tries to reconstruct the process. It asks whether a finished object bears the marks of a machine, a question we have now seen humans cannot reliably answer. There is a better way to think about this. Authentication proceeds in the opposite direction. It proceeds from the process and documents the human labor that produced the product.
There are genuine possibilities here. Platforms could build optional authorship trails that timestamp the composition process, and publishers could request drafts and revision histories for works they intend to market as human-authored. None of these methods would be perfect, and none would stop a sufficiently determined bad actor. What they would do is replace accusation with documentation and suspicion with evidence.
The crucial difference is structural. Authentication is something creators can choose to offer. Detection is something inflicted upon them.
An old kind of hunt
I began this essay with Mia Ballard and the video that helped shutter her book’s American publication. Whether and to what degree Shy Girl was assisted by AI, I do not know. Neither can Frankie’s Shelf or Hachette establish this from stylistic evidence alone. The mark the accusers saw is not reliably diagnostic. Readers felt an absence of soul in a novel and translated that feeling into a conviction about its origins, and a multinational publisher panicked.
The desire to defend human creativity is legitimate and important, even admirable. The method of decentralized accusation is not. Every reader and every publisher who accepts the testimony of self-appointed detectors over the empirical record becomes complicit in the next Shy Girl and the one after that. If we genuinely value human authorship, we should be building mechanisms that let human authors prove their work, rather than demanding that they prove the impossible negative under threat of career destruction.
The witch’s mark was never really on the witch. It was always in the examiner’s eye. The AI slop police have not discovered a new capacity for perception. What they have discovered is a new pretext for an old kind of hunt, and the creators they catch will, in the end, be the very humans they set out to protect.
The images in this article were generated with Nano Banana 2.
P.S. I believe transparency builds the trust that AI detection systems fail to enforce. That’s why I’ve published an ethics and AI disclosure statement, which outlines how I integrate AI tools into my intellectual work.






