Fiction Analytics Site Prosecraft Shut Down After Author Backlash

Prosecraft.io, a site that used novels to help power a data-driven project to display word count, passive voice, and other much more subjective, writing-style markers such as vividness, shut down today after authors protested the project. Prosecraft used the full text of over 25,000 books—which is entirely copyrighted material—in order to develop a library of data. Authors, once they caught wind of what was happening, immediately hated this.

How DARE you, @benji_smith

I demand you take my book off your site immediately. I do not consent to this, and never did. And I know my publisher never would pic.twitter.com/QvPkRme5pr

— Zattack The Block (@ZachRoseWriter) August 7, 2023

Zach Rosenberg was the author who first brought this site to the larger attention of authors on X, the site formerly known as Twitter. Pretty soon, more and more authors spoke out, including high-profile authors like Jeff VanderMeer (The Southern Reach trilogy), Indra Das (The Devourers), Gretchen Felker-Martin (Manhunt)

Remove all books and analysis for Jeff VanderMeer. You absolutely do not need a title by title run down. Just run a search on your own damn site.

— Jeff VanderMeer (@jeffvandermeer) August 7, 2023

I think you can safely assume that the default for any artist or writer is 'doesn't want them to be there' (there being any AI training project) unless you have their written and confirmed consent. Also, please remove my book (The Devourers) from this as well, thanks.

— Indrapramit Das (@IndrapramitDas) August 7, 2023

I've just discovered that MANHUNT has been uploaded to a content mining site so that it can be indifferently plagiarized by anyone who wants to feed it into their so-called "AI".@benji_smith, I demand you remove my work from your site immediately.

— Gretchen Felker-Martin (@scumbelievable) August 7, 2023

Part of this is because Prosecraft has admitted to using “AI algorithms.” In a blog post dated October 5, 2018, Benji Smith, the developer of both Prosecraft and the writing program Shaxpir that was based on the data mined from Prosecraft’s library, stated that “we taught our machine-learning [AI] algorithms to recognize which kinds of words can be used in which kinds of contexts, by looking at the types of words and phrases that tend to occur within similar sentences and paragraphs.” Additionally, he wrote that Shaxpir “[analyzed] more than 560 million words of fiction, from more than 5,800 books, written by more than 3,300 popular authors.” He does not disclose where he received those works of fiction, or whether or not he received permission to do so.

While the technology used is not necessarily a large language generative model like ChatGPT, it is not a stretch to say that incorporating generative LLM algorithms could have been on the horizon for Prosecraft. And since the site had a massive library of books, author’s fears are incredibly valid. In the wake of this backlash, Smith has written a lengthy blog on mediumexplaining why he voluntarily took down Prosecraft.

Although Prosecraft was only using portions of the text, it did not have permission from any authors or publishers to create a database based on the entire work of an author or the full text of a book. Smith wrote on the blog, “since I was only publishing summary statistics, and small snippets from the text of those books, I believed I was honoring the spirit of the Fair Use doctrine, which doesn’t require the consent of the original author.”

While this holds some water, Fair Use does not, by any stretch of the imagination, allow you to use an author’s entire copyrighted work without permission as a part of a data training program that feeds into your own “AI algorithm.” While this situation is certainly going to be a lesson for many people, it’s clear that authors are not going to allow their work to be used to train LLMs and vector networks.

Update August 8, 11:35 a.m.: Fixed the mistaken legal definition where copyrighted works were referred to as ‘copywritten.’ io9 sincerely regrets the error.

Want more io9 news? Check out when to expect the latest Marvel, Star Wars, and Star Trek releases, what’s next for the DC Universe on film and TV, and everything you need to know about the future of Doctor Who.

Fiction Analytics Site Prosecraft Shut Down After Author Backlash

Sign up for our newsletters

Latest news

The Asteroid That Killed the Dinosaurs May Not Have Done It Exactly How We Thought

Toshiba 65-Inch LED 4K UHD Smart Fire TV Is 53% Off, Letting You Buy It for Portable Monitor Money

Astronomers Found the Sun’s Missing Silver Hiding in Plain Sight

Galaxy Watch Ultra Is Now Hundreds Cheaper Than Buying Directly From Samsung as a Grade-A Refurbished Model

LG Monitors Fill PCs With Adware, and It’s Not Just Recent Displays

‘Avatar Aang: The Last Airbender’ Is Sensational

This Year’s Budget Pixel Might Be Less of a Cop-Out

Amazon Offloads This 15.6″ Portable Monitor at 50% Off, Built-in Speaker With Protective Case

Latest Reviews

Anker Solix S2000 Review: The Little 2kWh Battery That Could

SwitchBot Home Dashboard Review: An E Ink Smart Display for the Weather-Obsessed

Asus ROG Kithara Review: A Huge Gaming Headset With Even Bigger Sound

Geekom A9 Max (2026) Review: Not Much ‘Max’ About It

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

You Know What Your Bathroom Needs? A Smart Mirror With Party Lighting

Narwal Freo Z10 Turbo Review: Midrange Vacuum, High-End Performance

Related Articles

Fiction Analytics Site Prosecraft Shut Down After Author Backlash

Sign up for our newsletters

The Asteroid That Killed the Dinosaurs May Not Have Done It Exactly How We Thought

Toshiba 65-Inch LED 4K UHD Smart Fire TV Is 53% Off, Letting You Buy It for Portable Monitor Money

Astronomers Found the Sun’s Missing Silver Hiding in Plain Sight

Galaxy Watch Ultra Is Now Hundreds Cheaper Than Buying Directly From Samsung as a Grade-A Refurbished Model

LG Monitors Fill PCs With Adware, and It’s Not Just Recent Displays

‘Avatar Aang: The Last Airbender’ Is Sensational

This Year’s Budget Pixel Might Be Less of a Cop-Out

Amazon Offloads This 15.6″ Portable Monitor at 50% Off, Built-in Speaker With Protective Case

Anker Solix S2000 Review: The Little 2kWh Battery That Could

SwitchBot Home Dashboard Review: An E Ink Smart Display for the Weather-Obsessed

Asus ROG Kithara Review: A Huge Gaming Headset With Even Bigger Sound

Geekom A9 Max (2026) Review: Not Much ‘Max’ About It

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

You Know What Your Bathroom Needs? A Smart Mirror With Party Lighting

Narwal Freo Z10 Turbo Review: Midrange Vacuum, High-End Performance

Related Articles

The Best Budget Laptops Under $1,000 for Back to School

The Best Tech to Level Up Summer 2026

Apple Is Coming for the People Building OpenAI’s Future

China Just Dropped Another Bomb on America’s Frontier AI Companies

Body Bags Found Outside OpenAI HQ as Execs Increasingly Fear for Their Lives

OpenAI Just Launched Its First Hardware Product—and It’s a Tiny Keyboard for Bossing Around AI Agents