It’s been a bit (im very bad at keeping up with blogs) and i had a question about privacy and bot concerns.
With news that several comapnies are using bots to try and crawl the web and use peopes writing to train AI (see https://twitter.com/AngFdz/status/1598440293103460352?t=-k8Ju6ca9770TCfRgA5eUA&s=33 and i'm glad to have shared in your perils : r/AO3 - Sudowrites scraping and mining AO3 for... ) are write.as blogs and posts indexed? Are there ways to keep a bot from trawling our content?
I’d assume this woild concern p much all users considering theres a posisbility they could use user generated content on the site and profit off it.
matt
October 27, 2023, 6:47pm
3
These bots probably don’t run javascript, so that solution might not work. I think we’ll need to add something at the platform level, or as an option on all blogs. Here’s the latest discussion:
Naturally I’ve been following all the “AI” developments lately, trying it out, and listening / thinking about its implications. If you already follow me, or saw the April Fools’ day “AI” we launched, you’ll have a pretty good idea of my general thoughts on it.
This recent Washington Post article brought the whole extraction side to light again, showing how these tools “train” on all the lovely language we publish on this writing platform of ours, among other places across the web. Here’s 170,00…