199
Dutch cycling website outspoken about having to close because of AI-related content theft
(www.holland-cycling.com)
"We did it, Patrick! We made a technological breakthrough!"
A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.
AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.
so, when AI scrapes a website do you think it's doing a raw text dump scrape or is it doing a full render?
I have an idea that's low level enough that might poison these scrapers. it will unfortunately break SEO too, but that can be fixed with proper header keys etc.
I think it also scrapes all images with an alt text, as they can be useful for training.