this post was submitted on 24 Aug 2025
-45 points (19.2% liked)
Technology
I was wondering, is there anything preventing AI from training on the content on Lemmy?
No
Difficult. Even if your instance blocks it, copies of the content are all over the place.
If you wanted to train on Lemmy data, you could just pretend to be an instance and have all the public instances push their data to you through federation. No scraping required, and you get all the metadata and context you could possibly want.
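To give a rough idea of what "all the metadata and context" means, here is a minimal sketch of reading one federated activity as it might arrive at an instance's inbox. It assumes the payload is JSON using the ActivityStreams 2.0 field names (`type`, `actor`, `object`, `attributedTo`, `inReplyTo`, `published`, `content`). The sample payload, the `example.social` URLs, and the `summarize_activity` helper are all made up for illustration, and real Lemmy payloads carry more fields than this.

```python
import json

# Hypothetical sample payload: the field names follow the ActivityStreams 2.0
# vocabulary, but the values and URLs are invented for illustration.
SAMPLE_ACTIVITY = json.dumps({
    "type": "Create",
    "actor": "https://example.social/u/alice",
    "object": {
        "type": "Note",
        "id": "https://example.social/comment/1",
        "attributedTo": "https://example.social/u/alice",
        "inReplyTo": "https://example.social/post/1",
        "content": "<p>Hello fediverse</p>",
        "published": "2025-08-24T12:00:00Z",
    },
})


def summarize_activity(raw: str) -> dict:
    """Pull the author, thread context, timestamp and text out of one activity."""
    activity = json.loads(raw)
    obj = activity.get("object")
    # "object" may be a bare URL string instead of an embedded object;
    # in that case there is nothing further to read here.
    if not isinstance(obj, dict):
        obj = {}
    return {
        "activity_type": activity.get("type"),
        "actor": activity.get("actor"),
        "object_type": obj.get("type"),
        "in_reply_to": obj.get("inReplyTo"),
        "published": obj.get("published"),
        "content": obj.get("content"),
    }


summary = summarize_activity(SAMPLE_ACTIVITY)
print(summary)
```

The point is just that every pushed item already names its author, its parent in the thread, and its timestamp, so nothing has to be reconstructed from scraped HTML.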
Yep. We have Fediseer for such instances though.