this post was submitted on 18 Aug 2025
772 points (98.9% liked)

Technology

(page 2) 50 comments
[–] r00ty@kbin.life 13 points 13 hours ago (4 children)

For mbin I managed to kill the attack of the scrapers using only Cloudflare's managed challenge. The only exceptions are POST requests to the fediverse endpoints, and requests from fediverse user agents on certain GET endpoints. Managed challenge on everything else.

So far, they've not gotten past it. But it's only a matter of time.
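
A rough sketch of the rule logic described above, written as plain Python rather than Cloudflare's own rule syntax. The path lists and user-agent markers are hypothetical placeholders, not the values used on the actual instance:

```python
# Hypothetical values for illustration only; not taken from mbin or Cloudflare.
FEDIVERSE_UA_MARKERS = ("mastodon", "lemmy", "mbin")
FEDIVERSE_POST_SUFFIXES = ("/inbox",)
FEDIVERSE_GET_PREFIXES = ("/.well-known/webfinger", "/u/")


def needs_challenge(method: str, path: str, user_agent: str) -> bool:
    """Return True when the request should be sent to the managed challenge."""
    method = method.upper()

    # Exception 1: federation traffic POSTed to fediverse endpoints.
    if method == "POST" and path.endswith(FEDIVERSE_POST_SUFFIXES):
        return False

    # Exception 2: fediverse user agents on selected GET endpoints.
    ua = user_agent.lower()
    is_fediverse_ua = any(marker in ua for marker in FEDIVERSE_UA_MARKERS)
    if method == "GET" and is_fediverse_ua and path.startswith(FEDIVERSE_GET_PREFIXES):
        return False

    # Everything else gets challenged.
    return True
```

User-agent strings are trivially spoofed, so the second exception is the obvious place for a scraper to get through eventually.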

[–] wetbeardhairs@lemmy.dbzer0.com 23 points 14 hours ago (1 children)

Gosh. Corporations are rampantly attempting to access resources so they can perform copyright infringement en masse. I wonder if there is a legal mechanism to stop them? Oh, no, there isn't, because our government is fully corrupted.

[–] aquovie@lemmy.cafe 8 points 13 hours ago (1 children)

I think, in this particular case, it's aggressive apathy/incompetence and not malice. Remember, Trump didn't even know what Nvidia was.

AIs don't have a skin color or use the bathroom, so you can't whip your cult into a frenzy by Othering them. You can't solidify your fascism by getting bogged down in the details of IP law.

[–] Corkyskog@sh.itjust.works 2 points 10 hours ago (1 children)

Just say that the AI will be used to train the immigrants to take der jerbs.

[–] UnderpantsWeevil@lemmy.world 42 points 16 hours ago (2 children)

I mean, we really have to ask ourselves - as a civilization - whether human collaboration is more important than AI data harvesting.

[–] willington@lemmy.dbzer0.com 5 points 11 hours ago* (last edited 11 hours ago)

I was fine before the AI.

The biggest customers of AI are the billionaires who can't hire enough people for their technofeudalist/surveillance capitalism agenda. The billionaires (wannabe aristocrats) know that machines have no morals, no bottom lines, no scruples, don't leak info to the press, don't complain, don't demand to take time off or to work from home, etc.

AI makes the perfect fascist.

They sell AI like it's a benefit to us all, but it ain't that. It's a benefit to the billionaires who think they own our world.

AI is used for censorship, surveillance pricing, activism/protest analysis, making firing decisions, making kill decisions in battle, etc. It's nightmare fuel under our system of absurd wealth concentration.

Fuck AI.

[–] devfuuu@lemmy.world 19 points 16 hours ago* (last edited 16 hours ago) (1 children)

I think every company in the world has been telling everyone for a few months now that what matters is AI data harvesting. There's not even a hint of it being a question. You either accept the AI overlords or get out of the internet. Our ONLY purpose is to feed the machine; anything else is irrelevant. Play along or you shall be removed.

[–] 0x0@lemmy.zip 21 points 16 hours ago (1 children)

It's always a cat-n-mouse game.

[–] Allero@lemmy.today 9 points 15 hours ago (2 children)

Except previously bombarding another person's server for personal gain was illegal.

[–] carrylex@lemmy.world 4 points 14 hours ago

I don't know if this is news to you, but most of the internet never cared about what's legal or not.

[–] Kyrgizion@lemmy.world 17 points 16 hours ago (3 children)

Eventually we'll have "defensive" and "offensive" LLMs managing all kinds of electronic warfare automatically, effectively nullifying each other.

[–] ProdigalFrog@slrpnk.net 28 points 16 hours ago (3 children)

That's actually a major plot point in Cyberpunk 2077. There are thousands of rogue AIs on the net that are constantly bombarding a giant firewall protecting the main net and everything connected to it from being taken over by the AIs.

[–] ChaoticNeutralCzech@feddit.org 2 points 13 hours ago

Obligatory AI ≠ LLM. How would scrapers benefit from the LLMs they help train? The defense is obvious: LLM-generated slop traps against scrapers already exist.

[–] sailorzoop@lemmy.librebun.com 14 points 16 hours ago (1 children)

I'm ashamed to say that I switched my DNS nameservers to CF just for their anti-crawler service.
Knowing Cloudflare, god knows how much longer it'll be free.

[–] AmbiguousProps@lemmy.today 5 points 14 hours ago (1 children)

Did you enable the AI black hole/tarpit? It's the main reason I've used their stuff.

[–] Goretantath@lemmy.world 7 points 14 hours ago (1 children)

I knew that was the worse option. Use the one that traps them in an infinite maze.

[–] aquovie@lemmy.cafe 16 points 14 hours ago (3 children)

You need to properly detect that they're bots first and then they'll just figure out how to spoof that. Then you're back to square one.

Abstractly, proof of work (PoW) doesn't need to determine if you're a bot or not. To make a request, as a human or bot, you need to pay in CPU time. The hope is that the cost is low enough that a human barely notices it, but for a bot trying to hoover up data as fast as possible, the aggregate cost is high.

I think the more horrifying aspect is that they'll just build ever bigger datacenters to crunch POW tests faster and the carbon cost will skyrocket even more.
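
A minimal sketch of the pay-in-CPU-time idea from this comment, assuming a hashcash-style SHA-256 puzzle. It is not the actual implementation of Anubis or any other tool; `solve`, `verify`, the challenge string format, and the difficulty value are all illustrative:

```python
import hashlib
import itertools


def _meets_target(challenge: str, nonce: int, difficulty_bits: int) -> bool:
    """True if SHA-256(challenge:nonce) has at least difficulty_bits leading zero bits."""
    digest = hashlib.sha256(f"{challenge}:{nonce}".encode()).digest()
    return int.from_bytes(digest, "big") < (1 << (256 - difficulty_bits))


def solve(challenge: str, difficulty_bits: int) -> int:
    """Client side: brute-force nonces until one meets the target (expensive)."""
    for nonce in itertools.count():
        if _meets_target(challenge, nonce, difficulty_bits):
            return nonce


def verify(challenge: str, nonce: int, difficulty_bits: int) -> bool:
    """Server side: one hash, so checking a submitted nonce is cheap."""
    return _meets_target(challenge, nonce, difficulty_bits)


if __name__ == "__main__":
    # Each extra difficulty bit roughly doubles the expected client work.
    nonce = solve("example-challenge", 16)
    print(nonce, verify("example-challenge", nonce, 16))
```

The asymmetry is the point: the client needs about 2^difficulty_bits hashes on average, while the server needs one. Nothing here tries to tell bots from humans.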

[–] mic_check_one_two@lemmy.dbzer0.com 10 points 13 hours ago

Exactly. Imagine needing to pay a penny for every request. Not a huge deal for someone who only makes one or two requests per year. But if you’re running a bot farm and making tens of millions of requests per day, you’ll quickly find that your operating costs have skyrocketed. That’s basically the idea behind Anubis: make someone pay in CPU time, so the legit users don’t really notice but bots quickly eat up all of their servers’ CPU.
