I... have my doubts. I do not doubt that a wider variety of poisoned data can improve training, by implementing new ways to filter out unusable training data. In itself, this would, indeed, improve the model.
But in many cases, the point of poisoning is not to poison the data, but to deny the crawlers access to the real work (and provide an opportunity to poison their URL queue, which is something I can demonstrate as working). If poison is served instead of the real content, that will hurt the model, because even if it filters out the junk, it will have access to less new data to train on.
Same here. We have a joint account, I'm the sole earner, apart from mortgage and utilities and whatnot, she's spending all of it, and that's great, because she does it much better than I could. I suck at managing money, she does not. I wish I could give her more money.