Wikipedia is under assault: rogue users keep posting AI-generated nonsense

ForgottenFlux@lemmy.world · 3 months ago (edited)

schizo@forum.uncomfortable.business · 3 months ago

Ah, so the AI version of the Chewbacca defense.

I have to wonder if intentionally shitting on LLMs with plausible nonsense is effective.

Like, you watch for certain user agents and change what data you actually send the bot vs what a real human might see.
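
Roughly what that could look like, as a minimal sketch rather than anything a real site is known to run. The function names and the marker list are made up for illustration, and the list would need to be kept up to date by hand:

```python
# Illustrative substrings to look for in the User-Agent header.
# This list is an assumption for the sketch, not a complete or current registry.
BOT_MARKERS = ("gptbot", "ccbot", "bytespider")


def is_known_bot(user_agent: str) -> bool:
    """Return True if the User-Agent string contains a listed crawler marker."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)


def choose_body(user_agent: str, real_text: str, decoy_text: str) -> str:
    """Serve the decoy text to listed crawlers and the real text to everyone else."""
    return decoy_text if is_known_bot(user_agent) else real_text
```

This only catches crawlers that announce themselves honestly in the header.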

Dragonstaff@leminal.space · 3 months ago

I suspect it would be difficult to generate enough data to intentionally change a dataset. There are certainly little holes, like the glue pizza thing, but finding and exploiting them would be difficult, while noticing you and blocking you as a data source would be easy.

Petter1@lemm.ee · 3 months ago

I never said I think it is smart…

T156@lemmy.world · 3 months ago

"I have to wonder if intentionally shitting on LLMs with plausible nonsense is effective."

I don’t think so. The volume of data is too large for it to make much of a difference, and a scraper can just mimic a human user agent and work that way.

You’d have to change so much data consistently across so many different places that it would be near-impossible for a single human effort.
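
To illustrate the user-agent point, here is a toy sketch. The check, the crawler name, and the header strings are all hypothetical, and real bot detection usually looks at more than this one header:

```python
def looks_like_bot(user_agent: str) -> bool:
    """Naive check: trust whatever the client claims to be."""
    return "bot" in user_agent.lower()


# A crawler that identifies itself honestly (hypothetical name).
honest_scraper = "ExampleBot/1.0"

# The same crawler sending a browser-style header instead (illustrative string).
spoofing_scraper = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) Firefox/128.0"

print(looks_like_bot(honest_scraper))    # True
print(looks_like_bot(spoofing_scraper))  # False
```

The User-Agent header is supplied by the client, so a filter keyed on it alone only stops scrapers that choose to identify themselves.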