Britain’s data protection regulator, the Information Commissioner’s Office (ICO), is scrutinizing the legality of web scraping to collect data to train generative AI models.

  • BetaDoggo_@lemmy.world
    link
    fedilink
    arrow-up
    14
    ·
    edit-2
    11 months ago

    I’m not sure why it would be any different from how this is treated with search engines. Both scrape massive amounts of openly available data and make it available in some form. Any training data or information that a model could potentially spit out is already available through a search engine’s index.