Generative AI improvements increasingly come from better data curation and collection rather than from architectural changes. Big Tech has an advantage.
It’s not clear if this is piracy. In the US, it’s obviously an ongoing fight. Basically, what you describe is “books3”, put together with scripts by Aaron Swartz.
It’s legal in Japan, if the purpose is only AI training and not enjoyment. I’m not sure if there are issues regarding DRM or such.
In the EU, the dataset and resulting model would be illegal. Any business offering the model would be in hot water, but I think internal use would be fine.