Tbh, I’d be less testy about bots scraping my sites for AI input IF they respected my robots.txt file and didn’t slam the server. They’re just rude and I don’t like it. Sometimes they’re so rude it’s effectively a DOS attack.
Tbh, my sites exist to get information out there and I don’t care if someone mirrors my sites, as long as the information is still accurate.
I mean, that’s great and you’re well within your rights, but that’s not what people generally say when they express outrage about AI scraping. People straight up call it theft very often and seem to consider using online content for training is the equivalent of copying or distributing it.
Which stands out to me because that was not what happened when the EU decided that Google News was effectively piracy after a whole bunch of news outlets complained. The consensus there seemed to be that it was a bummer to lose the service despite all the scraping.
Oh yeah, I get that there’s more than 2 reasons to be upset about AI scraping. I work in the academic library world and the vibe here is
bots are rude
AI is not a reliable source of facts
We work with facts and information, and I have no expectation that my collection of facts is something to defend against replication.
On the other hand, I’d be pissed AF if someone stole my research paper on 1800s family drama and reprinted it without attribution, or AI-hallucinated new pseudo-facts that were not in the source materials.
Edit: my situation isn’t that of others and I totally get why artists and authors would be upset about AI bots stealing their work.
Tbh, I’d be less testy about bots scraping my sites for AI input IF they respected my robots.txt file and didn’t slam the server. They’re just rude and I don’t like it. Sometimes they’re so rude it’s effectively a DOS attack.
Tbh, my sites exist to get information out there and I don’t care if someone mirrors my sites, as long as the information is still accurate.
I mean, that’s great and you’re well within your rights, but that’s not what people generally say when they express outrage about AI scraping. People straight up call it theft very often and seem to consider using online content for training is the equivalent of copying or distributing it.
Which stands out to me because that was not what happened when the EU decided that Google News was effectively piracy after a whole bunch of news outlets complained. The consensus there seemed to be that it was a bummer to lose the service despite all the scraping.
Oh yeah, I get that there’s more than 2 reasons to be upset about AI scraping. I work in the academic library world and the vibe here is
We work with facts and information, and I have no expectation that my collection of facts is something to defend against replication.
On the other hand, I’d be pissed AF if someone stole my research paper on 1800s family drama and reprinted it without attribution, or AI-hallucinated new pseudo-facts that were not in the source materials.
Edit: my situation isn’t that of others and I totally get why artists and authors would be upset about AI bots stealing their work.