Multiple artificial intelligence companies are circumventing a common web standard used by publishers to block the scraping of their content for use in generative AI systems, content licensing startup TollBit has told publishers.
Sounds like we’re all going to need to start putting the equivalent of Trap Streets in all our web content, source code, etc.
I heard someone has already had success placing nonsense in a white-on-white box on their site, then later querying a commercial AI to prove it was ingested without permission.
Here’s another example/variant (The Office - Recorder)
My fear is that those techniques will make life increasingly hard for people using screen readers.
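For anyone wanting to try the white-on-white idea without hurting screen reader users, here is a rough sketch in Python. It assumes a random hex token is "nonsense" enough to be unique, and that marking the element `aria-hidden="true"` keeps it out of what assistive technology reads aloud. The function names (`make_canary`, `hidden_canary_html`, `appears_in`) are made up for illustration and do not come from any tool mentioned in this thread.

```python
import html
import secrets


def make_canary(prefix: str = "canary") -> str:
    """Return a unique nonsense token that is unlikely to occur anywhere else."""
    # 8 random bytes rendered as 16 hex characters
    return f"{prefix}-{secrets.token_hex(8)}"


def hidden_canary_html(token: str) -> str:
    """Wrap the token in a white-on-white span that is hidden from assistive tech."""
    safe = html.escape(token)
    return (
        '<span aria-hidden="true" '
        'style="color:#fff;background:#fff;user-select:none">'
        f"{safe}</span>"
    )


def appears_in(token: str, model_output: str) -> bool:
    """Case-insensitive check for the token in text returned by a model."""
    return token.lower() in model_output.lower()


if __name__ == "__main__":
    token = make_canary()
    print(hidden_canary_html(token))
```

Keep a dated record of each token and the page it went on. A model repeating the token is a strong hint that the page was ingested, though it is not proof on its own.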
There is probably a way to poison AI training material, and it could be a handy feature for social media.
Anti Commercial-AI license