Today, a prominent child safety organization, Thorn, in partnership with a leading cloud-based AI solutions provider, Hive, announced the release of an AI model designed to flag unknown CSAM at upload. It’s the earliest AI technology striving to expose unreported CSAM at scale.
Jesus Christ. If someone ever got their hands on this model they could use it to generate new material. The grossest possible AI model to date
deleted by creator
A generative model uses the classifier as part of its training. If you generate a picture of pure random noise, then iteratively pick random noise that the classifier says “looks” more like csam, then you can effectively generate images that the classifier says it’s 100% certain is csam. Whether or not that looks anything like what a human would consider to be csam depends on other factors but it remains a possibility.
deleted by creator
I thought being able to do that was already a thing. This is designed to do the opposite.
I know, I know… bad actors and such.
…but if simple posession defines who a bad actor is…
The irony of this never ceases to amaze me.