Image verification -- keeps bots out, and humans too!

Sapo · August 20, 2007, 12:51pm

equally effective for bots and stupid humans. Very good.

fusoya · August 20, 2007, 2:24pm

That dog & cat solution is great. I don’t think the trivia question one would be very effective, because there are only so many trivia questions that can be written, and it wouldn’t take very long for the EVIL SPAMMERS to figure out every single question that can be asked, and have their spamming bots recognize the questions, and then program them to answer them.

The original reason I made this pit is to point out that some websites are making image verifications that not only are so hard for humans to read (by using crazy color combinations) but have effects that wouldn’t hurt a computer - I’m sure an OCR has WAY WAY less trouble distinguishing between similar colors than a human eye, so I think they’re keeping less legit humans out than EVIL SPAMMERS! It also came to my realization that blind users are completely left in the dark (pun intended) when it comes to getting registered!

Ludovic · August 20, 2007, 2:44pm

I think we should dust off and nuke 'em from orbit. It’s the only way to be sure.

Mangetout · August 20, 2007, 2:49pm

I stumbled across a collection of unreadable or otherwise inappropriate CAPTCHAs a day or two ago - but I can’s seem to find it again - most of them were just completely illegible blobs, but there were a few that contained dozens and dozens of readable characters, plus one or two where the random string had happened to spell out a valid, but offensive word or phrase (like this one.

Mighty_Girl · August 20, 2007, 2:52pm

Each webmaster can create his/her own questions. Each would be unique to a site and they can be changed as often as one wishes. I think it would be pretty hard for a bot to crack that.

Furthermore, if you run a niche site you can ask a question that only other enthusiasts would know, if you wish to keep non-enthusiasts out of your site.

CaerieD · August 20, 2007, 4:00pm

If the puppies and kittens aren’t in the same order every time, that won’t work. Set it to randomize the order of the images with a large pool of potential images to use. Each of the five images is randomly of a different common animal or object. The script asks to select the image of the kitten/cup/puppy/duckling/shoe/etc, followed by another level of the same brief, simple security so that the bot would have to be randomly correct twice in a row to get through. Anyone capable of reading this would also be capable of identifying the images, but it would be far more difficult to get a bot that could identify each one of those images.

Sapo · August 20, 2007, 4:16pm

Actually, stupid questions are all that is needed. “What colour is a red car” is just as good to weed people from bots as any other.

Encinitas · August 20, 2007, 4:22pm

But I’m still not convinced that fusoya is human!

pulykamell · August 20, 2007, 4:52pm

I’m not understanding your explanation. If the puppies and kittens aren’t in the same order every time, and your pool is five images, guessing the first image every single time should get you a success rate of about 20% (1 in 5). If you have two levels of this, the success rate is about 4% (1 in 25). For a bot, not a problem.

If the puppies and kittens stay in the same order, then the bot just needs to randomize its selection every time, and the hit rate still should average out to 20% over time for the first level and 4% through both levels.

What am I missing here?

CaerieD · August 20, 2007, 5:28pm

:smack: You’re not missing anything. That’s just what I get for trying to fix the flaws in a fundamentally flawed concept. The proper identification of all of the images, as Left Hand of Dorkness mentioned, is a far better idea.

Jayrot · August 20, 2007, 6:09pm

The big point that most people miss when trying to come up with some clever new CAPTCHA system like the kittens and puppies, is that there has to be a finite number of possible pictures. Even if there are a thousand pictures of cats and dogs a human could easily go through and associate each one with the proper definition. Then all the bot has to do is recognize (most likely a hashed down version of the jpeg) and consult the list.

YES, the squiggly letters are uniquely generated and obfuscated ON THE FLY.

The ideas of trivia or cuteness and everything else is clever, but ultimately won’t work because you can’t have a computer program easily WRITE a unique question every time (or go out and snap a unique picture of a cat or dog).

gazpacho · August 20, 2007, 6:15pm

It would be pretty easy to add some random changes to the images on the fly such that they would not give the same checksum every time.

Jayrot · August 20, 2007, 6:33pm

Yes, but they’d have to be pretty substantial changes to confuse a clever computer (checksum is, of course, only one way to do it). Pretty soon you have a jpeg that’s been so warped (on the fly) that you have the same problem you were trying to avoid in the first place: “What the hell is that?”

netsplit · August 20, 2007, 8:10pm

yep:p

took a whole 5 seconds to fill in the blanks vs a minute or two of “is that a zero or the letter o? and is that a t or perhaps an l or I with one of those random lines through it?”

Mighty_Girl · August 20, 2007, 8:13pm

If you run a large site that thousands of spammers want to crack you might need to go for multiple levels of security, for the smaller sites, like mine, the questiosn work fantastically. You can add as many questions as you want, each would be unique to your site and you can change them as often as you want. A bot will still have to be walked through by a human. There is no bot capable of answering certain questions, however simple.

If bots are walked through by humans, like they are doing now, no amount of “on the fly” levels of security will work.

netsplit · August 20, 2007, 8:31pm

yep it is!

Yep it is!

We had some sneak through even with image verification. The best anyone can figure is its all automated except the verification. So that the bot does all the signing up then shows the image to a human who types in the letters. A spammer could still quiet efficiently spam heaps of sites that way. The question seems to throw em off us enough that they don’t bother.

Capt.Ridley_s_Shooting_Party · August 20, 2007, 9:54pm

Only if the computer is selecting the classification randomly. As has already been mentioned, image analysis is a massive research area in AI and results are significantly better than random (~99% accuracy in some domains).

Mighty_Girl · August 20, 2007, 10:00pm

From what I can see that seems to be the case. All bots seem to follow the same pattern. Which makes it even easier to prune accounts that fit that pattern. It is still a massive PITA.

borschevsky · August 20, 2007, 10:17pm

One mechanism spammers use for this is to set up their own site with a captcha in the registration, but the captcha is actually pulled from the site they’re trying to spam. The user of their site fills out the captcha, and behind the scenes their site has a bot that passes this through the spammed site.

Mangetout · August 20, 2007, 11:27pm

You could present a 3D-rendered picture of a cat/kitten or dog/puppy, in a randomised pose each time, with a randomised coat pattern, eye colour, background, etc. (like Nintendogs)

Topic		Replies	Views
Image verification fields anger me The BBQ Pit	41	4457	December 17, 2008
Would a picture identifying question work in place of CAPTCHA? Factual Questions	33	4194	September 27, 2011
How do spambots get past "human-proving" tests? Factual Questions	22	1911	June 13, 2006
Are captchas more trouble than they are worth? Factual Questions	25	2308	October 15, 2008
I've had it with these fucking Turing tests. The BBQ Pit	56	5649	December 16, 2009

Image verification -- keeps bots out, and humans too!

Related topics