how can the spammers pass the recaptcha ?

deniz · November 2012

i realize that fake users manage to be registered, is there any extra precaution for protecting

mcu_hq · November 2012

There are cheap living areas in the world like India where paying an actual human to spam is a business. If your community is large enough, then your users should be able to self moderate things by flagging spam posts. An active and loyal forum is your protection.

I've seen sites (yii for example) that do not allow links posted in messages for users that have either just registered or have a low post count.

HalfCat · November 2012

I can also recommend the BotStop plugin: http://vanillaforums.org/addon/botstop-plugin

peregrine · November 2012

As Halfcat said you can't beat botstop. Even better with approval, then no real need for recaptcha. And as anonymouse pointed out applicants can still pm. with a fix here

http://vanillaforums.org/discussion/comment/169912/#Comment_169912

Anonymoose · November 2012

There are also tools like which make use of image manipulation techniques to bypass captchas.

Averaging is a common method in physics to reduce noise in input data. The averaging attack can be used on image-based captchas if the following conditions are met:

The predominant distortion in the captcha is of noise-like nature. It is possible to extract a series of different images with the same information encoded in them. Averaging of a series of images can be used to improve image quality (reduce distortion, or improve signal-to-noise ratio, so to say) of captchas and hence to make them more easily recognizable by OCR (optical character recognition) systems.

The fact that noise and payload behave differently on "reload" is exploited. This allows the program to separate them and hence defeat the captcha without the need for a sophisticated algorithm.
http://en.wikipedia.org/wiki/XRumer

HalfCat · November 2012

This does not work with reCaptcha though. The distortion is not the only factor but also the different scan quality and font usage make it close to impossible to defeat with algorithms. However, it has been cracked multiple times in the past. The most recent one - to my knowledge - was by defeating the audio code which is mainly designed for blind users. Also there is always the possibility of letting real people solve it. It mustn't necessarily be by paid people but also could be done by users who expect to unlock something else but are in fact solving a captcha for a registration bot. So many possibilities

It is safe to say that the safest captcha is the one that you customized on your own. BotStop is a good way to go.

OnlyAnExcuse · November 2012

Only half of a recaptcha is actually needed. You can enter the half it needs and anything else as the other half. The way to tell is the "other half" is that it pretty much uses the same text style in every one.

The "other half" is you translating things for Google.

HalfCat · November 2012

@OnlyAnExcuse said:
Only half of a recaptcha is actually needed. You can enter the half it needs and anything else as the other half. The way to tell is the "other half" is that it pretty much uses the same text style in every one.

The "other half" is you translating things for Google.

That's not what I experienced and read about. The "other half" is just a part of text that others have solved correctly. It is in no way similar looking.

OnlyAnExcuse · November 2012

@HalfCat I was under the impression the capchas were solved by users, and the best answer of the "other half" was used in translating (meaning anything could be added), while the other side was already translated and a correct answer expected for that.

LeeH · November 2012

@HalfCat said:
That's not what I experienced and read about. The "other half" is just a part of text that others have solved correctly. It is in no way similar looking.

Straight from source:

Each new word that cannot be read correctly by OCR is given to a user in conjunction with another word for which the answer is already known. The user is then asked to read both words. If they solve the one for which the answer is known, the system assumes their answer is correct for the new one.

The unidentified words are verified by many different users, but you can enter whatever text you'd like. The control word must be identified correctly, but not the second one. You are assisting Google & others' OCR efforts by identifying it yourself.

HalfCat · November 2012

@OnlyAnExcuse That is correct. However, this does not imply that the part that is already known does always look the same. It is also taken from random books, has a different font, scan quality etc.

@LeeH Yes, I know.

how can the spammers pass the recaptcha ?

Comments