Page 1 of 1

is_bot failed useragents

Posted: Fri Jun 07, 2013 2:13 am
by Osiris
Hi,

We have recently been looking at the bot traffic hitting our sites and have noticed that there are a few bots sneaking passed our WUFRL detection. We are using the latest version of the definition file June 2, 2013 and v1.4.4.0 of the .Net API.

Here are the UserAgents that are not being correctly identified:
  • Mozilla/5.0 (Linux; U; Android 2.2.1; ja-jp; SC-02B Build/FROYO) AppleWebKit/533.1 (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.1.5
  • Mozilla/5.0 (Linux; U; Android 4.1.1; ja-jp; SC-03E Build/JRO03C) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.1.5
  • Mozilla/5.0 (Linux; U; Android 4.1.2; ja-jp; 201M Build/9.8.2Q-34_SMJ-102) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 YJApp-ANDROID jp.co.yahoo.android.yjtop/1.7.6
  • Mozilla/5.0 (Linux; U; Android 4.0.3; ja-jp; F-05D Build/V09R32B) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.1.2
  • Mozilla/5.0 (Linux; U; Android 4.1.2; ja-jp; SO-02E Build/10.1.D.0.343) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.1.5
  • Mozilla/5.0 (compatible; Konqueror/3.5; Linux) KHTML/3.5.5 (like Gecko) (Exabot-Thumbnails)
  • Mozilla/5.0 (Linux; U; Android 4.0.4; ja-jp; SC-03D Build/IMM76D) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.1.5
  • Mozilla/5.0 (Linux; U; Android 2.3.4; ja-jp; IS11T Build/FGK400) AppleWebKit/533.1 (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1 YJApp-ANDROID jp.co.yahoo.android.yjtop/2.0.5
  • YahooCacheSystem
  • WordPress/3.5.1; http://.* (different version numbers)
The last one "WordPress" is the WordPress both that starts with the word WordPress, followed by the version and then the URL that was searched/ visited.

Also the Capabilities doc http://www.scientiamobile.com/wurflCapability says that is_bot bot will return default, is_bot, is_not_a_bot while it is currently returning me true or false? Which behaviour should I be expecting?

Let me know if you need any help with the above list and if you think you will be able to detect them in the future.
Thanks for the help.

Re: is_bot failed useragents

Posted: Fri Jun 07, 2013 9:30 am
by sriram
Hi Osiris,

Thank you for your post and heads-up about the is_bot capability. We are currently working on optimizing the bot detection logic in the WURFL API and we'll also make sure the capability values are fixed.

Thanks and cheers!

Re: is_bot failed useragents

Posted: Mon Jun 10, 2013 1:14 am
by Osiris
sriram wrote:Hi Osiris,

Thank you for your post and heads-up about the is_bot capability. We are currently working on optimizing the bot detection logic in the WURFL API and we'll also make sure the capability values are fixed.

Thanks and cheers!
Not a problem sriram. If you could, please let us know when you release the above mentioned changes. Good luck with the work!