黑帽联盟

 找回密码
 会员注册
查看: 1863|回复: 1
打印 上一主题 下一主题

[其它] 搜索引擎爬虫蜘蛛的UserAgent收集

[复制链接]

895

主题

38

听众

3329

积分

管理员

Rank: 9Rank: 9Rank: 9

  • TA的每日心情
    难过
    昨天 22:31
  • 签到天数: 1652 天

    [LV.Master]伴坛终老

    百度爬虫
        * Baiduspider+(+http://www.baidu.com/search/spider.htm”)

    google爬虫
        * Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
        * Googlebot/2.1 (+http://www.googlebot.com/bot.html)
        * Googlebot/2.1 (+http://www.google.com/bot.html)

    雅虎爬虫(分别是雅虎中国和美国总部的爬虫)
        *Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html”)
        *Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp”)

    新浪爱问爬虫
        *iaskspider/2.0(+http://iask.com/help/help_index.html”)
        *Mozilla/5.0 (compatible; iaskspider/1.0; MSIE 6.0)

    搜狗爬虫
        *Sogou web spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07″)
        *Sogou Push Spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07″)

    网易爬虫
        *Mozilla/5.0 (compatible; YodaoBot/1.0; http://www.yodao.com/help/webmaster/spider/”; )

    MSN爬虫

        *msnbot/1.0 (+http://search.msn.com/msnbot.htm”)



    GOOGLE
    ---------------------------------------------------------------------
    66.249.70.212 - - [11/Jan/2016:00:03:35 -0700] "GET  www.cnblackhat.com/user-f2fc990265c712c49d51a18a32b39f0c.html?umid=f2fc990265c712c49d51a18a32b39f0c HTTP/1.1" 200 8148 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
    Referer: ""
    UserAgent: "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

    66.249.70.212 - - [11/Jan/2016:03:27:23 -0700] "GET  index.jpg HTTP/1.1" 200 2367 "-" "Googlebot-Image/1.0"
    Referer: ""
    UserAgent: "Googlebot-Image/1.0"

    209.85.238.7 - - [11/Jan/2016:00:02:58 -0700] "GET  www.cnblackhat.com/rss/c/1009 HTTP/1.1" 404 37 "-" "Feedfetcher-Google; (+http://www.google.com/feedfetcher.html; 10 subscribers; feed-id=8474979256887526569)"
    Referer: ""
    UserAgent: "Feedfetcher-Google; (+http://www.google.com/feedfetcher.html; 10 subscribers; feed-id=8474979256887526569)"


    百度
    ---------------------------------------------------------------------
    60.28.22.38 - - [11/Jan/2016:01:28:09 -0700] "GET  www.cnblackhat.com/vwsoft-vwantileechs-download.html?pr=vwantileechs&vi=download HTTP/1.1" 200 27406 "http://www.vidun.com/" "Baiduspider+(+http://www.baidu.com/search/spider.htm)"
    Referer: ""
    UserAgent: "Baiduspider+(+http://www.baidu.com/search/spider.htm)"


    YAHOO
    ---------------------------------------------------------------------
    202.160.180.81 - - [11/Jan/2016:00:02:44 -0700] "GET  www.cnblackhat.com/ HTTP/1.0" 200 14250 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html)"
    Referer: ""
    UserAgent: "Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html)"

    67.195.37.167 - - [11/Jan/2016:00:23:00 -0700] "GET  www.cnblackhat.com/postmsg-tech-2-120.html?type=tech&id=2&tid=120 HTTP/1.0" 200 12609 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
    Referer: ""
    UserAgent: "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"


    有道
    ---------------------------------------------------------------------
    2016-03-04 09:54:12 W3SVC226223753 222.33.192.54 GET /index.PHP - 80 - 61.135.219.7 Mozilla/5.0+(compatible;+YodaoBot/1.0;+http://www.yodao.com/help/webmaster/spider/;+) - 200 0 0
    Referer: ""
    UserAgent: "Mozilla/5.0 (compatible; YodaoBot/1.0; http://www.yodao.com/help/webmaster/spider/; )"


    61.135.249.120 - - [11/Jan/2016:09:44:46 -0700] "GET  www.cnblackhat.com/robots.txt HTTP/1.1" 404 - "-" "Mozilla/5.0 (compatible; YodaoBot/1.0; http://www.yodao.com/help/webmaster/spider/; )"
    Referer: ""
    UserAgent: "Mozilla/5.0 (compatible; YodaoBot/1.0; http://www.yodao.com/help/webmaster/spider/; )"


    SOSO
    ---------------------------------------------------------------------
    58.61.164.207 - - [11/Jan/2016:03:13:53 -0700] "GET  www.cnblackhat.com/robots.txt HTTP/1.1" 404 - "http:// www.cnblackhat.com/robots.txt" "Sosospider+(+http://help.soso.com/webspider.htm)"
    Referer: ""
    UserAgent: "Sosospider+(+http://help.soso.com/webspider.htm)"

    2016-03-04 10:48:28 W3SVC226223753 222.33.192.54 GET /index.php - 80 - 124.115.4.218 Sosoimagespider+(+http://help.soso.com/soso-image-spider.htm) http:// www.cnblackhat.com/ 200 0 0
    Referer: ""
    UserAgent: "Sosoimagespider+(+http://help.soso.com/soso-image-spider.htm)"


    Sogou
    ---------------------------------------------------------------------
    219.234.81.41 - - [11/Jan/2016:03:26:49 -0700] "GET  www.cnblackhat.com/ HTTP/1.0" 200 14250 "-" "Sogou Web Sprider(compatible; Mozilla 4.0; MSIE 6.0; Windows NT 5.1; SV1; Avant Browser; InfoPath.1; .NET CLR 2.0.50727; .NET CLR1.1.4322)"

    Referer: ""
    UserAgent: "Sogou Web Sprider(compatible; Mozilla 4.0; MSIE 6.0; Windows NT 5.1; SV1; Avant Browser; InfoPath.1; .NET CLR 2.0.50727; .NET CLR1.1.4322)"


    220.181.61.217 - - [11/Jan/2016:13:10:57 -0700] "GET www.cnblackhat.comyouxigao.com/play/3615?id=3615 HTTP/1.1" 302 5 "-" "Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)"

    Referer: ""
    UserAgent: "Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)"

    220.181.19.74 - - [11/Jan/2016:06:20:37 -0700] "GET  www.cnblackhat.com/vwsoft-vwantileechs-download.html?pr=vwantileechs&vi=download HTTP/1.1" 200 27406 "-" "Sogou Orion spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)"
    Referer: ""
    UserAgent: "Sogou Orion spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)"


    220.181.19.78 - - [11/Jan/2016:10:55:18 -0700] "GET  www.cnblackhat.com/robots.txt HTTP/1.1" 404 - "http://pic.sogou.com/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
    Referer: "http://pic.sogou.com/"
    UserAgent: "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"

    219.234.81.27 - - [11/Jan/2016:23:53:41 -0700] "GET www.cnblackhat.com/ HTTP/1.1" 200 14271 "-" "Sogou-Test-Spider/4.0 (compatible; MSIE 5.5; Windows 98)"
    Referer: ""
    UserAgent: "Sogou-Test-Spider/4.0 (compatible; MSIE 5.5; Windows 98)"

    2016-03-04 17:23:59 W3SVC226223753 222.33.192.54 HEAD /index.php - 80 - 220.181.19.107 Sogou+head+spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07) - 200 0 0


    帖子永久地址: 

    黑帽联盟 - 论坛版权1、本主题所有言论和图片纯属会员个人意见,与本论坛立场无关
    2、本站所有主题由该帖子作者发表,该帖子作者与黑帽联盟享有帖子相关版权
    3、其他单位或个人使用、转载或引用本文时必须同时征得该帖子作者和黑帽联盟的同意
    4、帖子作者须承担一切因本文发表而直接或间接导致的民事或刑事法律责任
    5、本帖部分内容转载自其它媒体,但并不代表本站赞同其观点和对其真实性负责
    6、如本帖侵犯到任何版权问题,请立即告知本站,本站将及时予与删除并致以最深的歉意
    7、黑帽联盟管理员和版主有权不事先通知发贴者而删除本文

    勿忘初心,方得始终!

    0

    主题

    0

    听众

    22

    积分

    黑帽菜鸟

    Rank: 1

  • TA的每日心情

    2019-10-23 21:16
  • 签到天数: 8 天

    [LV.3]偶尔看看II

    已有 1 人评分黑币 收起 理由
    定位 -4 再灌水,直接封号

    总评分: 黑币 -4   查看全部评分

    回复

    使用道具 举报

    您需要登录后才可以回帖 登录 | 会员注册

    发布主题 !fastreply! 收藏帖子 返回列表 搜索
    回顶部