zoukankan html css js c++ java

利用php抓取蜘蛛爬虫痕迹的示例代码

// 获取蜘蛛爬虫名或防采集
function isSpider(){
    $bots = array(
        'Google'    => 'googlebot',
        'Baidu'     => 'baiduspider',
        'Yahoo'     => 'yahoo slurp',
        'Soso'      => 'sosospider',
        'Msn'       => 'msnbot',
        'Altavista' => 'scooter ',
        'Sogou'     => 'sogou spider',
        'Yodao'     => 'yodaobot'
    );
    $userAgent = strtolower($_SERVER['HTTP_USER_AGENT']);
    foreach ($bots as $k => $v) {
        if (strstr($v, $userAgent)) {
            return $k;
            break;
        }
    }
    return false;
}

// 获取哪种蜘蛛爬虫后保存蜘蛛痕迹。
// 根据采集时HTTP_USER_AGENT是否为空来防止采集
// 抓蜘蛛爬虫
$spi  = isSpider();
if ($spi) {
    $tlc_thispage = addslashes($_SERVER['HTTP_USER_AGENT']);
    $file = 'robot.txt';
    $time = date('Y-m-d H:i:s',mktime());
    $handle = fopen($file,'a+');
    $PR = $_SERVER['REQUEST_URI'];
    fwrite($handle, "Time:{$time} ROBOT:{$spi} AGENT:{$tlc_thispage} URL:{$PR} 

");
    fclose($handle);
}

查看全文

相关阅读:
django rest framework renderer
django集成celery
Celery
ajax csrftoken
验证码刷新、倒计时
 C++ const关键字以及static关键字
 git 查看当前仓库地址以及设置新的仓库地址
 C++ explicit关键字
 SSD训练网络参数计算
 C++ opencv调用resize修改插值方式遇到的坑

原文地址：https://www.cnblogs.com/chenjiacheng/p/6628354.html