INFO: Ignoring response <503 http://www.xicidaili.com/nn>: HTTP status code is not handled or not allowed 用scrapy爬虫 - 走看看

zoukankan html css js c++ java

INFO: Ignoring response <503 http://www.xicidaili.com/nn>: HTTP status code is not handled or not allowed 用scrapy爬虫

用scrapy爬取http://www.xicidaili.com/nt/1（国内ip）是启动小蜘蛛一直报错，将网址换成百度是可以进入parse。

错误：

2018-04-17 16:55:52 [scrapy.core.engine] DEBUG: Crawled (503) <GET http://www.xicidaili.com/nn> (referer: None)
2018-04-17 16:55:53 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <503 http://www.xicidaili.com/nn>: HTTP status code is not handled or not allowed

在setting中设置

HTTPERROR_ALLOWED_CODES = [503] #忽略503页面（不建议使用）

HTTPERROR_ALLOWED_CODES默认: [] 忽略该列表中所有非200状态码的response。

重新启动小蜘蛛没问题了但实际问题仍没解决

查看全文

相关阅读:
防止重复点击
 刷新当前页面的几种方法
 PHP删除数组中空值
 json转化数组
 两个不能同时共存的条件orWhere查询
 SQLSTATE[42000]
laravel一个页面两个表格分页处理
 Hash::make与Hash::check
unbind()清除指定元素绑定效果
 二级联动

原文地址：https://www.cnblogs.com/dahuag/p/8868003.html

Copyright © 2011-2022 走看看