cmd下执行scrapy爬虫程序,不报错也没有输出,解决方案
想要执行parse能够在cmd看到parse函数的执行结果:
解决方法:
settings.py 中设置 ROBOTSTXT_OBEY = False
案例:
day96day96spiderschouti.py
# -*- coding: utf-8 -*- import scrapy import sys import io sys.stdout = io.TextIOWrapper(sys.stdout.buffer,encoding='gb18030') class ChoutiSpider(scrapy.Spider): name = 'chouti' allowed_domains = ['chouti.com'] start_urls = ['http://dig.chouti.com/'] def parse(self, response): # print(response.url) # print(response.text) content=str(response.body,encoding='utf-8') print(content)
day96day96settings.py
1
|
ROBOTSTXT_OBEY = False |