zoukankan      html  css  js  c++  java
  • scrapy-继承默认的user-agent 中间件

    class MyUserAgentMiddleware(UserAgentMiddleware):
    
        def __init__(self, user_agent):
            self.user_agent = user_agent
    
        @classmethod
        def from_crawler(cls, crawler):
            o = cls(Faker())
            crawler.signals.connect(o.spider_opened, signal=signals.spider_opened)
            return o
    
        def process_request(self,request,spider):
            request.headers.setdefault('User-Agent',self.user_agent.user_agent())
    

      

    配置 

    DOWNLOADER_MIDDLEWARES = {
    #'middle.middlewares.MyCustomDownloaderMiddleware': 543,
    'middle.middlewares.MyUserAgentMiddleware': 100,
    'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware': None,
    }

    注意 @classmethod不能删除
    #!/usr/bin/python
    # -*- coding: UTF-8 -*-
     
    class A(object):
        bar = 1
        def func1(self):  
            print ('foo') 
        @classmethod
        def func2(cls):
            print ('func2')
            print (cls.bar)
            cls().func1()   # 调用 foo 方法
     
    A.func2()               # 不需要实例化
    

      



  • 相关阅读:
    wait waitpid
    达梦备份还原
    sigprocmask阻塞信号
    sigaction信号处理
    dd命令
    linux系统启动过程
    cpio建立、还原备份档
    configure详解
    Git入门基础教程
    一篇文章了解Github和Git教程
  • 原文地址:https://www.cnblogs.com/brady-wang/p/9703109.html
Copyright © 2011-2022 走看看