zoukankan      html  css  js  c++  java
  • scrapy-splash常用设置

    # Splash服务器地址
    SPLASH_URL = 'http://localhost:8050'

    # 开启Splash的两个下载中间件并调整HttpCompressionMiddleware的次序
    DOWNLOADER_MIDDLEWARES = {
    'scrapy_splash.SplashCookiesMiddleware': 723,
    'scrapy_splash.SplashMiddleware': 725,
    'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware': 810,
    }

    # 设置去重过滤器
    DUPEFILTER_CLASS = 'scrapy_splash.SplashAwareDupeFilter'

    # 用来支持cache_args(可选)
    SPIDER_MIDDLEWARES = {
    'scrapy_splash.SplashDeduplicateArgsMiddleware': 100,
    }
     
    #使用Splash的Http缓存,那么还要指定一个自定义的缓存后台存储介质
    HTTPCACHE_STORAGE = 'scrapy_splash.SplashAwareFSCacheStorage'
  • 相关阅读:
    1004. Counting Leaves (30)
    51Nod 1272 最大距离 (栈或贪心)
    D
    M
    N
    F
    E
    L
    A. Office Keys ( Codeforces Round #424 (Div. 1, rated, based on VK Cup Finals) )
    K
  • 原文地址:https://www.cnblogs.com/mahailuo/p/11287599.html
Copyright © 2011-2022 走看看