zoukankan      html  css  js  c++  java
  • 爬虫-retrying用法

    文档:https://pypi.org/project/retrying/

    安装

    pip install retrying

    设置最大重试次数

    # coding=utf-8
    import requests
    from retrying import retry
    
    headers = {"User-Agent":"Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.80 Safari/537.36"}
    
    @retry(stop_max_attempt_number=3)  # 如果报错 则最大重试次数为3  stop_max_attempt_number设置最大重试次数
    def _parse_url(url, method,data,proxies):
        print("*" * 20)
        if method=="POST":
            response = requests.post(url, data=data, headers=headers, proxies=proxies)
        else:
            response = requests.get(url, headers=headers, timeout=3, proxies=proxies)
        assert response.status_code==200
        return response.content.decode()
    
    def parse_url(url, method="GET", data=None, proxies={}):
        try:
            html_str = _parse_url(url, method, data, proxies)
        except:
            html_str = None
    
        return html_str
    
    if __name__ == '__main__':
        url = "www.baidu.com"
        print(parse_url(url))
  • 相关阅读:
    循序渐进学习栈和队列
    循序渐进学习数据结构之线性表
    2018 noip 备战日志
    2018 noip 考前临死挣扎
    二维前缀和与差分
    noip模板复习
    宽搜总结
    强联通总结
    二分图再次总结
    NOIP 2016 换教室(期望dp)
  • 原文地址:https://www.cnblogs.com/yifengs/p/11724629.html
Copyright © 2011-2022 走看看