zoukankan      html  css  js  c++  java
  • python 多线程,多进程,高效爬虫

    1.多线程
    from concurrent.futures import ThreadPoolExecutor import requests def fetch_async(url): response = requests.get(url) return response url_list = ['http://www.github.com', 'http://www.bing.com'] pool = ThreadPoolExecutor(5)          # 线程池 for url in url_list:               # 迭代器 pool.submit(fetch_async, url) pool.shutdown(wait=True)



    2.多进程

    from concurrent.futures import ProcessPoolExecutor
    import requests

    def fetch_async(url):
      response = requests.get(url)
      return response


    url_list = ['http://www.github.com', 'http://www.bing.com']    # 迭代对象
    pool = ProcessPoolExecutor(5)        # 进程池
    for url in url_list:               # 迭代器
      pool.submit(fetch_async, url)
    pool.shutdown(wait=True)

  • 相关阅读:
    CF732 F Tourist Reform——边双连通分量
    CF36 E Two Paths——欧拉(回)路
    最小生成树()
    Bellman-Ford算法
    SPFA算法
    归并排序(Merge_Sort)
    并查集
    int(3)与int(11)的区别
    mysql应该看的blog
    mysql建立索引
  • 原文地址:https://www.cnblogs.com/jum-bolg/p/10836513.html
Copyright © 2011-2022 走看看