  • Async coroutines: a small web crawler


    import time

    import gevent
    from gevent import monkey

    # Patch the standard library before anything else imports it, so that
    # blocking I/O calls yield to other greenlets instead of blocking the program.
    monkey.patch_all()

    from urllib import request


    def f(url):
        print('GET: %s' % url)
        resp = request.urlopen(url)
        data = resp.read()
        print('%d bytes received from %s.' % (len(data), url))


    urls = ['https://www.python.org/',
            'https://www.yahoo.com/',
            'https://github.com/']

    # Synchronous: fetch the URLs one after another.
    time_start = time.time()
    for url in urls:
        f(url)
    print("sync cost", time.time() - time_start)

    # Asynchronous: one greenlet per URL, all waiting on the network concurrently.
    async_time_start = time.time()
    gevent.joinall([gevent.spawn(f, url) for url in urls])
    print("async cost", time.time() - async_time_start)


    GET: https://www.python.org/
    50114 bytes received from https://www.python.org/.
    GET: https://www.yahoo.com/
    505000 bytes received from https://www.yahoo.com/.
    GET: https://github.com/
    65396 bytes received from https://github.com/.
    sync cost 3.5022003650665283
    GET: https://www.python.org/
    GET: https://www.yahoo.com/
    GET: https://github.com/
    65396 bytes received from https://github.com/.
    50114 bytes received from https://www.python.org/.
    504996 bytes received from https://www.yahoo.com/.
    async cost 1.332076072692871
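
The gap between the two timings above comes from overlapping the waits: run one after another, the total is roughly the sum of the individual request times, while run concurrently it is closer to the slowest single request. The sketch below reproduces that effect without touching the network. It is not the gevent code from this post: it swaps in the standard library's `asyncio`, and `FAKE_LATENCY`, `fake_fetch`, `run_sequential`, `run_concurrent` and `compare` are hypothetical names with made-up delays, chosen only for illustration.

```python
import asyncio
import time

# Hypothetical latencies (seconds) standing in for real network round trips.
FAKE_LATENCY = {
    'https://www.python.org/': 0.05,
    'https://www.yahoo.com/': 0.15,
    'https://github.com/': 0.10,
}


async def fake_fetch(url):
    """Pretend to download url; awaiting the sleep yields control,
    much like waiting on a socket does under gevent."""
    await asyncio.sleep(FAKE_LATENCY[url])
    return url


async def run_sequential(urls):
    """Wait for each fetch to finish before starting the next one."""
    start = time.perf_counter()
    done = [await fake_fetch(url) for url in urls]
    return done, time.perf_counter() - start


async def run_concurrent(urls):
    """Start every fetch at once and wait for all of them together."""
    start = time.perf_counter()
    done = await asyncio.gather(*(fake_fetch(url) for url in urls))
    return list(done), time.perf_counter() - start


def compare(urls):
    """Run both strategies and return their results and elapsed times."""
    seq_done, seq_cost = asyncio.run(run_sequential(urls))
    con_done, con_cost = asyncio.run(run_concurrent(urls))
    return seq_done, seq_cost, con_done, con_cost


if __name__ == '__main__':
    _, seq_cost, _, con_cost = compare(list(FAKE_LATENCY))
    print('sequential cost', seq_cost)
    print('concurrent cost', con_cost)
```

With these assumed delays the sequential run should take a little over 0.30 s (the sum) and the concurrent run a little over 0.15 s (the longest delay). It is the same pattern as the 3.5 s versus 1.3 s measured above, where real network jitter makes the ratio less tidy.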

  • Original post: https://www.cnblogs.com/rongye/p/9983491.html