  • Python crawler project: 一见倾心壁纸 (Love at First Sight Wallpapers)

    Source: https://www.cnblogs.com/xdd1997/p/11962969.html
    import re
    import urllib.request


    def getHtml(url):
        # Fetch the page and return its raw bytes.
        page = urllib.request.urlopen(url)
        html = page.read()
        return html


    def getImage(html, x):
        # Sample image URLs found on the page:
        # https://mmbiz.qpic.cn/mmbiz_jpg/ib55rg6wzUc3B16KIY3uU53nkcTTDic8uEA4WWBPaHJ8LpibvAnkpS2FZtyjrv7w7dbEeNrhfvPuuyReNAxsLdgJA/640?wx_fmt=jpeg
        # https://mmbiz.qpic.cn/mmbiz_jpg/ib55rg6wzUc3B16KIY3uU53nkcTTDic8uEHqocI7r86nehl2NeForAqvcTiaEAIuWjTWPKNXnnXIPuUuqnuJeFKYw/640?wx_fmt=jpeg
        # The regular expression is the key part: the page keeps each image
        # URL in a data-src attribute.
        reg = 'data-src="(.*?)"'
        image = re.compile(reg)
        imlist = image.findall(html.decode('utf-8'))
        print(imlist)
        for i in imlist:
            print(i)
            print(x)
            # Save each image as 1.jpg, 2.jpg, ... using the running counter x.
            urllib.request.urlretrieve(i, '%s.jpg' % x)
            x += 1
        return x


    x = 1
    url = 'https://mp.weixin.qq.com/s/MVDcn0O3093OlIhMYkqBIA'
    html = getHtml(url)
    x = getImage(html, x)
    print('Download complete')
    # The downloaded files are saved to the current working directory,
    # which is the directory of this .py file when the script is run from there.
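
    The script's comments call the `data-src` regular expression the key part, so here is a minimal offline sketch of what that pattern captures. The `extract_image_urls` helper, the `sample` HTML, and the example.com URLs are hypothetical and not from the original post. The `html.unescape` step is also an addition, on the assumption that attribute values may contain `&amp;`; the original script uses the matched strings as they are.

```python
import re
from html import unescape

# Same pattern as in the script: lazily capture everything between
# data-src=" and the next double quote.
DATA_SRC = re.compile(r'data-src="(.*?)"')


def extract_image_urls(html_text):
    """Return every data-src value in html_text, in document order.

    Hypothetical helper for illustration; unescape() turns HTML
    entities such as &amp; back into plain characters.
    """
    return [unescape(u) for u in DATA_SRC.findall(html_text)]


# Made-up markup: two lazy-loaded images and one ordinary src attribute.
sample = (
    '<img data-src="https://example.com/a/640?wx_fmt=jpeg" class="x">'
    '<img data-src="https://example.com/b/640?wx_fmt=png&amp;tp=webp">'
    '<img src="https://example.com/ignored.jpg">'
)

urls = extract_image_urls(sample)
for u in urls:
    print(u)
```

    The plain `src` attribute is skipped because the pattern requires the literal prefix `data-src="`. The lazy `.*?` stops at the first closing quote, so each match is a single attribute value rather than a span running across several tags.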

      

  • Original article: https://www.cnblogs.com/gisoracle/p/12003609.html