  • Scraping images from a WeChat Official Account article

    # coding=utf-8
    import re
    import requests
    
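    def spider(paperurl, picpat):
        """Fetch the article page and save every image URL captured by picpat."""
        # Send a browser-like User-Agent with the page request.
        headers = {
            "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/54.0.2840.99 Safari/537.36"}
    
        html = requests.get(paperurl, headers=headers).text
    
        pattern = re.compile(picpat)
    
        # Each match is the captured data-src value, i.e. one image URL.
        data = pattern.findall(html)
    
        # Number the images in page order; the index becomes the file name.
        for i, picurl in enumerate(data):
            print(picurl)
            write(picurl, i)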
    
    
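    def write(picurl, i):
        """Download one image and save it as <i>.jpg in the output folder."""
        data = requests.get(picurl).content
        # The e:\jpeg folder must already exist; open() does not create it.
        #with open(r"e:\jpeg\{}.jpeg".format(i), "wb") as f:
        with open(r"e:\jpeg\{}.jpg".format(i), "wb") as f:
            f.write(data)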
    
    
    if __name__ == '__main__':
        paperurl = r"https://mp.weixin.qq.com/s/7huJOS7G4HEZYx1FtOTxIw"
        # [\s\S]*? lazily matches any characters, newlines included; the group captures the data-src URL.
        picpat = r'<img data-ratio="[\s\S]*?" data-src="(.*?)" data-type="jpeg" data-w="[\s\S]*?"'
        #paperurl = r"https://mp.weixin.qq.com/s/8VeDZztRqTP6PeopKv7BjQ"
        #picpat = r'<img data-ratio="[\s\S]*?" data-src="(.*?)" data-type="jpeg" data-w="[\s\S]*?"'
        spider(paperurl, picpat)
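
The pattern can be checked offline before it is pointed at a live page. The sketch below is only an illustration: `SAMPLE_HTML`, the `example.com` URLs, and the helper name `extract_image_urls` are made up for the demo and are not part of the original script. It assumes the page markup lists the `img` attributes in the same order the pattern expects, which may not hold for every article.

```python
import re

# Same pattern as in the script above, with the \s\S character class.
PICPAT = r'<img data-ratio="[\s\S]*?" data-src="(.*?)" data-type="jpeg" data-w="[\s\S]*?"'

# Hypothetical markup imitating the attribute order the pattern expects.
SAMPLE_HTML = (
    '<p><img data-ratio="0.75" data-src="https://example.com/a.jpg" '
    'data-type="jpeg" data-w="1080" /></p>\n'
    '<p><img data-ratio="1.5" data-src="https://example.com/b.png" '
    'data-type="png" data-w="640" /></p>\n'
    '<p><img data-ratio="0.5" data-src="https://example.com/c.jpg" '
    'data-type="jpeg" data-w="800" /></p>'
)


def extract_image_urls(html, picpat):
    """Return every data-src value captured by picpat, in page order."""
    return re.compile(picpat).findall(html)


if __name__ == "__main__":
    for url in extract_image_urls(SAMPLE_HTML, PICPAT):
        print(url)
```

On this sample only the two entries marked `data-type="jpeg"` are returned; the PNG entry is not. If a real page yields an empty list, the attribute order or the `data-type` value in its markup probably differs from what the pattern assumes.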
  • Original article: https://www.cnblogs.com/gisoracle/p/14491859.html