zoukankan      html  css  js  c++  java
  • Python处女贴:下载字体

    刚刚开始学习python,在这个时代,让脚本代替人去做些事情还是很值得的。

    下载吧,帮我去把某个网站上所有的字体下载下来…

    import urllib
    from sgmllib import SGMLParser
    
    class URLLister(SGMLParser):
        def reset(self):
            SGMLParser.reset(self)
            self.urls = []
    
        def start_a(self, attrs):
            href = [v for k, v in attrs if k == 'href' and v.endswith('.zip')]
            if href:
                self.urls.extend(href)
    
    Exts = 'a,b,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z,no'
    URLs = ['http://www.3lian.com/ziti/zt2/fonts-' + ele + '.htm' for ele in Exts.split(',')]
    
    downloadURL = []
    
    for url in URLs:
        http = urllib.urlopen(url)
        htmlSrc = http.read()
        http.close()
    
        parser = URLLister()
        parser.feed(htmlSrc)
        parser.close()
    
        downloadURL.extend(parser.urls)
    
    downloadURL = ['http://www.3lian.com/ziti/zt2/' + ele for ele in downloadURL]
    
    for url in downloadURL:
        filename = 'D:\\TDdownload\\font-en\\' + url.split('/')[-1]
        print filename
        zipfile = open(filename, 'wb')
    
        print 'download --- ' + url
    
        http = urllib.urlopen(url)
        zipfile.write(http.read())
        http.close()
        zipfile.close
    
  • 相关阅读:
    【转】wpa_supplicant与wpa_cli之间通信过程
    CSS Hack
    HTML5测试(二)
    HTML5测试(一)
    百分号编码(URL编码)
    DOM事件处理函数
    JS数组
    JS中for循环嵌套
    Codecombat 游戏攻略(计算机科学三)2
    Codecombat 游戏攻略(计算机科学三)
  • 原文地址:https://www.cnblogs.com/hydonlee/p/1455048.html
Copyright © 2011-2022 走看看