  • Libraries needed for a Python web crawler

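    pip install builtwith
    This module takes a URL as its argument, downloads and analyzes the page at that URL,
    and returns the technologies the website uses. Below is an example of using the module.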
    import builtwith
    builtwith.parse('http://example.webscraping.com')
    {'web-servers': ['Nginx'], 'web-frameworks': ['Web2py', 'Twitter Bootstrap'], 'programming-languages': ['Python'], 'javascript-frameworks': ['jQuery', 'Modernizr', 'jQuery UI']}
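The result above is a plain dictionary that maps a technology category to a list of names, so it can be post-processed with ordinary Python. The following is a minimal sketch, not part of builtwith itself: `summarize_technologies` is a hypothetical helper, and `result` is simply the sample output above copied in so the snippet runs without network access.

```python
# Sample result copied from the output above; a live call to
# builtwith.parse() may return different categories and names.
result = {
    'web-servers': ['Nginx'],
    'web-frameworks': ['Web2py', 'Twitter Bootstrap'],
    'programming-languages': ['Python'],
    'javascript-frameworks': ['jQuery', 'Modernizr', 'jQuery UI'],
}


def summarize_technologies(parsed):
    """Return one 'category: name, name' line per category, sorted by category.

    Hypothetical helper for illustration; it only assumes a dict of lists.
    """
    lines = []
    for category in sorted(parsed):
        names = ', '.join(parsed[category])
        lines.append(f'{category}: {names}')
    return lines


for line in summarize_technologies(result):
    print(line)
```

In a real crawler, `result` would presumably come straight from `builtwith.parse(url)`; the summary makes it easier to spot, for example, a JavaScript-heavy site before choosing a scraping approach.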
    Finding the website's owner:
    pip install python-whois
    import whois
    print(whois.whois('http://example.webscraping.com/'))
    {
      "domain_name": "WEBSCRAPING.COM",
      "registrar": "GoDaddy.com, LLC",
      "whois_server": "whois.godaddy.com",
      "referral_url": null,
      "updated_date": [
        "2013-08-20 08:08:30",
        "2013-08-20 08:08:29"
      ],
      "creation_date": "2004-06-26 18:01:19",
      "expiration_date": "2020-06-26 18:01:19",
      "name_servers": [
        "NS1.WEBFACTION.COM",
        "NS2.WEBFACTION.COM",
        "NS3.WEBFACTION.COM",
        "NS4.WEBFACTION.COM"
      ],
      "status": [
        "clientDeleteProhibited https://icann.org/epp#clientDeleteProhibited",
        "clientRenewProhibited https://icann.org/epp#clientRenewProhibited",
        "clientTransferProhibited https://icann.org/epp#clientTransferProhibited",
        "clientUpdateProhibited https://icann.org/epp#clientUpdateProhibited",
        "clientTransferProhibited http://www.icann.org/epp#clientTransferProhibited",
        "clientUpdateProhibited http://www.icann.org/epp#clientUpdateProhibited",
        "clientRenewProhibited http://www.icann.org/epp#clientRenewProhibited",
        "clientDeleteProhibited http://www.icann.org/epp#clientDeleteProhibited"
      ],
      "emails": "abuse@godaddy.com",
      "dnssec": "unsigned",
      "name": null,
      "org": null,
      "address": null,
      "city": null,
      "state": "Victoria",
      "zipcode": null,
      "country": "AU"
    }
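In the record above, some fields hold a single value (`creation_date`, `emails`) while others hold a list (`updated_date`, `status`), and several are null. Code that reads these fields may want to normalize them first. The sketch below is one possible way to do that, working on the printed string form of the record; `as_list`, `latest_date`, and `DATE_FORMAT` are hypothetical names, and the assumption that dates arrive as `YYYY-MM-DD HH:MM:SS` strings is taken from the output shown here rather than from the library's documentation.

```python
from datetime import datetime

# Trimmed copy of the record printed above, in its string form.
record = {
    'domain_name': 'WEBSCRAPING.COM',
    'updated_date': ['2013-08-20 08:08:30', '2013-08-20 08:08:29'],
    'creation_date': '2004-06-26 18:01:19',
    'expiration_date': '2020-06-26 18:01:19',
    'emails': 'abuse@godaddy.com',
    'org': None,
}

# Assumed date layout, matching the strings in the output above.
DATE_FORMAT = '%Y-%m-%d %H:%M:%S'


def as_list(value):
    """Normalize a field that may be None, a single value, or a list."""
    if value is None:
        return []
    if isinstance(value, (list, tuple)):
        return list(value)
    return [value]


def latest_date(value):
    """Return the most recent datetime in a field, or None if it is empty."""
    dates = [datetime.strptime(text, DATE_FORMAT) for text in as_list(value)]
    return max(dates) if dates else None


print(latest_date(record['updated_date']))
print(latest_date(record['expiration_date']))
print(as_list(record['org']))
```

Treating every field as a list keeps the calling code uniform, whether the registrar returned one value, several, or none.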
  • Original article: https://www.cnblogs.com/liangliangzz/p/10159980.html