zoukankan      html  css  js  c++  java
  • spider

    from lxml import etree
    import requests
    def getHtml(html):
    novelcontent = requests.get(html).content
    return etree.HTML(novelcontent)

    source = getHtml("http://www.cabintu.com")

    listclassify = source.xpath('//ul[@class="sg_menu"]/li/a')
    listtype = source.xpath('//div[@class="mainleft"]/ul[@class="sg_menu"]/li[@class="section"]//ul[@class="subnav_a"]/li[@class="airline"]/a')

    for i in range(0,len(listclassify)-1):
    fname = source.xpath('//div[@class="mainleft"]/ul[@class="sg_menu"]/li[@class="section"]/a/text()')[i]
    print fname



    for n in range(0,len(listtype)-1):
    typelist = source.xpath('//div[@class="mainleft"]/ul[@class="sg_menu"]/li[@class="section"]//ul[@class="subnav_a"]/li[@class="airline"]/a/text()')[n]
    print typelist



    # for n in range(0,)


    # ftypelist = source.xpath('//div[@class="mainleft"]/ul[@class="sg_menu"]/li[@class="section"]/ul[@class="subnav_a"]/li[@class="airline"]/a/text()')[i]
  • 相关阅读:
    直方图均衡
    k-means聚类方法
    核函数
    支持向量机(SVM)
    函数的定义和调用
    ES5新增方法
    继承
    构造函数和原型
    面向对象版tab 栏切换
    ES6中的对象与类
  • 原文地址:https://www.cnblogs.com/cutepython/p/6102824.html
Copyright © 2011-2022 走看看