zoukankan      html  css  js  c++  java
  • Xpath素材

    from lxml import etree
    text = """
    <div>
        <ul>
            <li class="item-0"><a href="link1.html">first item</a></li>
            <li class="item-1"><a href="link2.html">second item</a></li>
            <li class="item-inactive"><a href="link3.html">third item</a></li>
            <li class="item-1"><a href="link4.html">fourth item</a></li>
            <li class="item-0"><a href="link5.html">fifth item</a>
        </ul>
    </div>
    """
    html = etree.HTML(text)
    result = etree.tostring(html,pretty_print=True).decode('utf-8')
    print(result)
    from lxml import etree
    text = """
    <div>
        <ul>
            <li class="item-0"><a href="link1.html">first item</a></li>
            <li class="item-1"><a href="link2.html">second item</a></li>
            <li class="item-inactive"><a href="link3.html"><span class="bold">third item</span>></a></li>
            <li class="item-1"><a href="link4.html">fourth item</a></li>
            <li class="item-0"><a href="link5.html">fifth item</a></li>
        </ul>
    </div>
    """
    # 初始化一个Xpath解析对象
    html = etree.HTML(text)
    # 解析对象输出代码 是一个bytes类型
    result = etree.tostring(html,encoding='utf-8')
    print(type(html))  # <class 'lxml.etree._Element'>
    print(type(result))  # <class 'bytes'>
    print(result.decode('utf-8'))
  • 相关阅读:
    mysql 常用函数
    JSP 分页代码
    day15(Mysql学习)
    day14(编码实战-用户登录注册)
    Bootstrap第3天
    Bootstrap第2天
    Bootstrap 第一天
    day13(JSTL和自定义标签&MVC模型&javaweb三层框架)
    label 对齐
    Alert提示框之后跳转指定页面
  • 原文地址:https://www.cnblogs.com/zhangboblogs/p/10114698.html
Copyright © 2011-2022 走看看