使用DOM方法来遍历一个文档

使用DOM方法来遍历一个文档
问题

你有一个HTML文档要从中提取数据，并了解这个HTML文档的结构。

方法

将HTML解析成一个Document之后，就可以使用类似于DOM的方法进行操作。示例代码：
File input = new File("/tmp/input.html"); Document doc = Jsoup.parse(input, "UTF-8", "http://example.com/"); Element content = doc.getElementById("content"); Elements links = content.getElementsByTag("a"); for (Element link : links) { String linkHref = link.attr("href"); String linkText = link.text(); }
说明

Elements这个对象提供了一系列类似于DOM的方法来查找元素，抽取并处理其中的数据。具体如下：

查找元素
- getElementById(String id)
- getElementsByTag(String tag)
- getElementsByClass(String className)
- getElementsByAttribute(String key) (and related methods)
- Element siblings: siblingElements(), firstElementSibling(), lastElementSibling(); nextElementSibling(), previousElementSibling()
- Graph: parent(), children(), child(int index)
元素数据
- attr(String key)获取属性attr(String key, String value)设置属性
- attributes()获取所有属性
- id(), className() and classNames()
- text()获取文本内容text(String value) 设置文本内容
- html()获取元素内HTMLhtml(String value)设置元素内的HTML内容
- outerHtml()获取元素外HTML内容
- data()获取数据内容（例如：script和style标签)
- tag() and tagName()
操作HTML和文本
查看全文

相关阅读:
Redis学习篇（一）之String类型及其操作
 MySQL笔记（五）之表的连接
 MySQL笔记（三）之数据插入更新与删除
 MySQL笔记（四）之内建函数
 MySQL笔记（二）之数据检索常用关键字
 MySQL笔记（一）之新建数据库和数据表
 京东文胸数据分析
 用SpringSecurity从零搭建pc项目-02
Spring Security构建Rest服务-0800-Spring Security图片验证码
 用SpringSecurity从零搭建pc项目-01

原文地址：https://www.cnblogs.com/deityjian/p/12541594.html

使用DOM方法来遍历一个文档

问题

方法

说明

查找元素

元素数据

操作HTML和文本