beautifulsoup - 走看看

zoukankan html css js c++ java

beautifulsoup

1、安装

pip install beautifulsoup4

2、

from bs4 import BeautifulSoup

html = BeautifulSoup(page_source,features='html.parser')
这个parser取决于我们要解析哪种网页，比如xml, lxml, html

3、如何查找元素和标签？

html.find(name=None, attrs={}, recursive=True, text=None,**kwargs):

name是标签名，如a标签，div, script等

attrs可以根据id, class, name 等等进行查找， text是标签里的text

并且还有html.findall() find.next(), findparent等

查看全文

相关阅读:
UVA 10905
UVA 10859 树形DP
LA 4794 状态DP+子集枚举
 LA 3695 部分枚举
 UVA 11825 状态压缩DP+子集思想
 UVA 10891 区间DP+博弈思想
 HDU 5239 上海大都会 D题（线段树+数论）
HDU 5242 上海大都会 G题
 HDU 5241 上海大都会 F题
 P1359 租用游艇

原文地址：https://www.cnblogs.com/yjybupt/p/13729904.html

Copyright © 2011-2022 走看看