Scraping Qiushibaike with Python

import urllib.request
from bs4 import BeautifulSoup
"""
    1. Fetch all text-only jokes from Qiushibaike
    2. Save them to a local file (the script below currently only prints them)
"""
class QiuShi():
    def __init__(self):
        # Send a browser-like User-Agent; the default urllib one may be rejected
        user_agent = 'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)'
        self.headers = {'User-Agent': user_agent}

    def query(self, page=1):
        # Build the URL of the requested page in the text-only section
        self.url = 'http://www.qiushibaike.com/text/page/' + str(page)
        print(self.url)
        req = urllib.request.Request(self.url, headers=self.headers)
        html = urllib.request.urlopen(req)
        bsoup = BeautifulSoup(html, 'html.parser')
        # The script looks for joke text in <div class="content"> elements
        for content in bsoup.find_all('div', {'class': 'content'}):
            print(content.get_text())

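if __name__ == '__main__':
    qiushi = QiuShi()
    # Page numbering is assumed to start at 1 (matching query's default),
    # so fetch pages 1 through 35 instead of 0 through 34
    for i in range(1, 36):
        qiushi.query(i)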
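
The second goal listed at the top, saving the jokes to a local file, is not covered by the script, which only prints each joke. A minimal sketch of one way to do it follows. The helper name `save_jokes`, its parameters, and the blank-line separator are assumptions made for illustration; they are not part of the original post.

```python
import os
import tempfile


def save_jokes(jokes, path, mode="a"):
    """Write each non-empty joke to `path`, followed by a blank line.

    Hypothetical helper, not from the original script.
    Returns the number of jokes actually written.
    """
    written = 0
    # Set the encoding explicitly so Chinese text is stored as UTF-8
    # regardless of the platform default.
    with open(path, mode, encoding="utf-8") as f:
        for joke in jokes:
            text = joke.strip()
            if not text:
                continue  # skip empty entries
            f.write(text + "\n\n")
            written += 1
    return written


if __name__ == "__main__":
    # Demo with placeholder strings written to a temporary location
    demo_path = os.path.join(tempfile.gettempdir(), "qiushi_demo.txt")
    count = save_jokes(["  first joke \n", "", "second joke"], demo_path, mode="w")
    print(count, "jokes saved to", demo_path)
```

To connect this to the script, `query` could collect the `content.get_text()` strings into a list and pass that list to `save_jokes`. The default append mode means that jokes from successive pages accumulate in the same file.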


Original article: https://www.cnblogs.com/lkpp/p/7400043.html