如何获取一个页面的所有的a链接
####
使用 Beautiful Soup 的方式
import requests
from bs4 import BeautifulSoup
from ezpymysql import Connection

# Database connection used to store the scraped links.
# NOTE(review): credentials are hard-coded here; move them to config/env
# before sharing or deploying this script.
db = Connection(
    'localhost',
    'spider_test',
    'root',
    'Ji10201749',
)

# Fetch the Sina news homepage.
# Renamed from `re` to avoid shadowing the stdlib `re` module.
response = requests.get("https://news.sina.com.cn/")
soup = BeautifulSoup(response.content, "html.parser")

# Insert one row per anchor tag: the visible link text as the subject
# and the href attribute as the URL.
for anchor in soup.find_all("a"):
    record = {
        # BUGFIX: the original swapped these two fields —
        # 'subject' was given the href and 'url' the link text.
        'subject': anchor.text,
        'url': anchor.get("href", ""),
    }
    db.table_insert('news_sina_spider', record)
#####