  • Scraping the beauty picture gallery from Yesky (天极网)

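    import os
    # Module for sending HTTP requests:
    import requests
    # Module for parsing HTML:
    from bs4 import BeautifulSoup
    # Directory that contains this script:
    base_path = os.path.dirname(os.path.abspath(__file__))
    # Join that directory with the name of the image folder:
    img_path = os.path.join(base_path, "img")
    # Create the image folder if it does not exist yet (open() below fails otherwise):
    os.makedirs(img_path, exist_ok=True)
    # Fetch the gallery page:
    response = requests.get("http://pic.yesky.com/c/6_20491_1.shtml")
    # Hand the response text to bs4 for parsing:
    soup = BeautifulSoup(response.text, "html.parser")
    # Inspecting the page shows that the images sit inside <div class="lb_box">:
    div_obj = soup.find(name="div", attrs={"class": "lb_box"})
    # Find all dl tags in the div (each dl is the outer wrapper of one image); the result is a list:
    list_dl = div_obj.find_all(name="dl")
    # Loop over the dl of each image:
    for dl in list_dl:
        # Find the img tag inside the dl:
        img = dl.find(name="img")
        # Read the src attribute of the tag:
        img_src = img.get("src")
        # Request the image itself using its URL:
        img_response = requests.get(img_src)
        # Build the local path; the file name is the last segment of the URL:
        file_path = os.path.join(img_path, img_src.rsplit("/", 1)[-1])
        # Write the raw bytes of the image to disk:
        with open(file_path, "wb") as f:
            f.write(img_response.content)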
    After running the script, the downloaded images are saved in the img folder next to the script.
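
The script passes each `src` straight to `requests.get` and takes the text after the last `/` as the file name. That works when every `src` is an absolute URL without a query string. The sketch below shows one possible way to make that step more tolerant, using only the standard library. The helper names `resolve_image_url` and `filename_from_url`, the fallback name `image.jpg`, and the `example.com` URLs are hypothetical and are not part of the original script.

```python
import posixpath
from urllib.parse import unquote, urljoin, urlsplit

# Gallery page used by the script above.
PAGE_URL = "http://pic.yesky.com/c/6_20491_1.shtml"


def resolve_image_url(page_url, src):
    """Turn an img src (absolute, protocol-relative, or relative) into an absolute URL."""
    return urljoin(page_url, src.strip())


def filename_from_url(url, default="image.jpg"):
    """Return the last path segment of the URL, ignoring query string and fragment."""
    path = urlsplit(url).path
    name = posixpath.basename(unquote(path))
    # Fall back to a placeholder name when the path ends with a slash.
    return name or default


if __name__ == "__main__":
    # Hypothetical src values, only to illustrate the three cases.
    for src in ["http://example.com/a/photo.jpg?w=300", "//example.com/b.png", "thumbs/c.gif"]:
        absolute = resolve_image_url(PAGE_URL, src)
        print(absolute, "->", filename_from_url(absolute))
```

In the loop of the original script, `img_src` could be passed through `resolve_image_url` before the request, and `filename_from_url` could replace the `rsplit("/", 1)[-1]` expression when building `file_path`.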

  • Original article: https://www.cnblogs.com/zhang-da/p/12207584.html