zoukankan      html  css  js  c++  java
  • Web_Scraping Techniques

    web_scraping_package.py

    from bs4 import BeautifulSoup
    import requests
    session = requests.Session()
    headers = {
    'User-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.149 Safari/537.36',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.9'
    }
    import sys
    sys.path

    获取python的path

    ['', '/usr/lib/python36.zip', '/usr/lib/python3.6', '/usr/lib/python3.6/lib-dynload', '/home/christopher/.local/lib/python3.6/site-packages', '/usr/local/lib/python3.6/dist-packages', '/usr/lib/python3/dist-packages']

    这里我们把

    web_scraping_package.py

    放到

    /home/christopher/.local/lib/python3.6/site-packages

    目录下

    以后就可以直接import

    from web_scraping_package import session, headers, BeautifulSoup

    就不用再写一大串导入文件了。

  • 相关阅读:
    webp怎么打开 webp怎么转换成jpg
    波浪运动
    缓动
    动画的封装
    单张滑动tab 组件
    明星单品tab
    多个tab选项卡
    下拉框
    购物车css样式效果
    菜单导航兼容和不兼容捕获方法
  • 原文地址:https://www.cnblogs.com/profesor/p/12938829.html
Copyright © 2011-2022 走看看