zoukankan      html  css  js  c++  java
  • 爬虫基础之Request类


    categories:

    • 爬虫
      tags:
    • Request
    • 模拟请求拉钩网

    request.Request类

    如果想要在请求的时候增加一些请求头,用request.Request

    from urllib import request
    headers = {
    	'userAgent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36'
    }
    req = request.Request("http://www.baidu.com/",headers = headers)
    resp = request.urlopen(req)
    print(resp.read()[:10])
    #返回结果如下
    b'<!DOCTYPE '
    

    模拟请求拉钩网站

    from urllib import request,parse
    headers = {
    	'userAgent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36',
        'Referer':'https://www.lagou.com/jobs/list_python?labelWords=&fromSearch=true&suginput=',
    }
    url = 'https://www.lagou.com/jobs/list_python?labelWords=&fromSearch=true&suginput='
    data = {
        'first':'true',
        'pn':1,
        'kd':'python'
    }
    req = request.Request(url,headers=headers,data=parse.urlencode(data).encode('utf-8'),method='POST')
    resp = request.urlopen(req)
    print(resp.read().decode('utf-8'))
    
  • 相关阅读:
    团队项目-BUG挖掘
    评论任务
    4-14结对-复利计算
    做汉堡-结对
    复利计算--结对
    input上传按钮的优化
    avalon.js与 ajax使用的一个错误实例
    去除list集合中重复项的几种方法
    mvc学习记录
    常用js正则
  • 原文地址:https://www.cnblogs.com/anyux/p/12203062.html
Copyright © 2011-2022 走看看