zoukankan      html  css  js  c++  java
  • python 中使用 urllib2 伪造 http 报头的2个方法

    方法1、

    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    #!/usr/bin/python
    # -*- coding: utf-8 -*-
    #encoding=utf-8
    #Filename:urllib2-header.py
      
    import urllib2
    import sys
      
    #抓取网页内容-发送报头-1
    send_headers = {
     'Host':'www.jb51.net',
     'User-Agent':'Mozilla/5.0 (Windows NT 6.2; rv:16.0) Gecko/20100101 Firefox/16.0',
     'Accept':'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
     'Connection':'keep-alive'
    }
      
    req = urllib2.Request(url,headers=send_headers)
    r = urllib2.urlopen(req)
      
    html = r.read()        #返回网页内容
    receive_header = r.info()     #返回的报头信息
      
    # sys.getfilesystemencoding()
    html = html.decode('utf-8','replace').encode(sys.getfilesystemencoding()) #转码:避免输出出现乱码
      
    print receive_header
    # print '####################################'
    print html

    方法2、

    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    #!/usr/bin/python
    # -*- coding: utf-8 -*-
    #encoding=utf-8
    #Filename:urllib2-header.py
      
    import urllib2
    import sys
      
      
    req = urllib2.Request(url)
    req.add_header('Referer','http://www.jb51.net/')
    req.add_header('User-Agent','Mozilla/5.0 (Windows NT 6.2; rv:16.0) Gecko/20100101 Firefox/16.0')
    r = urllib2.urlopen(req)
      
    html = r.read()
    receive_header = r.info()
      
    html = html.decode('utf-8').encode(sys.getfilesystemencoding())
      
    print receive_header
    print '#####################################'
    print html
  • 相关阅读:
    centos7安装jdk8
    centos7安装mysql5.7.31
    docker打包,运行springboot
    centos7安装docker
    ps学习记录
    Html的学习(二)
    tensorflow C++接口调用图像分类pb模型代码
    tensorflow C++接口调用目标检测pb模型代码
    ubuntu14 编译tensorflow C++ 接口
    Python opencv计算批量图片的BGR各自的均值
  • 原文地址:https://www.cnblogs.com/wangjq19920210/p/9016851.html
Copyright © 2011-2022 走看看