  • Fixing garbled text from urllib2 in Python

    Example:

    #!/usr/bin/env python
    # -*- coding: utf-8 -*-
    import urllib2

    def main():
        url = "http://www.douban.com"
        # Browser User-Agent header
        headers = {'User-Agent':'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6'}
        req = urllib2.Request(url=url, headers=headers)
        # read() returns the raw response bytes, not decoded text
        data = urllib2.urlopen(req).read()
        print data
        return 0

    if __name__ == '__main__':
        main()
    

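    In the printed output, the Chinese characters are garbled. This typically happens when the page is served as UTF-8 while the console expects a different encoding (for example GBK on a Chinese-locale Windows system), so the raw bytes are interpreted with the wrong codec. The fix is to decode the bytes as UTF-8 and re-encode them in the local encoding: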

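    #!/usr/bin/env python
    # -*- coding: utf-8 -*-
    import sys
    import urllib2

    # Encoding used by the local system (renamed from `type`, which shadowed the builtin)
    local_encoding = sys.getfilesystemencoding()

    def main():
        url = "http://www.douban.com"
        # Browser User-Agent header
        headers = {'User-Agent':'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6'}
        req = urllib2.Request(url=url, headers=headers)
        data = urllib2.urlopen(req).read()
        # Decode the UTF-8 response bytes to unicode, then re-encode for the local system.
        # This assumes the page really is UTF-8.
        print data.decode("UTF-8").encode(local_encoding)
        return 0

    if __name__ == '__main__':
        main()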
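
    The fix above hard-codes UTF-8 as the page encoding. As a rough sketch of a more general variant, the charset could be read from the response's Content-Type header instead. The helper names below (charset_from_content_type, decode_body, encode_for_console) are hypothetical and are not part of urllib2. The sketch works on plain byte strings, so it does not need network access:

```python
# -*- coding: utf-8 -*-
import re
import sys


def charset_from_content_type(content_type, default="utf-8"):
    """Return the charset named in a Content-Type header value, or `default`."""
    match = re.search(r"charset=\s*[\"']?([\w.:-]+)", content_type or "", re.IGNORECASE)
    return match.group(1).lower() if match else default


def decode_body(raw, content_type):
    """Decode raw response bytes into text using the declared charset."""
    charset = charset_from_content_type(content_type)
    try:
        return raw.decode(charset)
    except (LookupError, UnicodeDecodeError):
        # Unknown or incorrect charset label: fall back to UTF-8 and
        # substitute a replacement character for undecodable bytes.
        return raw.decode("utf-8", "replace")


def encode_for_console(text, encoding=None):
    """Encode text for the local console; unencodable characters become '?'."""
    encoding = encoding or getattr(sys.stdout, "encoding", None) or "utf-8"
    return text.encode(encoding, "replace")


# Simulated response: GBK-encoded bytes plus the header a server might send.
raw = u"\u8c46\u74e3".encode("gbk")
text = decode_body(raw, "text/html; charset=GBK")
console_bytes = encode_for_console(text, "utf-8")
```

    With urllib2, the header value would presumably come from the response object, e.g. resp.info().getheader('Content-Type'), and the bytes from resp.read(). Passing "replace" as the error handler trades exactness for robustness: a character the console cannot represent is shown as '?' instead of raising UnicodeEncodeError.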
    

    For how to use Python's encode and decode, see: http://blog.csdn.net/xyw_blog/article/details/40188037
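
    As a small illustration of the two operations (a sketch, not taken from the linked article): encode turns text into bytes under a given codec, decode turns bytes back into text, and garbling appears when the bytes are decoded with a codec other than the one that produced them. The sample string and variable names are chosen only for this example:

```python
# -*- coding: utf-8 -*-
text = u"\u8c46\u74e3"  # two Chinese characters, written as escapes

utf8_bytes = text.encode("utf-8")  # text -> bytes, as a UTF-8 server would send them
gbk_bytes = text.encode("gbk")     # same text, different byte sequence

# Decoding with the codec that produced the bytes recovers the text.
assert utf8_bytes.decode("utf-8") == text
assert gbk_bytes.decode("gbk") == text

# Decoding UTF-8 bytes as GBK does not recover the text: this is the garbling.
garbled = utf8_bytes.decode("gbk", "replace")
assert garbled != text

# Transcoding always goes through text: decode with the source codec,
# then encode with the target codec.
transcoded = utf8_bytes.decode("utf-8").encode("gbk")
assert transcoded == gbk_bytes
```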


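    This is an original article by xyw_Eliot. Please credit the source when reposting: http://blog.csdn.net/xyw_blog/article/details/40187913

    Copyright notice: This is an original blog post and may not be reposted without permission.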

  • Original URL: https://www.cnblogs.com/gcczhongduan/p/4616154.html