  • jsoup UnsupportedMimeTypeException

    Exception in thread "main" org.jsoup.UnsupportedMimeTypeException: Unhandled content type. Must be text/*, application/xml, or application/xhtml+xml. Mimetype=application/json; charset=utf-8, URL=
    	at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:487)
    	at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:434)
    	at org.jsoup.helper.HttpConnection.execute(HttpConnection.java:181)
    	at org.jsoup.helper.HttpConnection.get(HttpConnection.java:170)
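    I ran into this error when requesting a URL with jsoup while building a Sina Weibo crawler. The server returned application/json, which jsoup refuses to parse by default. The fix is to add .ignoreContentType(true) to the connection chain:

    Jsoup.connect("http://").ignoreContentType(true).get();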


    
    

    For reference, the jsoup API documentation describes the method as follows:

    ignoreContentType
    
    Connection ignoreContentType(boolean ignoreContentType)
    Ignore the document's Content-Type when parsing the response. By default this is false, an unrecognised content-type will cause an IOException to be thrown. (This is to prevent producing garbage by attempting to parse a JPEG binary image, for example.) Set to true to force a parse attempt regardless of content type.
    Parameters:
    ignoreContentType - set to true if you would like the content type ignored on parsing the response into a Document.
    Returns:
    this Connection, for chaining
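
    To make it clearer why application/json is rejected by default, here is a minimal sketch of the rule quoted in the error message above (text/*, application/xml, or application/xhtml+xml). The class MimeCheck and the method isParseableByDefault are hypothetical names made up for illustration. This is not jsoup's actual source code, and treating a missing Content-Type as acceptable is an assumption.

```java
import java.util.Locale;

// Hypothetical helper for illustration only; it mirrors the rule stated in
// the exception message, not jsoup's real implementation.
final class MimeCheck {

    // Returns true if a Content-Type header value would pass the default
    // check described in the error message.
    static boolean isParseableByDefault(String contentType) {
        if (contentType == null) {
            return true; // assumption: no header means nothing to reject
        }
        // Drop parameters such as "; charset=utf-8" and normalise case.
        String mime = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return mime.startsWith("text/")
                || mime.equals("application/xml")
                || mime.equals("application/xhtml+xml");
    }

    public static void main(String[] args) {
        // An ordinary HTML page passes the check.
        System.out.println(isParseableByDefault("text/html; charset=utf-8"));
        // The content type from the stack trace above does not.
        System.out.println(isParseableByDefault("application/json; charset=utf-8"));
    }
}
```

    With ignoreContentType(true), jsoup skips this kind of check and attempts to parse the response regardless of its content type.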


  • Original article: https://www.cnblogs.com/CHWYH/p/5816273.html