  • Using HttpClient correctly to avoid large numbers of TCP connections stuck in CLOSE_WAIT

    As I discovered when debugging TCP connections stuck in the CLOSE_WAIT state for one of our customers, we were using HttpClient incorrectly. We’re not alone in this case, as you’ll find out if you google HttpClient CLOSE_WAIT, but it’s very non-intuitive. Even the official tutorial is wrong, so I’m describing the issue here.

    Apache HttpClient is usually used like this in basic mode:

    HttpClient httpClient = new HttpClient();
    HttpMethod method = new GetMethod(uri);
    try {
        int statusCode = httpClient.executeMethod(method);
        byte[] responseBody = method.getResponseBody();
        // ...
        return stuff;
    } finally {
        // Hands the connection back to the HttpClient instance; does not close the socket.
        method.releaseConnection();
    }

    But this is not enough.

    The issue is that releasing the connection makes it available again to the HttpClient instance, but does not close it: HTTP 1.1 is used, so the connection is kept alive and further requests to the same host:port can reuse it.

    Even though the server may have decided to close its end of the connection, on our client side the connection is still open and will stay that way until an attempt to read from it is made (at which point the client will detect that the other end is closed). This is how TCP works: a connection can be half-closed, because close() actually just means "I will not send any more data", and you can still receive data from a connection that you closed but which has not yet been closed on the other end.
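    The behaviour described above can be reproduced without HttpClient at all. The following is a minimal sketch using only java.net sockets on the loopback interface; the class name HalfCloseDemo, the Observation holder and the 200 ms pause are illustrative assumptions, not part of HttpClient or of the original code. A local "server" accepts a connection and immediately closes its end, and the client then inspects its own socket.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;

public class HalfCloseDemo {

    /** What the client sees after the server has closed its end (hypothetical holder type). */
    static final class Observation {
        final boolean closedBeforeRead;    // Socket.isClosed() before any read
        final boolean connectedBeforeRead; // Socket.isConnected() before any read
        final int readResult;              // value returned by InputStream.read()

        Observation(boolean closedBeforeRead, boolean connectedBeforeRead, int readResult) {
            this.closedBeforeRead = closedBeforeRead;
            this.connectedBeforeRead = connectedBeforeRead;
            this.readResult = readResult;
        }
    }

    static Observation observe() throws IOException, InterruptedException {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        // Port 0 asks the OS for any free port.
        try (ServerSocket server = new ServerSocket(0, 1, loopback)) {
            Socket client = new Socket(loopback, server.getLocalPort());
            try {
                // The "server" accepts the connection and closes its end right away.
                server.accept().close();
                // Assumed to be long enough for the server's FIN to arrive on loopback.
                Thread.sleep(200);

                // The client-side Socket object does not notice anything yet.
                boolean closed = client.isClosed();
                boolean connected = client.isConnected();

                // Only a read reveals that the peer is gone: read() reports end of stream.
                int read = client.getInputStream().read();
                return new Observation(closed, connected, read);
            } finally {
                // Until this close, the client socket is the one sitting in CLOSE_WAIT.
                client.close();
            }
        }
    }

    public static void main(String[] args) throws Exception {
        Observation o = observe();
        System.out.println("isClosed before read:    " + o.closedBeforeRead);
        System.out.println("isConnected before read: " + o.connectedBeforeRead);
        System.out.println("read() returned:         " + o.readResult);
    }
}
```

    Between the server's close and the client's own close(), a tool such as netstat should list the client socket in CLOSE_WAIT. A pooled HttpClient connection that nobody reads from or closes stays in that same window indefinitely.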

    So what happens then is that when the HttpClient instance goes out of scope it becomes available to the GC, but it will not be garbage collected immediately. Until the GC collects it, the socket connection held internally will stay open. If the server has already closed its end, that socket is stuck in the CLOSE_WAIT state, which is the TCP state of a socket whose peer has closed but which the local application has not closed yet.

    To fix this, the simplest way is to add:

    method.setRequestHeader("Connection", "close");

    before executing the method. This will instruct HttpClient to close the connection by itself once the full response has been received. Another way is to add this call in the finally block, after method.releaseConnection():

    httpClient.getHttpConnectionManager().closeIdleConnections(0);

    An even better way is to not use a new HttpClient object each time, but to reuse one that has been initialized with a MultiThreadedHttpConnectionManager sized appropriately. Of course in this case the connection manager must be shut down properly when the application shuts down:

    private MultiThreadedHttpConnectionManager connectionManager;
    private HttpClient httpClient;

    public void init() {
        connectionManager = new MultiThreadedHttpConnectionManager();
        // ... configure connectionManager ...
        httpClient = new HttpClient(connectionManager);
    }

    public void shutdown() {
        // Closes all pooled connections; call once, when the application stops.
        connectionManager.shutdown();
    }

    public String process(String uri) {
        HttpMethod method = new GetMethod(uri);
        try {
            int statusCode = httpClient.executeMethod(method);
            byte[] responseBody = method.getResponseBody();
            // ...
            return stuff;
        } finally {
            // Returns the connection to the shared pool for reuse.
            method.releaseConnection();
        }
    }

    Florent

    PS: I’m using here the APIs from HttpClient 3, but it also applies with slightly different names to the completely refactored APIs of HttpClient 4.
  • Original article: https://www.cnblogs.com/fwwdn/p/4493261.html