Search code examples
javaurlapache-commons-httpclient

Why this URL produces a timeout with apache client and works with URL connection?


This simple URL http://www.cert.fnmt.es/certs/ACCOMP.crt is causing me a headache. You can see it is available, but I can not get content with Apache HTTP client

String url = "http://www.cert.fnmt.es/certs/ACCOMP.crt";

//Java URL openConnection OK
InputStream in = new URL(url).openConnection().getInputStream();
byte[] bytes = IOUtils.toByteArray(in);
System.out.println("OK? "+(bytes.length>0));

//Apache http client (SocketTimeoutException)
HttpGet httpGet = new HttpGet(url);     
HttpClient client = HttpClientBuilder.create().build();
HttpResponse response = client.execute(httpGet);                
System.out.println("OK? "+(response != null));

The exception is raised here: client.execute(httpGet);

OK? true
Exception in thread "main" java.net.SocketTimeoutException: Read timed out
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:170)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at org.apache.http.impl.io.SessionInputBufferImpl.streamRead(SessionInputBufferImpl.java:136)
    at org.apache.http.impl.io.SessionInputBufferImpl.fillBuffer(SessionInputBufferImpl.java:152)
    at org.apache.http.impl.io.SessionInputBufferImpl.readLine(SessionInputBufferImpl.java:270)
    at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:140)
    at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:57)
    at org.apache.http.impl.io.AbstractMessageParser.parse(AbstractMessageParser.java:260)
    at org.apache.http.impl.DefaultBHttpClientConnection.receiveResponseHeader(DefaultBHttpClientConnection.java:161)
    at org.apache.http.impl.conn.CPoolProxy.receiveResponseHeader(CPoolProxy.java:153)
    at org.apache.http.protocol.HttpRequestExecutor.doReceiveResponse(HttpRequestExecutor.java:271)
    at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:123)
    at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:254)
    at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:195)
    at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:86)
    at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:108)
    at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:184)
    at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:82)
    at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:106)
    at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:57)
    at test.CommonsDataLoader.main(CommonsDataLoader.java:102)

I am using Java 1.7 and apache http-client4.3.4. I do not know whether it is a bug or an incompatibility with the server. My code is based on apache httpclient and I can not hard code exceptions for 'special' sites


Solution

  • please, add

     httpGet.addHeader("User-Agent", "Mozilla/5.0");
    

    Sound like the site is filtering on user agents.