  • A simple web crawler implementation

    With access permission (the site serves the page to a plain request)

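    import java.io.BufferedReader;
    import java.io.IOException;
    import java.io.InputStream;
    import java.io.InputStreamReader;
    import java.net.URL;

    public class SpiderTest {
      public static void main(String[] args) throws IOException {
        URL url = new URL("https://www.baidu.com");
        // Open a stream to the page; the bytes are decoded as UTF-8 text below.
        InputStream is = url.openStream();
        // try-with-resources closes the reader (and the stream) even if reading fails.
        try (BufferedReader br = new BufferedReader(new InputStreamReader(is, "utf-8"))) {
          String msg;
          while ((msg = br.readLine()) != null) {
            System.out.println(msg);
          }
        }
      }
    }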
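
    The read loop above prints each line as it arrives. A crawler usually needs the page as a value it can parse afterwards, so a minimal sketch of that variation follows. The class name PageReader and the helper readAll are hypothetical names introduced here, not part of the original post, and the page is assumed to be UTF-8 encoded.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.net.URL;
import java.nio.charset.StandardCharsets;

class PageReader {
  // Hypothetical helper: reads the whole stream as UTF-8 text,
  // terminating every line with '\n', and closes the stream when done.
  static String readAll(InputStream in) throws IOException {
    StringBuilder sb = new StringBuilder();
    try (BufferedReader br = new BufferedReader(
        new InputStreamReader(in, StandardCharsets.UTF_8))) {
      String line;
      while ((line = br.readLine()) != null) {
        sb.append(line).append('\n');
      }
    }
    return sb.toString();
  }

  public static void main(String[] args) throws IOException {
    // Same URL as in the example above.
    String html = readAll(new URL("https://www.baidu.com").openStream());
    System.out.println(html.length() + " characters downloaded");
  }
}
```

    Because readAll takes any InputStream, it can be exercised with an in-memory stream as well as with a network stream.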

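    Without access permission (the request sends a browser-like User-Agent header so that the site accepts it)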

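    import java.io.BufferedReader;
    import java.io.IOException;
    import java.io.InputStreamReader;
    import java.net.HttpURLConnection;
    import java.net.URL;

    public class SpiderTest2 {
      public static void main(String[] args) throws IOException {
        URL url = new URL("https://www.jd.com");
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestMethod("GET");
        // Send a browser User-Agent so the server treats the request like a browser visit.
        // The "…" marks an elision in the original post; use your browser's full string.
        conn.setRequestProperty("User-Agent", "Mozilla/5.0 (Windows NT 6.1; W…) Gecko/20100101 Firefox/67.0");
        // Read from conn, not url.openStream(): the latter opens a second
        // connection that does not carry the User-Agent header.
        try (BufferedReader br = new BufferedReader(new InputStreamReader(conn.getInputStream(), "utf-8"))) {
          String msg;
          while ((msg = br.readLine()) != null) {
            System.out.println(msg);
          }
        }
      }
    }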
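
    When a site refuses a request, it helps to look at the HTTP status code before reading the body. The sketch below is one possible way to do that with HttpURLConnection. The class name SpiderTest3, the helper open, the 5000 ms timeouts, and the placeholder User-Agent "Mozilla/5.0" are all assumptions made for illustration and do not come from the original post.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

class SpiderTest3 {
  // Hypothetical helper: configures a GET connection but does not send it yet.
  static HttpURLConnection open(String address, String userAgent) throws IOException {
    HttpURLConnection conn = (HttpURLConnection) new URL(address).openConnection();
    conn.setRequestMethod("GET");
    conn.setRequestProperty("User-Agent", userAgent);
    conn.setConnectTimeout(5000); // milliseconds; assumed value
    conn.setReadTimeout(5000);    // milliseconds; assumed value
    return conn;
  }

  public static void main(String[] args) throws IOException {
    // Placeholder User-Agent; substitute your browser's real string.
    HttpURLConnection conn = open("https://www.jd.com", "Mozilla/5.0");
    try {
      int status = conn.getResponseCode(); // this call sends the request
      if (status != HttpURLConnection.HTTP_OK) {
        System.err.println("Request failed with HTTP status " + status);
        return;
      }
      try (BufferedReader br = new BufferedReader(
          new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
        String msg;
        while ((msg = br.readLine()) != null) {
          System.out.println(msg);
        }
      }
    } finally {
      conn.disconnect();
    }
  }
}
```

    Keeping the configuration in open separates the setup (method, header, timeouts) from the network call, so the settings can be inspected without contacting the server.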

  • Original article: https://www.cnblogs.com/5aixin/p/11094702.html