  • Query data and crawl article content from the link URLs with jsoup

    Query the stored rows from the database, then crawl the article content from each row's link URL.

    // Pages through the stored links, crawls each article, and writes its text back to MySQL.
    protected void doGet(HttpServletRequest request, HttpServletResponse response) throws ServletException, IOException {
        int pageSize = 100;
        // Process pages 1..100, i.e. at most 10,000 rows.
        for (int pageNum = 1; pageNum <= 100; pageNum++) {
            try {
                // Row offset of the current page.
                int offset = (pageNum - 1) * pageSize;
                // Each entry maps a record key to the URL stored for it.
                Map<Integer, String> urlsByKey = ManageMySQL.getPageData(offset, pageSize);
                for (Map.Entry<Integer, String> entry : urlsByKey.entrySet()) {
                    System.out.println(entry.getKey() + "  " + entry.getValue());
                    // Fetch the article text and strip all spaces before storing it.
                    String content = getContentByURL(entry.getValue()).replace(" ", "");
                    ManageMySQL.updateContext(entry.getKey(), content);
                }
            } catch (Exception e) {
                // A failure aborts the rest of this page; the loop continues with the next page.
                e.printStackTrace();
            }
        }
    }
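
The handler above always requests 100 pages, and because its try block wraps a whole page, one failing URL skips the remaining rows of that page. Below is a minimal sketch of an alternative loop that stops at the first empty page and skips only the URL that failed. `PageWalker`, `offsetFor`, `crawlAll`, `fetchPage`, and `fetchContent` are hypothetical names used for illustration. The two function parameters stand in for `ManageMySQL.getPageData` and `getContentByURL`, whose implementations the original post does not show, so the sketch runs against in-memory data instead of MySQL and jsoup.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiFunction;
import java.util.function.Function;

class PageWalker {
    // Row offset of a 1-based page number, as computed in the handler above.
    static int offsetFor(int pageNum, int pageSize) {
        return (pageNum - 1) * pageSize;
    }

    // Walks pages until one comes back empty or maxPages is reached.
    // fetchPage is a stand-in for ManageMySQL.getPageData(offset, pageSize);
    // fetchContent is a stand-in for getContentByURL(url).
    // Returns key -> cleaned content for every row that was fetched successfully.
    static Map<Integer, String> crawlAll(
            BiFunction<Integer, Integer, Map<Integer, String>> fetchPage,
            Function<String, String> fetchContent,
            int pageSize,
            int maxPages) {
        Map<Integer, String> results = new LinkedHashMap<>();
        for (int pageNum = 1; pageNum <= maxPages; pageNum++) {
            Map<Integer, String> page = fetchPage.apply(offsetFor(pageNum, pageSize), pageSize);
            if (page == null || page.isEmpty()) {
                break; // no more rows to process
            }
            for (Map.Entry<Integer, String> entry : page.entrySet()) {
                try {
                    // Same cleanup as the original: remove every space character.
                    String content = fetchContent.apply(entry.getValue()).replace(" ", "");
                    results.put(entry.getKey(), content);
                } catch (RuntimeException ex) {
                    // Skip only this URL; the rest of the page is still processed.
                    System.err.println("Skipping " + entry.getValue() + ": " + ex.getMessage());
                }
            }
        }
        return results;
    }

    public static void main(String[] args) {
        // In-memory stand-in for the table of links: keys 1..5.
        List<String> urls = Arrays.asList("u1", "u2", "bad", "u4", "u5");
        BiFunction<Integer, Integer, Map<Integer, String>> fetchPage = (offset, size) -> {
            Map<Integer, String> page = new LinkedHashMap<>();
            for (int i = offset; i < Math.min(offset + size, urls.size()); i++) {
                page.put(i + 1, urls.get(i));
            }
            return page;
        };
        Function<String, String> fetchContent = url -> {
            if (url.equals("bad")) {
                throw new IllegalStateException("fetch failed");
            }
            return "content of " + url;
        };
        System.out.println(crawlAll(fetchPage, fetchContent, 2, 100));
    }
}
```

In the servlet, the results could be written back with `ManageMySQL.updateContext(key, content)` inside the inner loop instead of being collected in a map. Collecting them here only keeps the sketch free of database access.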
  • Original article: https://www.cnblogs.com/herd/p/11716503.html