zoukankan      html  css  js  c++  java
  • webmagic 初始化 startRequests

    在spider类中有三个方法可以初始化startRequests。可以对这些地方进行扩展。

        /**
         * create a spider with pageProcessor.
         *
         * @param pageProcessor pageProcessor
         */
        public Spider(PageProcessor pageProcessor) {
            this.pageProcessor = pageProcessor;
            this.site = pageProcessor.getSite();
            this.startRequests = pageProcessor.getSite().getStartRequests();
        }
    
        /**
         * Set startUrls of Spider.<br>
         * Prior to startUrls of Site.
         *
         * @param startUrls startUrls
         * @return this
         */
        public Spider startUrls(List<String> startUrls) {
            checkIfRunning();
            this.startRequests = UrlUtils.convertToRequests(startUrls);
            return this;
        }
    
        /**
         * Set startUrls of Spider.<br>
         * Prior to startUrls of Site.
         *
         * @param startRequests startRequests
         * @return this
         */
        public Spider startRequest(List<Request> startRequests) {
            checkIfRunning();
            this.startRequests = startRequests;
            return this;
        }
  • 相关阅读:
    B
    R
    C
    B
    异步解决方案----Promise与Await
    NPM 与 Nodejs
    借助node.js + mysql 学习基础ajax~
    bind、call、apply的区别与实现原理
    私有 npm 仓库的搭建
    学习 Promise,掌握未来世界 JS 异步编程基础
  • 原文地址:https://www.cnblogs.com/guazi/p/6676189.html
Copyright © 2011-2022 走看看