
webmagic: initializing startRequests

Date: 2017-04-07 00:30:36


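The Spider class has three methods that can initialize startRequests: the constructor, startUrls, and startRequest. Each of them is a place that can be extended.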

    /**
     * create a spider with pageProcessor.
     *
     * @param pageProcessor pageProcessor
     */
    public Spider(PageProcessor pageProcessor) {
        this.pageProcessor = pageProcessor;
        this.site = pageProcessor.getSite();
        this.startRequests = pageProcessor.getSite().getStartRequests();
    }

    /**
     * Set startUrls of Spider.<br>
     * Takes priority over the startUrls of Site.
     *
     * @param startUrls startUrls
     * @return this
     */
    public Spider startUrls(List<String> startUrls) {
        checkIfRunning();
        this.startRequests = UrlUtils.convertToRequests(startUrls);
        return this;
    }

    /**
     * Set startRequests of Spider.<br>
     * Takes priority over the startUrls of Site.
     *
     * @param startRequests startRequests
     * @return this
     */
    public Spider startRequest(List<Request> startRequests) {
        checkIfRunning();
        this.startRequests = startRequests;
        return this;
    }
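
The precedence between these three entry points can be sketched with a small stand-alone model. This is only an illustration, not webmagic itself: `MiniSite`, `MiniSpider`, `markRunning`, and `currentStartUrls` are hypothetical stand-ins, plain URL strings take the place of `Request` objects, and the behavior of `checkIfRunning` (rejecting changes once the spider has started) is an assumption based on its name.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for Site: it only holds default start URLs.
class MiniSite {
    private final List<String> startUrls = new ArrayList<>();

    MiniSite addStartUrl(String url) {
        startUrls.add(url);
        return this;
    }

    List<String> getStartUrls() {
        return startUrls;
    }
}

// Hypothetical stand-in for Spider, reduced to the start-URL handling.
class MiniSpider {
    private List<String> startUrls;
    private boolean running = false;

    // Entry point 1: the constructor takes its defaults from the site.
    MiniSpider(MiniSite site) {
        this.startUrls = site.getStartUrls();
    }

    // Entry point 2: an explicit list replaces the site's defaults.
    // Entry point 3 (startRequest) would follow the same pattern.
    MiniSpider startUrls(List<String> urls) {
        checkIfRunning();
        this.startUrls = new ArrayList<>(urls);
        return this;
    }

    // Simulates the spider having been started.
    void markRunning() {
        running = true;
    }

    List<String> currentStartUrls() {
        return startUrls;
    }

    // Assumed behavior: changes are rejected once the spider is running.
    private void checkIfRunning() {
        if (running) {
            throw new IllegalStateException("Spider is already running!");
        }
    }
}

class StartRequestsDemo {
    public static void main(String[] args) {
        MiniSite site = new MiniSite().addStartUrl("http://example.com/from-site");
        MiniSpider spider = new MiniSpider(site);
        System.out.println(spider.currentStartUrls()); // [http://example.com/from-site]

        spider.startUrls(Arrays.asList("http://example.com/from-spider"));
        System.out.println(spider.currentStartUrls()); // [http://example.com/from-spider]

        spider.markRunning();
        try {
            spider.startUrls(Arrays.asList("http://example.com/too-late"));
        } catch (IllegalStateException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

In this model the value set last before the spider starts is the one that is used, which matches the "takes priority over Site" wording in the Javadoc above.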

 


Original article: http://www.cnblogs.com/guazi/p/6676189.html
