标签:web 地方 site proc ram create gets url running
在spider类中有三个方法可以初始化startRequests。可以对这些地方进行扩展。
/** * create a spider with pageProcessor. * * @param pageProcessor pageProcessor */ public Spider(PageProcessor pageProcessor) { this.pageProcessor = pageProcessor; this.site = pageProcessor.getSite(); this.startRequests = pageProcessor.getSite().getStartRequests(); } /** * Set startUrls of Spider.<br> * Prior to startUrls of Site. * * @param startUrls startUrls * @return this */ public Spider startUrls(List<String> startUrls) { checkIfRunning(); this.startRequests = UrlUtils.convertToRequests(startUrls); return this; } /** * Set startUrls of Spider.<br> * Prior to startUrls of Site. * * @param startRequests startRequests * @return this */ public Spider startRequest(List<Request> startRequests) { checkIfRunning(); this.startRequests = startRequests; return this; }
标签:web 地方 site proc ram create gets url running
原文地址:http://www.cnblogs.com/guazi/p/6676189.html