标签:
编程工程
$ cd ./anthelion/anthelion/target/classes
$ java -Xmx15G -cp ../Anthelion-1.0.0-jar-with-dependencies.jar com.yahoo.research.robme.anthelion.simulation.CCFakeCrawler ./index ./network ./label ../../config/baseline.properties result.log
The files which we used to measure the performance when crawling for HTML pages including Microdata, Microformats and RDFa can be found on the dedicated page of the WebDataCommons project: http://webdatacommons.org/structureddata/anthelion/
标签:
原文地址:http://www.cnblogs.com/timdes/p/5171148.html