Tags: spark
build/mvn -Pyarn -Phadoop-2.4 -Dhadoop.version=2.4.0 -DskipTests clean package
./bin/pyspark
>>> textFile = sc.textFile("README.md")
>>> textFile.count()  # Number of items in this RDD
126
>>> textFile.first()  # First item in this RDD
Original article: http://blog.csdn.net/wangjuntytl/article/details/46046417