The skills to transfer data between external systems and your cluster.
Convert a set of data values in a given format stored in HDFS into new data values and/or a new data format, and write them back into HDFS. This includes writing Spark applications in both Scala and Python.
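As a rough illustration of this kind of task, here is a minimal PySpark sketch that reads comma-separated text from HDFS and writes it back as tab-separated text. The record layout, the upper-casing of the last field, the application name, and the two path arguments are all assumptions made for the example and do not come from the exam description. It also assumes a Spark 2.x or later installation where `SparkSession` is available.

```python
import sys


def to_tsv(line):
    """Turn one comma-separated record into a tab-separated one.

    The last field is upper-cased as a stand-in for a "new data value".
    """
    fields = [f.strip() for f in line.split(",")]
    fields[-1] = fields[-1].upper()
    return "\t".join(fields)


def run_job(input_path, output_path):
    # Imported lazily so to_tsv can be exercised without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("csv-to-tsv").getOrCreate()
    lines = spark.sparkContext.textFile(input_path)
    # Skip blank lines, convert each record, and write the result to HDFS.
    lines.filter(lambda l: l.strip()).map(to_tsv).saveAsTextFile(output_path)
    spark.stop()


# Runs only when both paths are supplied, e.g. via spark-submit.
if __name__ == "__main__" and len(sys.argv) == 3:
    run_job(sys.argv[1], sys.argv[2])
```

Keeping the per-record logic in a plain function makes it possible to check the conversion locally before submitting the job with something like `spark-submit job.py <input dir> <output dir>`, where both directories are hypothetical HDFS paths.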
Use DDL (Data Definition Language) to create tables in the Hive metastore for use by Hive and Impala.
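A small sketch of what such a DDL statement could look like, built here as a Python string so it can be inspected before it is run. The table name, column list, and HDFS location are hypothetical, and the choice of an external table over tab-delimited text files is an assumption for the example.

```python
def create_table_ddl(table, columns, location):
    """Build a CREATE EXTERNAL TABLE statement for tab-delimited text files."""
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n  {cols}\n)\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY '\\t'\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{location}'"
    )


# Hypothetical table over files already sitting in HDFS.
ddl = create_table_ddl(
    "orders",
    [("order_id", "INT"), ("customer", "STRING"), ("status", "STRING")],
    "/user/hypothetical/orders_tsv",
)
print(ddl)
```

The printed statement can be pasted into the Hive shell or Beeline. If the table is created through Hive, Impala typically needs an `INVALIDATE METADATA` on that table before it can query it.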
CCA Spark and Hadoop Developer certification skills [2016: aiming for the peak with Hadoop]
Original article: http://blog.csdn.net/mrcharles/article/details/50444551