Hive的分桶

时间：2018-08-14 14:24:55 阅读：177 评论：0 收藏：0 [点我收藏+]

含义：实质是将数据分成不同的文件。hive中的分桶和hadoop中的reduce个数相同。

　　首先设置采用分桶：

　　　　hive>set hive.enforce.bucketing=true;

　　　　hive>create table buckets_users(id int,name string) clustered by (id) into 4 buckets;

　　　　hive>insert overwrite table buckets_users select * from student;

　　　　　　说明：按照ID分桶，分为4个桶，buckets_users 和 student表的字段要相同。

原文地址：https://www.cnblogs.com/lyr999736/p/9473893.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

周排行