[MapReduce_8] Implementing a Custom Partitioner in MapReduce

Date: 2018-11-06 13:32:24


0. Overview

This post covers two things: setting the number of partitions and writing a custom partitioner.

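1. Setting the number of partitions

Partition

The partition determines which Reduce task a given key is sent to.

The default is hash partitioning, which uses the following formula:

// the partition number that is returned
(key.hashCode() & Integer.MAX_VALUE) % numReduceTasks

Set the number of partitions (one per Reduce task):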

job.setNumReduceTasks(3);
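
To make the default formula concrete, here is a minimal stand-alone sketch that applies the same arithmetic with three reduce tasks. It has no Hadoop dependency; the class name `HashPartitionDemo` and the method `partitionFor` are hypothetical names chosen for illustration. It also uses `String.hashCode()` and `Integer.hashCode()`, whereas Hadoop's `Text` computes its own hash, so the partition numbers seen in a real job may differ.

```java
// Stand-alone illustration of the default hash partition arithmetic.
// HashPartitionDemo and partitionFor are hypothetical names, not Hadoop APIs.
final class HashPartitionDemo {

    // Clear the sign bit of the hash, then take the remainder by the
    // number of reduce tasks, so the result is always in [0, numReduceTasks).
    static int partitionFor(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    public static void main(String[] args) {
        int numReduceTasks = 3;

        // Ordinary string keys: each one lands in partition 0, 1, or 2.
        for (String key : new String[] {"hadoop", "spark", "2018"}) {
            System.out.println(key + " -> partition " + partitionFor(key, numReduceTasks));
        }

        // A negative hash shows why the mask is needed:
        // in Java, -7 % 3 is -1, which is not a valid partition number.
        Integer negative = -7; // Integer.hashCode() returns the value itself
        System.out.println("unmasked: " + (negative.hashCode() % numReduceTasks));
        System.out.println("masked:   " + partitionFor(negative, numReduceTasks));
    }
}
```

Masking with `Integer.MAX_VALUE` rather than calling `Math.abs` also covers a hash of `Integer.MIN_VALUE`, whose absolute value does not fit in an `int`.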

2. Writing the code

The steps below build on the Word Count example from [MapReduce_1].

The goal is to send keys that are numbers to partition 0 and everything else to partition 1.

[2.1 Write MyPartition.java]

package hadoop.mr.partition;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Partitioner;

/**
 * Custom MapReduce partitioner.
 */
public class MyPartition extends Partitioner<Text, IntWritable> {
    /**
     * Keys that parse as an int go to partition 0; all other keys go to partition 1.
     */
    @Override
    public int getPartition(Text key, IntWritable value, int numPartitions) {
        try {
            // Only the success or failure of the parse matters; the value is discarded.
            Integer.parseInt(key.toString());
            return 0;
        } catch (NumberFormatException e) {
            // Not an integer (or out of int range): treat as non-numeric.
            return 1;
        }
    }
}
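
To see which keys end up where, the decision in `getPartition` can be tried outside Hadoop. The sketch below mirrors that logic on plain strings; `NumericPartitionDemo` and `partitionFor` are hypothetical names used only for this illustration, and the sample keys are made up.

```java
// Stand-alone illustration of the partition rule used by MyPartition.
// NumericPartitionDemo and partitionFor are hypothetical names, not Hadoop APIs.
final class NumericPartitionDemo {

    // Same rule as MyPartition.getPartition, applied to a plain String:
    // partition 0 if the key parses as an int, partition 1 otherwise.
    static int partitionFor(String key) {
        try {
            Integer.parseInt(key);
            return 0;
        } catch (NumberFormatException e) {
            return 1;
        }
    }

    public static void main(String[] args) {
        // Made-up sample keys covering the typical cases.
        String[] keys = {"2018", "-42", "hadoop", "3.14", "12abc", "99999999999"};
        for (String key : keys) {
            System.out.println(key + " -> partition " + partitionFor(key));
        }
    }
}
```

Because the rule relies on `Integer.parseInt`, only values that fit in an `int` count as numbers: `2018` and `-42` go to partition 0, while decimals such as `3.14`, mixed tokens such as `12abc`, and values beyond the `int` range such as `99999999999` go to partition 1 together with ordinary words.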

[2.2 Modify WCApp.java]

[Image: screenshot of the changes to WCApp.java]
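
Registering a custom partitioner in a driver usually comes down to two calls on the `Job`: `setPartitionerClass` and `setNumReduceTasks`. The sketch below is an assumption about what a modified `WCApp.java` could look like, not the author's actual code. The nested `WCMapper` and `WCReducer` classes, the job name, the package, and the use of `args[0]` and `args[1]` as input and output paths are all hypothetical; only `MyPartition` comes from step 2.1. It needs the Hadoop client libraries on the classpath.

```java
package hadoop.mr.partition; // assumed: same package as MyPartition

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WCApp {

    // Hypothetical word-count mapper: emits (token, 1) for each whitespace-separated token.
    public static class WCMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text word = new Text();

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            for (String token : value.toString().split("\\s+")) {
                if (!token.isEmpty()) {
                    word.set(token);
                    context.write(word, ONE);
                }
            }
        }
    }

    // Hypothetical word-count reducer: sums the counts for each key.
    public static class WCReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable v : values) {
                sum += v.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "wc-partition"); // job name is arbitrary
        job.setJarByClass(WCApp.class);
        job.setMapperClass(WCMapper.class);
        job.setReducerClass(WCReducer.class);

        // The two partition-related settings:
        job.setPartitionerClass(MyPartition.class); // use the custom partitioner
        job.setNumReduceTasks(2);                   // MyPartition returns 0 or 1

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));   // assumed: input path
        FileOutputFormat.setOutputPath(job, new Path(args[1])); // assumed: output path

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

With two reduce tasks, the output directory would normally contain `part-r-00000` for partition 0 and `part-r-00001` for partition 1. Every value returned by `getPartition` has to be smaller than the number of reduce tasks; with three reduce tasks, as in section 1, the third output file would simply stay empty.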

[2.3 Final result]

[Image: screenshot of the job output]


Original article: https://www.cnblogs.com/share23/p/9779593.html
