The WordCount process

Posted: 2016-09-27 17:48:14

Raw data in HDFS:

hello a

hello b

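Map phase:

Input:

<0,"hello a">

<8,"hello b">

Each key is the byte offset at which the line starts in the file. "hello a" plus its newline occupies 8 bytes, so the second line starts at offset 8.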
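To make the offset keys concrete, here is a minimal plain-Java sketch that rebuilds the <offset, line> records for the sample file. It does not use Hadoop; the class name `LineOffsets` and the method `offsets` are hypothetical, and the sketch assumes UTF-8 text with a single `\n` after every line.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

class LineOffsets {
    // Returns each line keyed by the byte offset at which it starts.
    // Assumption: UTF-8 text and a one-byte '\n' terminator per line.
    static Map<Long, String> offsets(String fileContent) {
        Map<Long, String> records = new LinkedHashMap<>();
        long offset = 0;
        for (String line : fileContent.split("\n")) {
            records.put(offset, line);
            // advance past the bytes of the line plus the newline
            offset += line.getBytes(StandardCharsets.UTF_8).length + 1;
        }
        return records;
    }

    public static void main(String[] args) {
        // prints {0=hello a, 8=hello b}
        System.out.println(offsets("hello a\nhello b\n"));
    }
}
```

The map function in this walkthrough never reads the offset; it only matters as the key type of the input records.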

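Map function:

     map(key, value, context)
     {
         // split the line on the space between the words
         String[] words = value.split(" ");
         for (String word : words)
         {
             // word is, in turn: hello, a, hello, b
             // emit each word with a count of 1
             context.write(word, 1);
         }
     }

Output:

<hello,1>

<a,1>

<hello,1>

<b,1>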
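The map step can be tried locally without a cluster. The following is a sketch in plain Java, not the Hadoop `Mapper` API: the class `MapPhaseSketch` and its `map` method are hypothetical stand-ins, and the sketch assumes the words are separated by single spaces, as in the sample data.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class MapPhaseSketch {
    // Emits <word, 1> for every space-separated word in one input line.
    // The offset key is accepted but not used.
    static List<Map.Entry<String, Integer>> map(long offset, String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String word : line.split(" ")) {
            if (!word.isEmpty()) {
                out.add(new SimpleEntry<>(word, 1));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        emitted.addAll(map(0, "hello a"));
        emitted.addAll(map(8, "hello b"));
        // prints [hello=1, a=1, hello=1, b=1]
        System.out.println(emitted);
    }
}
```

Each call handles exactly one input record, which is why "hello" appears twice in the combined output: the map step does no aggregation of its own.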

Reduce phase (before reduce runs, the map output is grouped by key and the keys are sorted in lexicographic order):

Input:

<a,1>

<b,1>

<hello,{1,1}>
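The grouping and sorting that turn the map output into this reduce input can be imitated with a `TreeMap`. This is only a local stand-in under stated assumptions, not Hadoop's actual shuffle: the class `ShuffleSketch` and the method `shuffle` are hypothetical, and every emitted value is taken to be 1, as in the map output above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class ShuffleSketch {
    // Groups the values of equal keys and orders the keys lexicographically.
    static Map<String, List<Integer>> shuffle(List<String> emittedKeys) {
        // TreeMap keeps String keys in their natural (lexicographic) order
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String key : emittedKeys) {
            // every map output pair in this example carries the value 1
            grouped.computeIfAbsent(key, k -> new ArrayList<>()).add(1);
        }
        return grouped;
    }

    public static void main(String[] args) {
        // prints {a=[1], b=[1], hello=[1, 1]}
        System.out.println(shuffle(Arrays.asList("hello", "a", "hello", "b")));
    }
}
```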

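Reduce function:

reduce(key, values, context)
{
    int sum = 0;
    // values holds every count emitted for this key, e.g. {1,1} for hello
    for (int i : values)
    {
        sum += i;
    }
    context.write(key, sum);
}

Output:

<a,1>

<b,1>

<hello,2>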
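The summation performed for each key can be checked with a few lines of plain Java. As before, this is a sketch and not the Hadoop `Reducer` API; `ReduceSketch` and `reduce` are hypothetical names, and the grouped inputs are typed in by hand.

```java
import java.util.Arrays;
import java.util.List;

class ReduceSketch {
    // Sums the grouped counts that belong to a single key.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    public static void main(String[] args) {
        // one reduce call per key, in sorted key order
        System.out.println("<a," + reduce("a", Arrays.asList(1)) + ">");
        System.out.println("<b," + reduce("b", Arrays.asList(1)) + ">");
        System.out.println("<hello," + reduce("hello", Arrays.asList(1, 1)) + ">");
    }
}
```

Running `main` prints `<a,1>`, `<b,1>` and `<hello,2>`, one per line.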

Original article: http://www.cnblogs.com/yuanfuqiang/p/5913613.html
