Hadoop: Apache Pig program (tags: hadoop, apache-pig, bigdata)

I need help writing a Pig script that counts the words in a file containing the following text:

What|is|Hadoop
History|of|Hadoop
How|Hadoop|name|was|given
Problems|with|Traditional|Large-Scale|Systems|and|Need|for|Hadoop
Understanding|Hadoop|Architecture
Fundamental|of|HDFS|(Blocks,|Name|Node,|Data|Node,|Secondary|Name|Node)
Rack|Awareness
Read/Write|from|HDFS
HDFS|Federation|and|High|Availability

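Load each line of the data into a chararray. Replace every "|" with a space (" "), tokenize the line to split it into words, then group the words and count them.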

A = LOAD '/user/hadoop/data.txt' AS (line:chararray);
B = FOREACH A GENERATE FLATTEN(TOKENIZE(REPLACE(line,'\\|',' ')));
C = GROUP B BY $0;
D = FOREACH C GENERATE group, COUNT(B);
DUMP D;
Output


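With the code above, the output I get is single letters. I want to display each word followed by its count. The output looks like this: (W)(h)(a)(t)(i)(s)(h)(a)(d)(o)(o)(p)... I want it displayed like this: (What),(hadoop)... Can you help?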
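One possible cause of single-letter output, offered as an assumption and not a confirmed diagnosis: the pattern passed to REPLACE is a regular expression, and an unescaped '|' is an empty alternation that matches at every position, so a space is inserted between all characters. The sketch below is a plain Python analogue of the replace, tokenize, group and count steps, not Pig itself. The `tokenize` helper is hypothetical and only approximates Pig's TOKENIZE default delimiters, and the three sample lines are taken from the question.

```python
import re
from collections import Counter

# Three lines from the question's input file.
LINES = [
    "What|is|Hadoop",
    "History|of|Hadoop",
    "How|Hadoop|name|was|given",
]


def tokenize(text):
    # Hypothetical stand-in for Pig's TOKENIZE: split on space,
    # double quote, comma, parentheses and star, dropping empty tokens.
    return [tok for tok in re.split(r'[ ",()*]+', text) if tok]


def word_count(lines, pattern):
    # Replace every match of `pattern` with a space, tokenize each line,
    # then count the tokens (the GROUP + COUNT step).
    counts = Counter()
    for line in lines:
        counts.update(tokenize(re.sub(pattern, " ", line)))
    return counts


# Escaped pipe: matches the literal '|' character, so whole words survive.
escaped = word_count(LINES, r"\|")

# Unescaped pipe: matches the empty string everywhere, so every
# character ends up as its own token.
unescaped = word_count(LINES, "|")

if __name__ == "__main__":
    print(escaped.most_common(3))
    print(sorted(unescaped)[:5])
```

If the Pig script in use has '|' where the answer has '\\|', restoring the escaped form is worth trying first.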