Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/352.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python Hadoop 2.7.3流媒体Jar无效_Python_Hadoop_Jar - Fatal编程技术网

Python Hadoop 2.7.3流媒体Jar无效

Python Hadoop 2.7.3流媒体Jar无效,python,hadoop,jar,Python,Hadoop,Jar,我对Hadoop非常陌生,并尝试使用运行一个简单的程序 我已将本地示例数据复制到hdfs,但在我的map reduce作业中,当我按照apache官方文档运行此命令时 hadoop jar hadoop-streaming-2.7.3.jar \ -input /user/hduser/gutenberg/* \ -output /user/hduser/gutenberg-output \ -mapper /home/hduser/mapper.py \ -reducer /home/hdus

我对Hadoop非常陌生,并尝试使用运行一个简单的程序

我已将本地示例数据复制到hdfs,但在我的map reduce作业中,当我按照apache官方文档运行此命令时

hadoop jar hadoop-streaming-2.7.3.jar \
-input /user/hduser/gutenberg/* \
-output /user/hduser/gutenberg-output \
-mapper /home/hduser/mapper.py \
-reducer /home/hduser/reducer.py
我得到了这个错误

不是有效的JAR:/usr/lib/hadoop-streaming-2.7.3.JAR


请尝试帮助我。

它正在使用Hadoop 2.7.3

这是您需要运行的命令

[Linux]$ hadoop jar \ 
/usr/local/hadoop/share/hadoop/tools/lib/hadoop-streaming-2.7.3.jar \
-file /home/python/mapper.py /home/python/reducer.py \
-mapper "python mapper.py" \
-reducer "python reducer1.py" \
-input /tmp/word_i \
-output /tmp/word_output

hadoop流媒体jar的位置:

$HADOOP_HOME/share/HADOOP/tools/lib/HADOOP-streaming-2.7.1.2.4.2.0-258.jar

因为$HADOOP_HOME有时不一样