Python Hadoop 2.7.3: streaming jar is not valid
I am very new to Hadoop and am trying to run a simple program. I have copied my local sample data to HDFS, but when I run the MapReduce job with the following command, as given in the official Apache documentation:
hadoop jar hadoop-streaming-2.7.3.jar \
-input /user/hduser/gutenberg/* \
-output /user/hduser/gutenberg-output \
-mapper /home/hduser/mapper.py \
-reducer /home/hduser/reducer.py
I get this error:
Not a valid JAR: /usr/lib/hadoop-streaming-2.7.3.jar
Please help me.

With Hadoop 2.7.3, this is the command you need to run:
[Linux]$ hadoop jar \
/usr/local/hadoop/share/hadoop/tools/lib/hadoop-streaming-2.7.3.jar \
-file /home/python/mapper.py \
-file /home/python/reducer.py \
-mapper "python mapper.py" \
-reducer "python reducer.py" \
-input /tmp/word_i \
-output /tmp/word_output
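
The question does not show the contents of mapper.py and reducer.py. As a rough sketch only, assuming a word-count job (suggested by the input names, but not confirmed by the post), the two steps could look like the functions below. The names map_words and reduce_counts are hypothetical. Streaming passes records as lines on stdin and expects tab-separated key/value lines on stdout, with the reducer input sorted by key.

```python
from itertools import groupby


def map_words(lines):
    """Mapper step (assumed word count): emit one "word<TAB>1" record per word."""
    for line in lines:
        for word in line.split():
            yield "{}\t1".format(word)


def reduce_counts(lines):
    """Reducer step: sum the counts per word. Input must already be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines if line.strip())
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield "{}\t{}".format(word, total)


# In a real job each script would read sys.stdin and print its records, e.g.
#   mapper.py:  for record in map_words(sys.stdin): print(record)
#   reducer.py: for record in reduce_counts(sys.stdin): print(record)
```

Locally, the same flow can be imitated with reduce_counts(sorted(map_words(lines))), which is a cheap way to check the scripts before submitting the job.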
Location of the Hadoop streaming jar: $HADOOP_HOME/share/hadoop/tools/lib/hadoop-streaming-2.7.1.2.4.2.0-258.jar. The exact path depends on your installation, because $HADOOP_HOME is not always the same.
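
As far as I can tell, hadoop jar prints "Not a valid JAR" when the given path does not point to an existing regular file, which fits the question: the jar name was relative, so it was looked up in the current directory. A small sketch for finding the jar that is actually installed, assuming the share/hadoop/tools/lib layout mentioned above (the function name find_streaming_jars and the fallback /usr/local/hadoop are illustrative assumptions):

```python
import glob
import os


def find_streaming_jars(hadoop_home):
    """Return the streaming jars under <hadoop_home>/share/hadoop/tools/lib, sorted."""
    pattern = os.path.join(
        hadoop_home, "share", "hadoop", "tools", "lib", "hadoop-streaming-*.jar"
    )
    # Keep only regular files, since a missing or non-file path is what
    # "hadoop jar" appears to reject.
    return sorted(path for path in glob.glob(pattern) if os.path.isfile(path))


# Example call; the fallback directory is an assumption, adjust it to your setup:
#   find_streaming_jars(os.environ.get("HADOOP_HOME", "/usr/local/hadoop"))
```

An empty list means the jar is not under that directory, and the full path returned otherwise is what should be passed to hadoop jar.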