ClassNotFoundException when running Hadoop Streaming (version 2.6.5) with Python


Whenever I run the command below to execute MapReduce code written in Python, I get a ClassNotFoundException. I am currently using hadoop-2.6.5.

Input:

hadoop jar /usr/local/hadoop1/share/hadoop/tools/sources/hadoop-streaming-2.6.5-test-sources.jar \
-input /wordcount/input/student_list.txt \
-output /wordcount/output/student_list_py.txt \
-mapper /home/hduser/wordcount_py/mapper.py \
-reducer /home/hduser/wordcount_py/reducer.py
Output:

Exception in thread "main" java.lang.ClassNotFoundException: -input
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    at java.lang.Class.forName0(Native Method)
    at java.lang.Class.forName(Class.java:348)
    at org.apache.hadoop.util.RunJar.run(RunJar.java:214)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:136)

I am new to big data and Hadoop. Please help.

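You are using the wrong jar to run the Hadoop Streaming job. The jar you passed is a test-sources jar, which apparently has no Main-Class entry in its manifest, so hadoop jar treats the next argument (-input) as the name of the class to run; that is why the exception names -input. The streaming jar is located under the Hadoop installation directory at share/hadoop/tools/lib/hadoop-streaming-2.6.5.jar. Also note that -output must be a directory that does not exist yet, not a file name.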
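
To make the jar choice concrete, here is a minimal Python sketch that looks for the streaming jar under share/hadoop/tools/lib and skips any sources or test jars. The function name find_streaming_jar and the hadoop_home parameter are hypothetical, and the directory layout is assumed to match the path given above.

```python
import glob
import os


def find_streaming_jar(hadoop_home):
    """Return the path of the hadoop-streaming jar under hadoop_home, or None.

    Assumes the layout described above:
    <hadoop_home>/share/hadoop/tools/lib/hadoop-streaming-<version>.jar
    """
    pattern = os.path.join(
        hadoop_home, "share", "hadoop", "tools", "lib", "hadoop-streaming-*.jar"
    )
    for path in sorted(glob.glob(pattern)):
        name = os.path.basename(path)
        # Skip sources and test jars; they are not the jar to pass to `hadoop jar`.
        if "sources" in name or "test" in name:
            continue
        return path
    return None
```

On the installation from the question, this would be called as find_streaming_jar("/usr/local/hadoop1"); a None result suggests the installation path is wrong or the tools/lib directory is missing.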

Try this command:

hadoop jar /usr/local/hadoop1/share/hadoop/tools/lib/hadoop-streaming-2.6.5.jar \
-input /wordcount/input/student_list.txt \
-output /wordcount/output/ \
-mapper /home/hduser/wordcount_py/mapper.py \
-reducer /home/hduser/wordcount_py/reducer.py \
-file /home/hduser/wordcount_py/mapper.py \
-file /home/hduser/wordcount_py/reducer.py
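
The same job can also be assembled as an argument list, which sidesteps shell line-continuation problems (a backslash followed by a trailing space does not continue the line). The sketch below is one way to do that in Python, using the generic -files option in place of the deprecated -file. The helper name build_streaming_command is hypothetical, and referring to the scripts by base name once they are shipped with -files is an assumption about how the job should be set up.

```python
import os


def build_streaming_command(jar, input_path, output_dir, mapper, reducer):
    """Build the argument list for a Hadoop Streaming job (sketch only)."""
    return [
        "hadoop", "jar", jar,
        # Generic options such as -files must come before the streaming options.
        "-files", ",".join([mapper, reducer]),
        "-input", input_path,
        # -output must name a directory that does not exist yet.
        "-output", output_dir,
        # Scripts shipped with -files are referenced by their base name.
        "-mapper", os.path.basename(mapper),
        "-reducer", os.path.basename(reducer),
    ]


if __name__ == "__main__":
    cmd = build_streaming_command(
        "/usr/local/hadoop1/share/hadoop/tools/lib/hadoop-streaming-2.6.5.jar",
        "/wordcount/input/student_list.txt",
        "/wordcount/output/",
        "/home/hduser/wordcount_py/mapper.py",
        "/home/hduser/wordcount_py/reducer.py",
    )
    # Print the command for inspection; to launch it, pass cmd to
    # subprocess.run(cmd, check=True) on a machine with Hadoop installed.
    print(" ".join(cmd))
```

Passing the list to subprocess.run without shell=True means no shell quoting or continuation characters are involved at all.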

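Thanks, but I still get the following error:
17/04/30 13:58:43 WARN streaming.StreamJob: -file option is deprecated, please use generic option -files instead.
packageJobJar: [/home/hduser/wordcount_py/mapper.py, /home/hduser/wordcount_py/reducer.py] [] /tmp/streamjob6033330482697904585.jar tmpDir=null
Exception in thread "main" java.io.FileNotFoundException: ///mapper.py (Permission denied)

1) Are the paths to the mapper and reducer files correct? 2) Do you have permission to access them? 3) Do the Python files have a shebang line?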
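
The three checks above (path, permissions, shebang) can be run locally before submitting the job. The following is a minimal sketch, not part of Hadoop: the function name check_streaming_script is hypothetical, and it only inspects the local file system as the current user.

```python
import os


def check_streaming_script(path):
    """Return a list of problems found with a streaming script (empty if none)."""
    # 1) Is the path correct?
    if not os.path.isfile(path):
        return ["file not found: " + path]

    problems = []
    readable = os.access(path, os.R_OK)

    # 2) Does the current user have permission to read and execute it?
    if not readable:
        problems.append("not readable by the current user")
    if not os.access(path, os.X_OK):
        problems.append("not executable (try chmod +x)")

    # 3) Does the file start with a shebang line?
    if readable:
        with open(path, "rb") as f:
            first_line = f.readline()
        if not first_line.startswith(b"#!"):
            problems.append("no shebang line (e.g. #!/usr/bin/env python)")

    return problems
```

For the files in the question, this would be called on /home/hduser/wordcount_py/mapper.py and /home/hduser/wordcount_py/reducer.py as the same user that submits the job; an empty list means none of the three problems was detected locally.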