Apache pig 错误2017:创建作业配置时发生内部错误

Apache pig 错误2017:创建作业配置时发生内部错误,apache-pig,Apache Pig,执行时,以下代码会弹出一个错误,显示为2017年错误:创建作业配置时出现内部错误。在PIG中 data = LOAD 'info.txt' USING PigStorage(); name_col_one = FOREACH data GENERATE $0 AS timeStamp, $1 AS one, $2 AS two, $3 AS info, $4 AS four, $5 AS five, $6 AS six, $7 AS seven, $8 AS eight, $9 AS nine

执行时,以下代码会弹出一个错误,显示为2017年错误:创建作业配置时出现内部错误。在PIG中

data = LOAD 'info.txt' USING PigStorage();

name_col_one = FOREACH data GENERATE $0 AS timeStamp, $1 AS one, $2 AS two, $3 AS info, $4 AS four, $5 AS five, $6 AS six, $7 AS seven, $8 AS eight, $9 AS nine, $10 AS ten, $11  AS eleven;

process_col_one = FOREACH name_col_one GENERATE FLATTEN(STRSPLIT(timeStamp,'\\s+',2)) AS (time:chararray, date:chararray), one, two;

new_timestamp = FOREACH process_col_one GENERATE CONCAT(date,CONCAT(' ',time)), one, two;

sys_info = FOREACH name_col_one GENERATE info;

split_  = FOREACH sys_info GENERATE REPLACE(info, '\\[', '') AS new_split;
split_again  = FOREACH split_ GENERATE REPLACE(new_split, ']', '\t') AS final_split;

others = FOREACH name_col_one GENERATE four, five, six, seven, eight, nine, ten, eleven;

r1 = RANK new_timestamp;
r2 = RANK split_again;
r3 = RANK others;

final = JOIN r1 BY rank_new_timestamp, r2 BY rank_split_again;
DUMP final;
info.txt中的样本数据

2015年2月23日23:58:19良好1042559519[Linux][Baseline][lrtp2nosqlprod1][FileSystem][tmp]FileSystems/tmp\Use%=1%9:5603 0 1

2015年2月23日23:58:15良好1042559519[Linux][Baseline][lrtp2nosqlprod1][FileSystem][boot]FileSystems/boot\Use%=37%3:5603 0 37

23:58:15 2015年2月23日良好1042559537[Linux][Baseline][lrtp2nosqlprod1][Process][srmclient][SiSExclude]正在运行3:5599正在运行无数据1 0

23:58:15 2015年2月23日良好1042559537[Linux][Baseline][lrtp2nosqlprod1][Process][OSWatcher][SiSExclude]正在运行,2个进程4:5599正在运行无数据2 0 0

关系 new_timestamp将时间戳与输入dat反向, split_再次删除$3中的方括号,并用“\t”分隔

Pig Stack Trace
---------------
ERROR 2017: Internal error creating job configuration.

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias final
    at org.apache.pig.PigServer.openIterator(PigServer.java:880)
    at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:774)
    at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:372)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:198)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:173)
    at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69)
    at org.apache.pig.Main.run(Main.java:541)
    at org.apache.pig.Main.main(Main.java:156)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
Caused by: org.apache.pig.PigException: ERROR 1002: Unable to store alias final
    at org.apache.pig.PigServer.storeEx(PigServer.java:982)
    at org.apache.pig.PigServer.store(PigServer.java:942)
    at org.apache.pig.PigServer.openIterator(PigServer.java:855)
    ... 12 more
Caused by: org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobCreationException: ERROR 2017: Internal error creating job configuration.
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:873)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.compile(JobControlCompiler.java:298)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:190)
    at org.apache.pig.PigServer.launchPlan(PigServer.java:1322)
    at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1307)
    at org.apache.pig.PigServer.storeEx(PigServer.java:978)
    ... 14 more
Caused by: java.lang.NullPointerException
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:817)
    ... 19 more
================================================================================
欢迎任何帮助。
提前感谢。

此问题以前已报告(),并已修复,可能尝试使用最新版本的pig

这个问题有时可以通过指定输入数据文件的路径来解决
e、 g.“/home/user/doc/info.txt”

请编辑您的帖子,添加您输入数据的样本和完整的堆栈跟踪。你用的是哪个版本的Pig?我可以卸载r1、r2和r3。我只有在转储final时才得到错误。好的,但是您可以添加它吗?你用的是哪个版本的猪?我们需要更多信息来帮助您。它适用于我使用0.14.0。回到Pig 0.12.0,当Pig找不到文件时,它显示的不是一条正确的错误消息,而是您发布的内容。。。您确定在HDFS中使用了正确的路径吗?如果您在控制台中键入hadoop fs-cat info.txt,您会看到该文件的内容吗?是的,我可以看到该文件