Hadoop Pig job failed, need advice


Please help me understand why the following join fails.

Code:

nyse_div= load '/home/cloudera/NYSE_daily_dividends' using PigStorage(',')as(exchange:chararray, symbol:chararray, date:chararray, dividends:double);
nyse_div1= foreach nyse_div generate symbol,SUBSTRING(date,0,4) as year,dividends;
nyse_div2= group nyse_div1 by (symbol,year);
nyse_div3= foreach nyse_div2 generate group,AVG(nyse_div1.dividends);

nyse_price= load '/home/cloudera/NYSE_daily_prices' using PigStorage(',')as(exchange:chararray, symbol:chararray, date:chararray, open:double, high:double, low:double, close:double, volume:long, adj:double);
nyse_price1= foreach nyse_price generate symbol,SUBSTRING(date,0,4) as year,open..;
nyse_price2= group nyse_price1 by (symbol,year);
nyse_price3= foreach nyse_price2 generate group,MAX(nyse_price1.high),MIN(nyse_price1.low);
nyse_final= join nyse_div3 by group,nyse_price3 by group;
--store nyse_div3 into 'home/cloudera/NYSE_daily_dividends/output' using PigStorage(',');
--store nyse_price3 into 'home/cloudera/NYSE_daily_dividends/output1' using PigStorage(',');
store nyse_final into '/home/cloudera/NYSE_daily_dividends/output' using PigStorage(',');
Failed Jobs:

JobId   Alias   Feature Message Outputs
job_local766969553_0008 nyse_final      HASH_JOIN       Message: Job failed!    /home/cloudera/NYSE_daily_dividends/output,
Input(s):
Successfully read records from: "/home/cloudera/NYSE_daily_dividends"
Successfully read records from: "/home/cloudera/NYSE_daily_prices"
Output(s):
Failed to produce result in "/home/cloudera/NYSE_daily_dividends/output"
Job DAG:
job_local1308827629_0006        ->      job_local766969553_0008,
job_local241929118_0007 ->      job_local766969553_0008,
job_local766969553_0008
2014-11-12 17:00:35,263 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Some jobs have failed! Stop running all dependent jobs

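Your code works fine for me. I think you need to paste the complete error log so the failure can be understood.

Below is the input I used and the output I got from it.

NYSE_daily_dividends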

NYSE,AIT,2009-11-12,0.15
NYSE,AIT,2009-08-12,0.15
NYSE,AIT,2009-05-13,0.15
NYSE,AIT,2009-02-11,0.15
NYSE,AEA,2010-02-08,4.42,4.42,4.21,4.24,205500,4.24
NYSE,AEA,2010-02-05,4.42,4.54,4.22,4.41,194300,4.41
NYSE,AEA,2010-02-04,4.55,4.69,4.39,4.42,233800,4.42
NYSE,AIT,2009-02-11,0.15,4.87,4.55,4.55,234444,4.56

NYSE_daily_prices

NYSE,AIT,2009-11-12,0.15
NYSE,AIT,2009-08-12,0.15
NYSE,AIT,2009-05-13,0.15
NYSE,AIT,2009-02-11,0.15
NYSE,AEA,2010-02-08,4.42,4.42,4.21,4.24,205500,4.24
NYSE,AEA,2010-02-05,4.42,4.54,4.22,4.41,194300,4.41
NYSE,AEA,2010-02-04,4.55,4.69,4.39,4.42,233800,4.42
NYSE,AIT,2009-02-11,0.15,4.87,4.55,4.55,234444,4.56

Output of the code

((AIT,2009),0.15,(AIT,2009),4.87,4.55)
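
In the output above the join key appears twice, each time as a nested (symbol, year) tuple. The following is a minimal sketch, not a fix for the failed job, of how the same pipeline could flatten the group key so the join runs on named columns. It reuses the paths and schemas from the question; the output aliases avg_dividend, max_high, and min_low are hypothetical names introduced here for illustration.

```pig
-- Sketch only: same inputs and schemas as in the question.
nyse_div = load '/home/cloudera/NYSE_daily_dividends' using PigStorage(',')
    as (exchange:chararray, symbol:chararray, date:chararray, dividends:double);
nyse_div1 = foreach nyse_div generate symbol, SUBSTRING(date,0,4) as year, dividends;
nyse_div2 = group nyse_div1 by (symbol, year);
-- flatten(group) turns the (symbol, year) tuple back into two plain columns.
-- avg_dividend is a hypothetical alias.
nyse_div3 = foreach nyse_div2 generate flatten(group) as (symbol, year),
    AVG(nyse_div1.dividends) as avg_dividend;

nyse_price = load '/home/cloudera/NYSE_daily_prices' using PigStorage(',')
    as (exchange:chararray, symbol:chararray, date:chararray, open:double,
        high:double, low:double, close:double, volume:long, adj:double);
nyse_price1 = foreach nyse_price generate symbol, SUBSTRING(date,0,4) as year, open..;
nyse_price2 = group nyse_price1 by (symbol, year);
-- max_high and min_low are hypothetical aliases.
nyse_price3 = foreach nyse_price2 generate flatten(group) as (symbol, year),
    MAX(nyse_price1.high) as max_high, MIN(nyse_price1.low) as min_low;

-- Join on the two named key columns instead of the nested group tuple.
nyse_final = join nyse_div3 by (symbol, year), nyse_price3 by (symbol, year);

-- Print the schema of the joined relation to confirm the column layout.
describe nyse_final;
```

Running describe before the store is a cheap way to confirm the schema of the joined relation without launching a job.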

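
Can you paste sample input from the NYSE_daily_dividends and NYSE_daily_prices files?

exchange,stock_symbol,date,dividends
NYSE,AIT,2009-11-12,0.15
NYSE,AIT,2009-08-12,0.15
NYSE,AIT,2009-05-13,0.15
NYSE,AIT,2009-02-11,0.15

exchange,stock_symbol,date,stock_price_open,stock_price_high,stock_price_low,stock_price_close,stock_volume,stock_price_adj_close
NYSE,AEA,2010-02-08,4.42,4.42,4.21,4.24,205500,4.24
NYSE,AEA,2010-02-05,4.42,4.54,4.22,4.41,194300,4.41
NYSE,AEA,2010-02-04,4.55,4.69,4.39,4.42,233800,4.42

You should see the stack trace of the error in the log. Paste the stack trace here so we can see which part failed. I also don't think there is anything wrong, since this is very simple code.

Some jobs have failed! Stop running all dependent jobs
JobId   Alias   Feature Outputs
job_local1308827629_0006        nyse_div,nyse_div1,nyse_div2,nyse_div3  GROUP_BY,COMBINER
job_local241929118_0007 nyse_price,nyse_price1,nyse_price2,nyse_price3  GROUP_BY,COMBINER
Failed Jobs:
JobId   Alias   Feature Message Outputs
job_local766969553_0008 nyse_final      HASH_JOIN       Message: Job failed!    /home/cloudera/NYSE_daily_dividends/output,
Input(s):
Successfully read records from: "...

Can you try running the Pig script in local mode instead of against HDFS? That should give you some idea of what is going on.

I am already running in local mode only, and I still get the same issue. Is it because of the huge data? Let me try with a small data set.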
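
One way to try the small-data-set idea from the last comment without editing the input files is to cap each relation right after loading it. This is only a sketch under assumptions: the aliases nyse_div_small and nyse_price_small and the row count 1000 are hypothetical, and limit returns an arbitrary subset, so the two capped inputs may share few (symbol, year) keys.

```pig
-- Sketch only: same loads as in the question.
nyse_div = load '/home/cloudera/NYSE_daily_dividends' using PigStorage(',')
    as (exchange:chararray, symbol:chararray, date:chararray, dividends:double);
nyse_price = load '/home/cloudera/NYSE_daily_prices' using PigStorage(',')
    as (exchange:chararray, symbol:chararray, date:chararray, open:double,
        high:double, low:double, close:double, volume:long, adj:double);

-- Hypothetical aliases and row count: keep at most 1000 rows per input.
nyse_div_small   = limit nyse_div 1000;
nyse_price_small = limit nyse_price 1000;

-- Inspect a capped input on the console before running the group and join.
dump nyse_div_small;
```

The remaining foreach, group, and join statements from the question can then read from nyse_div_small and nyse_price_small in place of nyse_div and nyse_price.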