Hadoop 我们可以从ApachePig中的脚本B访问脚本a中加载的关系吗?
我的问题是,是否有一种方法可以从script2.pig访问sript1.pig中构建的关系“data1”,而无需再次加载数据 script1.pig有:Hadoop 我们可以从ApachePig中的脚本B访问脚本a中加载的关系吗?,hadoop,apache-pig,Hadoop,Apache Pig,我的问题是,是否有一种方法可以从script2.pig访问sript1.pig中构建的关系“data1”,而无需再次加载数据 script1.pig有: data1 = LOAD '$some_location'USING PigStorage('\t') AS (...); data1 = LOAD '$some_location'USING PigStorage('\t') AS (...); filter1 = FILTER data1 BY <<some-condition
data1 = LOAD '$some_location'USING PigStorage('\t') AS (...);
data1 = LOAD '$some_location'USING PigStorage('\t') AS (...);
filter1 = FILTER data1 BY <<some-condition-1>>;
运行script2.pig;
执行董事
script2.pig有:
filter1 = FILTER data1 BY <<some-condition>>;
filter1 = FILTER data1 BY <<some-condition-2>>;
filter1=根据过滤数据1;
我是否可以访问script2.pig中的数据1,而不必在script2.pig中重新加载数据1?我在项目中尝试了一些方法来解决这个问题,并且成功了: runner_script.pig具有:
RUN script1.pig;
EXEC;
RUN script2.pig;
EXEC;
script1.pig有:
data1 = LOAD '$some_location'USING PigStorage('\t') AS (...);
data1 = LOAD '$some_location'USING PigStorage('\t') AS (...);
filter1 = FILTER data1 BY <<some-condition-1>>;
data1=使用PigStorage('\t')作为(…)加载“$some_location”;
filter1=根据过滤数据1;
script2.pig有:
filter1 = FILTER data1 BY <<some-condition>>;
filter1 = FILTER data1 BY <<some-condition-2>>;
filter1=根据过滤数据1;
这样我就不必加载两次数据1 如果有人能提供另一种解决方案,我将不胜感激。不…………我想你可以,请看看我的解决方案,如果有任何问题或你需要我方提供任何其他信息,请告诉我。