Hadoop: Can we access a relation loaded in script A from script B in Apache Pig?

My question is whether there is a way to access the relation "data1", built in script1.pig, from script2.pig without loading the data again.

script1.pig has:

data1 = LOAD '$some_location' USING PigStorage('\t') AS (...);
filter1 = FILTER data1 BY <<some-condition-1>>;
RUN script2.pig;
EXEC;

script2.pig has:

filter1 = FILTER data1 BY <<some-condition-2>>;

Can I access data1 in script2.pig without having to reload it there?

I tried a few approaches in my project to solve this, and the following one worked:

runner_script.pig has:

RUN script1.pig; 
EXEC;

RUN script2.pig; 
EXEC;

script1.pig has:

data1 = LOAD '$some_location' USING PigStorage('\t') AS (...);
filter1 = FILTER data1 BY <<some-condition-1>>;

script2.pig has:

filter1 = FILTER data1 BY <<some-condition-2>>;

This way I do not have to load data1 twice.
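
The likely reason this works is the difference between Pig's two script-running commands: `run` executes a script in the current Grunt context, so its aliases stay visible afterwards, whereas `exec` with a script name runs that script in a separate context. Below is an annotated sketch of the runner script above. The comments are my reading of the documented behavior, not something verified against this particular project.

```pig
-- runner_script.pig (file names taken from the answer above)

-- RUN executes script1.pig in the current context, so the aliases it
-- defines (data1, filter1) remain visible to the statements that follow.
RUN script1.pig;

-- script2.pig is run in the same context, so it can refer to data1
-- without issuing its own LOAD.
RUN script2.pig;

-- By contrast, "EXEC script1.pig;" would run the script in a separate
-- context, and data1 would not be visible to anything that comes after it.
```

Note that script2.pig reassigns the alias filter1, so after the second RUN that alias refers to the result of the second filter.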

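I would appreciate it if anyone could offer an alternative solution.

No………… I think you can. Please take a look at my solution, and let me know if there are any problems or if you need any other information from me.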