Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/hadoop/6.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Hadoop 如何在配置单元CDH中创建ORC文件?_Hadoop_Hive_Cloudera_Dataformat_Hortonworks Data Platform - Fatal编程技术网

Hadoop 如何在配置单元CDH中创建ORC文件?

Hadoop 如何在配置单元CDH中创建ORC文件?,hadoop,hive,cloudera,dataformat,hortonworks-data-platform,Hadoop,Hive,Cloudera,Dataformat,Hortonworks Data Platform,我可以用Apache Hadoop或Hortonworks的HDP轻松创建ORC文件格式: CREATE TABLE ... STORED AS ORC 然而,这在Cloudera的CDH4.5中不起作用。(惊喜!)我得到: 失败:SemanticException无法识别存储为子句中的文件格式:ORC 因此,作为替代方案,我尝试下载并安装包含ORC类的Hive jar: hive> add jar /opt/cloudera/parcels/CDH-4.5.0-1.cdh4.5.0.p

我可以用Apache Hadoop或Hortonworks的HDP轻松创建ORC文件格式:

CREATE TABLE ... STORED AS ORC
然而,这在Cloudera的CDH4.5中不起作用。(惊喜!)我得到:

失败:SemanticException无法识别存储为子句中的文件格式:ORC

因此,作为替代方案,我尝试下载并安装包含ORC类的Hive jar:

hive> add jar /opt/cloudera/parcels/CDH-4.5.0-1.cdh4.5.0.p0.30/lib/hive/lib/hive-exec-0.11.0.jar;
然后创建我的ORC表:

hive>    CREATE TABLE test (name STRING)

> row format serde
> 'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
>  stored as inputformat
>   'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'   
>   outputformat
>   'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat';
OK
但在从一些CSV数据插入此表时,我得到一个错误:

hive> INSERT OVERWRITE TABLE test 
> SELECT name FROM textdata;

    Diagnostic Messages for this Task:
java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:413)

如何在CDH的配置单元中创建ORC表?

CDH 4.5包含配置单元0.10,请参阅。ORC已添加到蜂箱0.11中,请参阅和

CDH5目前处于测试阶段,但它确实包含蜂巢0.11,请参阅