Hadoop 如何在使用kite数据集导入数据时避免IO错误?

Hadoop 如何在使用kite数据集导入数据时避免IO错误?,hadoop,hortonworks-data-platform,kite-dataset,Hadoop,Hortonworks Data Platform,Kite Dataset,我在Ubuntu 14上使用Hortonworks HDP发行版(2.4) 下载的风筝数据集 运行此命令: ./kite-dataset -v csv-import --delimiter '|' ml-100k/u.item movies 获取此错误: WARNING: Use "yarn jar" to launch YARN applications. SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found bin

我在Ubuntu 14上使用Hortonworks HDP发行版(2.4)

下载的风筝数据集

运行此命令:

./kite-dataset -v csv-import --delimiter '|' ml-100k/u.item movies
获取此错误:

WARNING: Use "yarn jar" to launch YARN applications.
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/hdp/2.4.2.0-258/hadoop/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hdp/2.4.2.0-258/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
IO error
org.kitesdk.data.DatasetIOException: Cannot add jar path to distributed cache: /usr/hdp/2.4.2.0-258/hive/lib
    at org.kitesdk.tools.TaskUtil$ConfigBuilder.addJarPathForClass(TaskUtil.java:129)
    at org.kitesdk.tools.TransformTask.run(TransformTask.java:165)
    at org.kitesdk.cli.commands.CSVImportCommand.run(CSVImportCommand.java:186)
    at org.kitesdk.cli.Main.run(Main.java:184)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76)
    at org.kitesdk.cli.Main.main(Main.java:266)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:497)
    at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
Caused by: java.io.IOException: Jar file: /usr/hdp/2.4.2.0-258/hive/lib/ojdbc6.jar does not exist.
    at org.apache.crunch.util.DistCache.addJarToDistributedCache(DistCache.java:115)
    at org.apache.crunch.util.DistCache.addJarDirToDistributedCache(DistCache.java:208)
    at org.apache.crunch.util.DistCache.addJarDirToDistributedCache(DistCache.java:229)
    at org.kitesdk.tools.TaskUtil$ConfigBuilder.addJarPathForClass(TaskUtil.java:127)
    ... 11 more

如何克服此问题?

这似乎是错误消息的相关部分:

原因:java.io.IOException:Jar文件:/usr/hdp/2.4.2.0-258/hive/lib/ojdbc6.Jar不存在


缺少的jar似乎是一个Oracle JDBC驱动程序。

您可以从这里下载JDBC驱动程序

这是错误的相关部分吗<代码>由以下原因引起:java.io.IOException:Jar文件:/usr/hdp/2.4.2.0-258/hive/lib/ojdbc6.Jar不存在-似乎是Oracle JDBC驱动程序。我们将立即研究此问题。谢谢如果这有帮助,你介意我把这个评论作为答案吗?