Apache flink Flink:DataSource';的输出导致错误:无法读取用户代码包装器
我才刚开始玩flink。我写了下面的代码,得到了 “数据源的输出导致错误:无法读取用户代码包装器”“错误” 我做错什么了吗 版本:Flink v 0.9.1(hadoop 1)未使用hadoop:Local execution 外壳:scala外壳 代码:Apache flink Flink:DataSource';的输出导致错误:无法读取用户代码包装器,apache-flink,Apache Flink,我才刚开始玩flink。我写了下面的代码,得到了 “数据源的输出导致错误:无法读取用户代码包装器”“错误” 我做错什么了吗 版本:Flink v 0.9.1(hadoop 1)未使用hadoop:Local execution 外壳:scala外壳 代码: val env = ExecutionEnvironment.getExecutionEnvironment val text = env.readTextFile("/home/ashish/Downloads/spark/synop.20
val env = ExecutionEnvironment.getExecutionEnvironment
val text = env.readTextFile("/home/ashish/Downloads/spark/synop.201501.csv"
val data_split = text.flatMap{_.split(';')}
data_split.first(3).print()
注意:输入文件使用“;”除沫器
错误:
Scala-Flink> val data_split = text.flatMap{_.split(';')}
data_split: org.apache.flink.api.scala.DataSet[String] = org.apache.flink.api.scala.DataSet@60f70249
Scala-Flink> data_split.first(3).print()
09/24/2015 09:20:14 Job execution switched to status RUNNING.
09/24/2015 09:20:14 CHAIN DataSource (at $line26.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:14) (org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:15))(1/1) switched to SCHEDULED
09/24/2015 09:20:14 CHAIN DataSource (at $line26.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:14) (org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:15))(1/1) switched to DEPLOYING
09/24/2015 09:20:14 CHAIN DataSource (at $line26.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:14) (org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$.<init>(<console>:15))(1/1) switched to FAILED
java.lang.Exception: Call to registerInputOutput() of invokable failed
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:504)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: The initialization of the DataSource's outputs caused an error: Could not read the user code wrapper: $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
at org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:89)
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:501)
... 1 more
Caused by: org.apache.flink.runtime.operators.util.CorruptConfigurationException: Could not read the user code wrapper: $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
at org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:284)
at org.apache.flink.runtime.operators.RegularPactTask.instantiateUserCode(RegularPactTask.java:1507)
at org.apache.flink.runtime.operators.chaining.ChainedFlatMapDriver.setup(ChainedFlatMapDriver.java:39)
at org.apache.flink.runtime.operators.chaining.ChainedDriver.setup(ChainedDriver.java:72)
at org.apache.flink.runtime.operators.RegularPactTask.initOutputs(RegularPactTask.java:1378)
at org.apache.flink.runtime.operators.DataSourceTask.initOutputs(DataSourceTask.java:290)
at org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:86)
... 2 more
Caused by: java.lang.ClassNotFoundException: $line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:348)
at org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:71)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1613)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1518)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1774)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:302)
at org.apache.flink.util.InstantiationUtil.readObjectFromConfig(InstantiationUtil.java:264)
at org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:282)
... 8 more
09/24/2015 09:20:14 Job execution switched to status FAILING.
09/24/2015 09:20:14 CHAIN GroupReduce (GroupReduce at org.apache.flink.api.scala.DataSet.first(DataSet.scala:707)) -> FlatMap (collect())(1/1) switched to CANCELED
09/24/2015 09:20:14 DataSink (collect() sink)(1/1) switched to CANCELED
org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1.applyOrElse(JobManager.scala:314)
at scala.runtime.AbstractPartialFunction$mcVL$sp.apply$mcVL$sp(AbstractPartialFunction.scala:33)
at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:33)
at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:25)
at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:43)
at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:29)
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:118)
at org.apache.flink.runtime.ActorLogMessages$$anon$1.applyOrElse(ActorLogMessages.scala:29)
at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
at org.apache.flink.runtime.jobmanager.JobManager.aroundReceive(JobManager.scala:92)
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
at akka.actor.ActorCell.invoke(ActorCell.scala:487)
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
at akka.dispatch.Mailbox.run(Mailbox.scala:221)
at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: java.lang.Exception: Call to registerInputOutput() of invokable failed
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:504)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: The initialization of the DataSource's outputs caused an error: Could not read the user code wrapper: $anonfun$1
at org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:89)
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:501)
... 1 more
Caused by: org.apache.flink.runtime.operators.util.CorruptConfigurationException: Could not read the user code wrapper: $anonfun$1
at org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:284)
at org.apache.flink.runtime.operators.RegularPactTask.instantiateUserCode(RegularPactTask.java:1507)
at org.apache.flink.runtime.operators.chaining.ChainedFlatMapDriver.setup(ChainedFlatMapDriver.java:39)
at org.apache.flink.runtime.operators.chaining.ChainedDriver.setup(ChainedDriver.java:72)
at org.apache.flink.runtime.operators.RegularPactTask.initOutputs(RegularPactTask.java:1378)
at org.apache.flink.runtime.operators.DataSourceTask.initOutputs(DataSourceTask.java:290)
at org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:86)
... 2 more
Caused by: java.lang.ClassNotFoundException: $anonfun$1
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:348)
at org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:71)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1613)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1518)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1774)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:302)
at org.apache.flink.util.InstantiationUtil.readObjectFromConfig(InstantiationUtil.java:264)
at org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:282)
... 8 more
Scala-Flink>val-data_-split=text.flatMap{{u.split(';')}
data_split:org.apache.flink.api.scala.DataSet[String]=org.apache.flink.api.scala。DataSet@60f70249
Scala Flink>data_split.first(3).print()
2015年9月24日09:20:14作业执行切换到运行状态。
2015年9月24日09:20:14链数据源(位于$line26.$read$$iw$$iw$$iw$$iw$$iw$。(:14)(org.apache.flink.api.java.io.TextInputFormat))->FlatMap(位于$line27的FlatMap.$read$$iw$$iw$$iw$$iw$$iw$$iw$(:15))(1/1)切换到预定
2015年9月24日09:20:14链数据源(位于$line26.$read$$iw$$iw$$iw$$iw$$iw$。(:14)(org.apache.flink.api.java.io.TextInputFormat))->FlatMap(位于$line27的FlatMap.$read$$iw$$iw$$iw$$iw$$iw$$iw$(:15))(1/1)切换到部署
2015年9月24日09:20:14链数据源(位于$line26.$read$$iw$$iw$$iw$$iw$$iw$。(:14)(org.apache.flink.api.java.io.TextInputFormat))->FlatMap(位于$line27的FlatMap.$read$$iw$$iw$$iw$$iw$$iw$$iw$(:15))(1/1)切换到失败
java.lang.Exception:调用invokable的registerInputOutput()失败
位于org.apache.flink.runtime.taskmanager.Task.run(Task.java:504)
运行(Thread.java:745)
原因:java.lang.RuntimeException:数据源输出的初始化导致错误:无法读取用户代码包装:$line27。$read$$iw$$iw$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
位于org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:89)
位于org.apache.flink.runtime.taskmanager.Task.run(Task.java:501)
... 还有一个
原因:org.apache.flink.runtime.operators.util.CorruptConfigurationException:无法读取用户代码包装:$line27。$read$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
位于org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:284)
位于org.apache.flink.runtime.operators.regularpacktask.instantiationEUSERcode(regularpacktask.java:1507)
位于org.apache.flink.runtime.operators.chaining.ChainedFlatMapDriver.setup(ChainedFlatMapDriver.java:39)
位于org.apache.flink.runtime.operators.chaining.ChainedDriver.setup(ChainedDriver.java:72)
位于org.apache.flink.runtime.operators.regularpacktask.initOutputs(regularpacktask.java:1378)
位于org.apache.flink.runtime.operators.DataSourceTask.initOutputs(DataSourceTask.java:290)
位于org.apache.flink.runtime.operators.DataSourceTask.registerInputOutput(DataSourceTask.java:86)
... 还有两个
原因:java.lang.ClassNotFoundException:$line27.$read$$iw$$iw$$iw$$iw$$iw$$iw$$iw$$iw$$anonfun$1
位于java.net.URLClassLoader.findClass(URLClassLoader.java:381)
位于java.lang.ClassLoader.loadClass(ClassLoader.java:424)
位于java.lang.ClassLoader.loadClass(ClassLoader.java:357)
位于java.lang.Class.forName0(本机方法)
位于java.lang.Class.forName(Class.java:348)
位于org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:71)
位于java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1613)
位于java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1518)
位于java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1774)
位于java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
位于java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
位于java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
位于java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
位于java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
位于java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
位于java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
位于java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
位于java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
位于java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
位于org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:302)
位于org.apache.flink.util.InstantiationUtil.readObjectFromConfig(InstantiationUtil.java:264)
位于org.apache.flink.runtime.operators.util.TaskConfig.getStubWrapper(TaskConfig.java:282)
... 8个以上
2015年9月24日09:20:14作业执行切换到失败状态。
2015年9月24日09:20:14 CHAIN GroupReduce(GroupReduce位于org.apache.flink.api.scala.DataSet.first(DataSet.scala:707))->FlatMap(collect())(1/1)切换为取消
2015年9月24日09:20:14数据链路(collect()接收器)(1/1)切换到已取消
org.apache.flink.runtime.client.JobExecutionException:作业执行失败。
位于org.apache.flink.runtime.jobmanager.jobmanager$$anonfun$receiveWithLogMessages$1.applyOrElse(jobmanager.scala:314)
在scala.runtime.AbstractPartialFunction$mcVL$sp.apply$mcVL$sp(AbstractPartialFunction.scala:33)
在scala.runtime.AbstractPartialFunction$mcVL$sp.apply处(AbstractPartialFunction.scala:33)
在scala.runtime.AbstractPartialFunction$mcVL$sp.apply处(AbstractPartialFunction.scala:25)
位于org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:43)
位于org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:29)
在scala.PartialFunction$class.applyOrElse中(PartialFunction.scala:118)
在org.apache.flink.runtime.ActorLogMessages$$anon$1.applyOrElse(ActorLogMessages.scala:29)
在akka.actor.actor$class.aroundReceive(actor.scala:465)
位于org.apache.flink.runtime.jobmanager.jobmanager.aroundReceive(jobmanager.scala:92)
在akka.actor.ActorCell.receiv