Java: UnresolvedAddressException with MLlib/Spark


When I run MLlib on a cluster with input files that have more than one partition, I get the following exception:

16/08/14 12:43:23 WARN TaskSetManager: Lost task 2.0 in stage 2.1 (TID 49, da06.qcri.org): FetchFailed(BlockManagerId(3, da08.qcri.org, 33322), shuffleId=0, mapId=5, reduceId=2, message=
org.apache.spark.shuffle.FetchFailedException: Failed to connect to da08.qcri.org:33322
    at org.apache.spark.storage.ShuffleBlockFetcherIterator.throwFetchFailedException(ShuffleBlockFetcherIterator.scala:323)
    at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:300)
    at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:51)
    at scala.collection.Iterator$$anon$11.next(Iterator.scala:328)
    at scala.collection.Iterator$$anon$13.hasNext(Iterator.scala:371)
    at scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:327)
    at org.apache.spark.util.CompletionIterator.hasNext(CompletionIterator.scala:32)
    at org.apache.spark.InterruptibleIterator.hasNext(InterruptibleIterator.scala:39)
    at org.apache.spark.util.collection.ExternalAppendOnlyMap.insertAll(ExternalAppendOnlyMap.scala:152)
    at org.apache.spark.Aggregator.combineCombinersByKey(Aggregator.scala:58)
    at org.apache.spark.shuffle.BlockStoreShuffleReader.read(BlockStoreShuffleReader.scala:83)
    at org.apache.spark.rdd.ShuffledRDD.compute(ShuffledRDD.scala:98)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:270)
    at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:270)
    at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66)
    at org.apache.spark.scheduler.Task.run(Task.scala:89)
    at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:227)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)

Caused by: java.io.IOException: Failed to connect to ***.org:33322
    at org.apache.spark.network.client.TransportClientFactory.createClient(TransportClientFactory.java:216)
    at org.apache.spark.network.client.TransportClientFactory.createClient(TransportClientFactory.java:167)
    at org.apache.spark.network.netty.NettyBlockTransferService$$anon$1.createAndStart(NettyBlockTransferService.scala:90)
    at org.apache.spark.network.shuffle.RetryingBlockFetcher.fetchAllOutstanding(RetryingBlockFetcher.java:140)
    at org.apache.spark.network.shuffle.RetryingBlockFetcher.access$200(RetryingBlockFetcher.java:43)
    at org.apache.spark.network.shuffle.RetryingBlockFetcher$1.run(RetryingBlockFetcher.java:170)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    ... 3 more

Caused by: java.nio.channels.UnresolvedAddressException
    at sun.nio.ch.Net.checkAddress(Net.java:123)
    at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:621)
    at io.netty.channel.socket.nio.NioSocketChannel.doConnect(NioSocketChannel.java:209)
    at io.netty.channel.nio.AbstractNioChannel$AbstractNioUnsafe.connect(AbstractNioChannel.java:207)
    at io.netty.channel.DefaultChannelPipeline$HeadContext.connect(DefaultChannelPipeline.java:1097)
    at io.netty.channel.AbstractChannelHandlerContext.invokeConnect(AbstractChannelHandlerContext.java:471)
    at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:456)
    at io.netty.channel.ChannelOutboundHandlerAdapter.connect(ChannelOutboundHandlerAdapter.java:47)
    at io.netty.channel.AbstractChannelHandlerContext.invokeConnect(AbstractChannelHandlerContext.java:471)
    at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:456)
    at io.netty.channel.ChannelDuplexHandler.connect(ChannelDuplexHandler.java:50)
    at io.netty.channel.AbstractChannelHandlerContext.invokeConnect(AbstractChannelHandlerContext.java:471)
    at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:456)
    at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:438)
    at io.netty.channel.DefaultChannelPipeline.connect(DefaultChannelPipeline.java:908)
    at io.netty.channel.AbstractChannel.connect(AbstractChannel.java:203)
    at io.netty.bootstrap.Bootstrap$2.run(Bootstrap.java:166)
    at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:357)
    at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:357)
    at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:111)
    ... 1 more

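In the slaves configuration file, I list the nodes by IP address rather than by hostname. Also, when I ping the machines by hostname from the master node, there does not seem to be any problem.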
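The ping test above only covers the master. The trace shows the failing connection being opened by a task on da06.qcri.org towards da08.qcri.org, and `java.nio.channels.UnresolvedAddressException` is what `SocketChannel.connect` throws when the target address was never resolved. A minimal sketch of a check that could be run on every worker node is below; the class name `ResolveCheck` and its helper are hypothetical, and the hostnames and port are simply copied from the log (the block manager port will likely differ between runs).

```java
import java.net.InetSocketAddress;

// Hypothetical diagnostic, not part of Spark: reports whether this JVM
// can resolve the host names that the shuffle fetch tries to reach.
class ResolveCheck {

    // InetSocketAddress attempts name resolution in its constructor;
    // isUnresolved() is true when that lookup failed.
    static boolean isUnresolved(String host, int port) {
        return new InetSocketAddress(host, port).isUnresolved();
    }

    public static void main(String[] args) {
        // Assumption: default hosts and port are taken from the log above.
        String[] hosts = args.length > 0
                ? args
                : new String[] {"da06.qcri.org", "da08.qcri.org"};
        int port = 33322;
        for (String host : hosts) {
            String status = isUnresolved(host, port) ? "UNRESOLVED" : "resolved";
            System.out.println(host + ": " + status);
        }
    }
}
```

Compiling this with `javac` and running `java ResolveCheck da08.qcri.org` on each worker would show whether any node reports `UNRESOLVED` for a name that the master resolves fine, which would point at that node's hosts file or DNS setup.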

Has anyone run into a similar problem, or does anyone have an idea how to solve it?