Apache spark Spark作业已终止,java.io.EOFException:尝试从服务器读取响应时出现意外的EOF
我正在Amazon EMR中运行spark作业,作业终止时出现以下错误:Apache spark Spark作业已终止,java.io.EOFException:尝试从服务器读取响应时出现意外的EOF,apache-spark,apache-spark-sql,amazon-emr,Apache Spark,Apache Spark Sql,Amazon Emr,我正在Amazon EMR中运行spark作业,作业终止时出现以下错误: 20/10/01 10:44:51 WARN DataStreamer: Exception for BP-1069374220-10.0.1.121-1601548370932:blk_1073741830_1006 java.io.EOFException: Unexpected EOF while trying to read response from server at org.apache.hadoop
20/10/01 10:44:51 WARN DataStreamer: Exception for BP-1069374220-10.0.1.121-1601548370932:blk_1073741830_1006
java.io.EOFException: Unexpected EOF while trying to read response from server
at org.apache.hadoop.hdfs.protocolPB.PBHelperClient.vintPrefixed(PBHelperClient.java:402)
at org.apache.hadoop.hdfs.protocol.datatransfer.PipelineAck.readFields(PipelineAck.java:213)
at org.apache.hadoop.hdfs.DataStreamer$ResponseProcessor.run(DataStreamer.java:1073)
20/10/01 10:44:51 WARN DataStreamer: Error Recovery for BP-1069374220-10.0.1.121-1601548370932:blk_1073741830_1006 in pipeline [DatanodeInfoWithStorage[10.0.1.16:50010,DS-3736ee37-017b-419c-af06-57ff2a605389,DISK], DatanodeInfoWithStorage[10.0.1.25:50010,DS-7514c078-d287-4df4-b081-190f696b7794,DISK]]: datanode 0(DatanodeInfoWithStorage[10.0.1.16:50010,DS-3736ee37-017b-419c-af06-57ff2a605389,DISK]) is bad.
20/10/01 10:46:46 WARN DataStreamer: Exception for BP-1069374220-10.0.1.121-1601548370932:blk_1073741830_1007
java.io.EOFException: Unexpected EOF while trying to read response from server
at org.apache.hadoop.hdfs.protocolPB.PBHelperClient.vintPrefixed(PBHelperClient.java:402)
at org.apache.hadoop.hdfs.protocol.datatransfer.PipelineAck.readFields(PipelineAck.java:213)
at org.apache.hadoop.hdfs.DataStreamer$ResponseProcessor.run(DataStreamer.java:1073)
文件中可能有一些垃圾字符。在读取数据时,请尝试过滤掉这些记录或使用此选项条件。选项(“模式”,“DROPMALFORMED”)文件中可能有一些垃圾字符。在读取数据时尝试过滤掉这些记录或使用此选项条件。选项(“模式”,“DROPMALFORMED”)您可以显示任何代码以使错误可复制?您可以显示任何代码以使错误可复制?