Apache spark Spark中的MapOutputRackerMasterEndpoint输出正在减慢我的工作速度。这是什么意思?

Apache spark Spark中的MapOutputRackerMasterEndpoint输出正在减慢我的工作速度。这是什么意思?,apache-spark,pyspark,Apache Spark,Pyspark,我在spark工作中得到了这个输出,这给我的工作增加了很多延迟。就在昨天,我让作业在这条消息下运行了一整晚,回到家后发现没有其他处理 16/01/08 18:51:36 INFO spark.MapOutputTrackerMaster: Size of output statuses for shuffle 0 is 930564 bytes 16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send m

我在spark工作中得到了这个输出,这给我的工作增加了很多延迟。就在昨天,我让作业在这条消息下运行了一整晚,回到家后发现没有其他处理

16/01/08 18:51:36 INFO spark.MapOutputTrackerMaster: Size of output statuses for shuffle 0 is 930564 bytes
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.47:37232
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.44:55904
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.38:35491
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.43:60071
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.48:39267
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.39:44310
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.42:51390
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.32:36981
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.30:39261
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.34:58811
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.31:39829
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.40:45521
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.46:35892
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.33:35119
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.36:44486
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.37:35971
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.35:59625
16/01/08 18:51:36 INFO spark.MapOutputTrackerMasterEndpoint: Asked to send map output locations for shuffle 0 to 172.31.1.45:40861

我试着用谷歌搜索,但文档很少,我无法真正理解。有人知道吗?

这可能会有帮助。这个问题的第一个答案。在我的例子中,这个日志记录是在我将数据帧保存到磁盘时进行的。