Scala Spark:如何将RDD[(Long,Iterable[String])转换为RDD[(Long,String)]?
如何将此RDD[(长,可编辑的[String])]转换为 RDD[(长,字符串)]像这样吗Scala Spark:如何将RDD[(Long,Iterable[String])转换为RDD[(Long,String)]?,scala,apache-spark,Scala,Apache Spark,如何将此RDD[(长,可编辑的[String])]转换为 RDD[(长,字符串)]像这样吗 (852403, PT0000094043) (852403, PT0000097083) (852403, PT0000036162) (357331, PT0000068829) (357331, PT0000094042) (357331, PT0000066859) 尝试flatMapValues: rdd.flatMapValues(identity) 或flatMap: rdd.flatM
(852403, PT0000094043)
(852403, PT0000097083)
(852403, PT0000036162)
(357331, PT0000068829)
(357331, PT0000094042)
(357331, PT0000066859)
尝试
flatMapValues
:
rdd.flatMapValues(identity)
或flatMap
:
rdd.flatMap{ case (k, vs) => vs.map(v => (k, v)) }
rdd.flatMap{ case (k, vs) => vs.map(v => (k, v)) }