Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/336.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 如何提取数组中映射的值列表?_Python_Pyspark_Apache Spark Sql_Schema - Fatal编程技术网

Python 如何提取数组中映射的值列表?

Python 如何提取数组中映射的值列表?,python,pyspark,apache-spark-sql,schema,Python,Pyspark,Apache Spark Sql,Schema,在我的数据框架中,我有以下模式: root |-- data: array (nullable = false) | |-- element: map (containsNull = false) | | |-- key: integer | | |-- value: string (valueContainsNull = false) 例如,该值: "DATA":[{"message":"A", "AlternateMessage":"B"},{"mes

在我的数据框架中,我有以下模式:

root
 |-- data: array (nullable = false)
 |    |-- element: map (containsNull = false)
 |    |    |-- key: integer
 |    |    |-- value: string (valueContainsNull = false)
例如,该值:

"DATA":[{"message":"A", "AlternateMessage":"B"},{"message":"C", "AlternateMessage":"D"},{"message":"E", "AlternateMessage":"F"}]
我需要:

MESSAGE="A,C,E"
ALTERNEATE_MESSAGE="B,D,F"
我用

df=sf.withColumn("message",F.concat_ws(df.data.message))
但我有错误

有什么帮助吗?
谢谢

如果您可以使用spark 2.4+,则使用transform+array_-join,类似于此::如果您可以使用spark 2.4+,则使用transform+array_-join,类似于此::