Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/apache-spark/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Apache spark Spark:如何将成对rdd保存到json文件?_Apache Spark_Pyspark - Fatal编程技术网

Apache spark Spark:如何将成对rdd保存到json文件?

Apache spark Spark:如何将成对rdd保存到json文件?,apache-spark,pyspark,Apache Spark,Pyspark,我的Rdd是这样的: [('f1',1), ('f2',2)] 如何将其保存到json文件?您可以将rdd转换为数据帧并写入json from pyspark.sql import SparkSession spark = SparkSession.builder \ .appName('SO')\ .getOrCreate() sc= spark.sparkContext df = sc.parallelize(

我的Rdd是这样的:

[('f1',1), ('f2',2)]

如何将其保存到json文件?

您可以将rdd转换为数据帧并写入json

    from pyspark.sql import SparkSession

    spark = SparkSession.builder \
        .appName('SO')\
        .getOrCreate()

    sc= spark.sparkContext

    df = sc.parallelize(
        [('f1', 1), ('f2', 2)]).toDF(["key", "value"])


    df.write.format('json').save('output_path')
json文件中的输出如下所示

 {"key":"f1","value":1}
 {"key":"f2","value":2}

您可以将rdd转换为dataframe并写入JSON

    from pyspark.sql import SparkSession

    spark = SparkSession.builder \
        .appName('SO')\
        .getOrCreate()

    sc= spark.sparkContext

    df = sc.parallelize(
        [('f1', 1), ('f2', 2)]).toDF(["key", "value"])


    df.write.format('json').save('output_path')
json文件中的输出如下所示

 {"key":"f1","value":1}
 {"key":"f2","value":2}

您期望的json格式是什么?您期望的json格式是什么?