Python Spark DataFrame将元素转换为字符串_Python_Apache Spark_Pyspark_Apache Spark Sql - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/apache-spark/6.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python Spark DataFrame将元素转换为字符串_Python_Apache Spark_Pyspark_Apache Spark Sql - Fatal编程技术网

Python Spark DataFrame将元素转换为字符串

python apache-spark pyspark

Python Spark DataFrame将元素转换为字符串,python,apache-spark,pyspark,apache-spark-sql,Python,Apache Spark,Pyspark,Apache Spark Sql,我有这样一个数据帧： +---+--------------------+ |idn| recommendations| +---+--------------------+ |463|[[10955,0.0086656...| |496|[[12767,0.0209305...| |148|[[9813,0.00673213...| |471|[[8537,0.00546676...| |243|[[10846,0.0044064...| |623|[[10955,0.3857911.

我有这样一个数据帧：

+---+--------------------+
|idn|     recommendations|
+---+--------------------+
|463|[[10955,0.0086656...|
|496|[[12767,0.0209305...|
|148|[[9813,0.00673213...|
|471|[[8537,0.00546676...|
|243|[[10846,0.0044064...|
|623|[[10955,0.3857911...|
|540|[[11463,0.0250675...|
|392|[[7177,0.01615425...|
|737|[[7994,0.12720428...|
|516|[[10955,0.4047550...|
+---+--------------------+

dataFrame.printSchema()


 root
 |-- idn: long (nullable = true)
 |-- recommendations: array (nullable = true)
 |    |-- element: struct (containsNull = true)
 |    |    |-- id_usn: long (nullable = true)
 |    |    |-- rating: double (nullable = true)

模式如下：

+---+--------------------+
|idn|     recommendations|
+---+--------------------+
|463|[[10955,0.0086656...|
|496|[[12767,0.0209305...|
|148|[[9813,0.00673213...|
|471|[[8537,0.00546676...|
|243|[[10846,0.0044064...|
|623|[[10955,0.3857911...|
|540|[[11463,0.0250675...|
|392|[[7177,0.01615425...|
|737|[[7994,0.12720428...|
|516|[[10955,0.4047550...|
+---+--------------------+

dataFrame.printSchema()


 root
 |-- idn: long (nullable = true)
 |-- recommendations: array (nullable = true)
 |    |-- element: struct (containsNull = true)
 |    |    |-- id_usn: long (nullable = true)
 |    |    |-- rating: double (nullable = true)

现在我想将列中的id\u usn和评级转换为字符串
您可以按如下方式强制转换嵌套结构列

col_schema = ArrayType(StructType([StructField('id_usn',StringType(),True),StructField('rating',StringType(),True)])) df = dataFrame.select('idn',dataFrame.recommendations.cast(col_schema)) df.printSchema()
请试试这个，让我知道

[apache spark]相关文章推荐

随机文章推荐

Sqlalchemy 将删除级联到多对多关联表？ sqlalchemy

Sqlalchemy查询和过滤 sqlalchemy

检测对象未提交时SQLAlchemy内存泄漏（例如回滚） sqlalchemy

基于sqlalchemy中的某些条件自动删除对象 sqlalchemy

带括号的SQLAlchemy连词 sqlalchemy

Flask SQLAlchemy：如何调用db.create_all（）和db.drop_all（）而不触发数据库提交？ sqlalchemy

Sqlalchemy 如何与其他声明性基础一起使用 sqlalchemy

[python]相关推荐

如何在没有实际执行的情况下检查Emacs中Python代码的语法？
Python Validation Emacs Syntax

让Python代码第一次运行的好方法是什么？
Python Debugging

Python 确定一个序列是否在另一个序列中的最佳方法？
Python Algorithm

Python Django持久数据库连接
Python Database Django

Python中的非无测试
Python

Python 如何从“；类文件对象”；urllib.urlopen（）返回了什么？
Python

Python CSV：从值中删除引号
Python Csv

python列表中重复项的索引
Python

在python中使用subprocess.call时，如何将stdout重定向到文件？
Python

Python 重写实例上的特殊方法
Python

Python 在Jinja2中将变量传递给宏
Python Google App Engine

Python 是否有方法使用seleium阻止/关闭iFrame？
Python Html Selenium Iframe

如何在python上使用最后一个参数的difrent size生成3D矩阵
Python Algorithm List Matrix

Python 熊猫：遍历两个不同的列
Python Pandas

Python 为什么CNN-LSTM比LSTM快？
Python Keras

在python中对不同输入（不同大小）的函数进行有效的基准测试
Python Testing

Python 如何将已读取和解析的数据写入另一个文件？
Python Python 3.x

Python 使用多处理器或ray与其他cpu绑定任务同时写入文件
Python

Python 对诅咒边框使用utf-8字符
Python Python 3.x

Python 我想根据某些条件在数据框中添加一列
Python Pandas

Python 有没有更好/更快的方法来计算数组中每个元素的相对秩？
Python Performance Numpy Sorting

Python 返回另一个函数的函数'-&燃气轮机'；符号
Python Function

用于散热的Python numpy矢量化
Python Numpy

Python 在初始化期间，您可以向QObject传递什么类型的KWARG？
Python

在Python中重写属性get行为，但仅由外部方法重写
Python Oop Inheritance Properties

Python正则表达式输出NaN
Python Regex Pandas

Python 循环中断，而不执行最后一条打印语句
Python Loops Printing

Python 是否有从时间序列创建7天移动平均线的功能？
Python Pandas Matplotlib

Python 属性错误：'；敌人'；对象没有属性'；blit&x27；
Python

Python：从txt文件目录中计算字数，并将字数写入单独的txt文件
Python

Tags

C# 3.0 Jquery Mobile Ldap Imagemagick Ionic2 Dynamics Crm 2011 Ruby On Rails 3.2 If Statement Utf 8 Cordova Sml Regex Flutter Continuous Integration Sbt Stata Entity Framework Ruby On Rails 4 Character Encoding Vue.js Visual Studio 2015 Object Email Dictionary Doctrine Orm Laravel 5 Wso2 Gis Vector Sublimetext3 Wix Azure Ad B2c Uml Amazon Dynamodb Artificial Intelligence Sqlalchemy Gstreamer Monitoring Php Image Material Ui Visual Studio 2008 Solr Docker Compose .htaccess Kotlin Build Telegram Gridview Perforce Windows 7 Api Laravel Pascal Testing Recursion Programming Languages Webview Ravendb Deployment Ignite Chart.js Git Windows Phone 8.1 Sql Server Colors Visual Studio 2017 Pagination Hash Couchbase Xamarin.forms Xml Drupal 6 Nativescript Calendar Identityserver4 Spring Boot Electron Selenium Webdriver Msbuild Magento Clearcase Chef Infra Google Calendar Api Ibm Cloud Printing Frameworks Filter Google Cloud Storage Function Hybris Asp.net Web Api Ftp Websphere Glassfish Binary Netsuite Rabbitmq Linux Kernel Grails Dynamics Crm Visual C++ Plsql Fluent Nhibernate Qt4 Apache Flink C# 4.0 Pdf Cron Fullcalendar Asp.net Mvc Google Chrome Extension Android Ndk Download Usb Virtualbox Elm Windows 10 Virtual Machine Openlayers 3 Synchronization Xamarin.ios Perl Wolfram Mathematica Netbeans Dask Swift3 Gulp Highcharts Tridion Hbase Ionic Framework Parse Platform Angularjs Nservicebus Jquery Plugins Winforms Nestjs Here Api Uwp Sencha Touch Sequelize.js Enums Oracle10g Gnuplot Sparql EmptyTag Keras Oop Typescript Qt Internet Explorer Signalr Twilio Emacs Django Yii Ethereum Svn Join Syntax Unit Testing Ssrs 2008 Parsing Nhibernate Sharepoint 2013 Python 2.7 Geometry Mariadb Database Design Winapi Visual Studio Code C++11 Graphviz Ip Soap Animation Node.js Netty Opencv Playframework 2.0 Polymer Jekyll Tableau Api Ada Dom Microservices Jsf 2 Coffeescript Mule Wcf Phpstorm Text R Sencha Touch 2 Batch File Groovy Erlang Nsis Memory Leaks Anaconda

Copyright © 2024. All Rights Reserved by - Fatal编程技术网