PySpark: how to filter rows from this DataFrame (Apache Spark, PySpark) - Fatal编程技术网

apache-spark pyspark

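I am trying to read the first row of a file and then filter that row out of the DataFrame.

I am using take(1) to read the first row. I then want to filter it out of the DataFrame, since it may appear more than once in the dataset.

However, I get the following error:

TypeError: condition should be string or Column

I am working from Nicky's answer.

The data looks like this (but the same has to be done for multiple columns):

I want the result to look like this:

1 2 3 4 5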

take(1) on a DataFrame returns a list of Row objects, so [0][0] is needed to get the first value of the first row. In the filter clause, refer to the column by name and keep only the rows whose value is not equal to the header.

    from pyspark.sql.functions import col

    # first value of the first row
    header = df1.take(1)[0][0]

    # keep only the rows that are not equal to the header
    final_df = df1.filter(col("<col_name>") != header)
    final_df.show()
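The TypeError in the question is what filter raises when it is handed something other than a string or a Column, for example the list of Row objects that take(1) returns. Since the question also asks for the same treatment across multiple columns, here is a minimal sketch of one way to extend the answer. The helper drop_header_rows, the demo function, the column names c0 and c1, and the sample data are all made up for illustration and are not part of the original post.

```python
from functools import reduce
from operator import or_


def drop_header_rows(df, columns=None):
    """Return df without the rows that repeat its first row (hypothetical helper)."""
    first = df.take(1)  # list of Row objects; empty for an empty DataFrame
    if not first:
        return df
    header = first[0]  # the Row itself; header[c] is the value of field c
    columns = list(columns or df.columns)
    # A row is kept if at least one compared column differs from the header.
    # Comparisons against null yield null, so data containing nulls may need
    # Column.eqNullSafe instead of != (not handled in this sketch).
    differs = reduce(or_, [df[c] != header[c] for c in columns])
    return df.filter(differs)


def demo():
    """Build made-up sample data and show the filtered result; needs PySpark."""
    from pyspark.sql import SparkSession  # lazy import: the helper itself needs no import

    spark = SparkSession.builder.master("local[1]").appName("drop-header").getOrCreate()
    data = [("id", "name"), ("1", "a"), ("2", "b"), ("id", "name"), ("3", "c")]
    df1 = spark.createDataFrame(data, ["c0", "c1"])  # assumed column names
    drop_header_rows(df1).show()
    spark.stop()
```

With PySpark installed, calling demo() should leave only the three data rows. Passing columns=["c0"] compares on a single column, which matches the single-column filter in the answer above.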
