Apache spark 特定列Spark 1.6除外_Apache Spark_Dataframe_Apache Spark Sql - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/apache-spark/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Apache spark 特定列Spark 1.6除外_Apache Spark_Dataframe_Apache Spark Sql - Fatal编程技术网

Apache spark 特定列Spark 1.6除外

apache-spark dataframe

Apache spark 特定列Spark 1.6除外,apache-spark,dataframe,apache-spark-sql,Apache Spark,Dataframe,Apache Spark Sql,我正在尝试使用dfB从dfA中筛选出行 dfA： dfB：我的目标是从dfA中填写dfB中所有year cid 我认为这是一个明显的例子，除了： dfA.except(dfB) 但是，我需要两个DF中的列数相同。是否有方法对特定列执行EXPECT操作？或者应该完全走另一条路。不，我认为这对不起作用，除了。您需要的是左反联接： dfA.join(dfB,Seq("year","cid"),"leftanti") 在spark 2之前，此操作也应如此 dfA.join(dfB.withCo

我正在尝试使用dfB从dfA中筛选出行

dfA：

dfB：

我的目标是从dfA中填写dfB中所有

year cid

我认为这是一个明显的例子，除了：

dfA.except(dfB)

但是，我需要两个DF中的列数相同。是否有方法对特定列执行EXPECT操作？

或者应该完全走另一条路。

不，我认为这对

不起作用，除了。您需要的是左反联接：
dfA.join(dfB,Seq("year","cid"),"leftanti")

在spark 2之前，此操作也应如此
dfA.join(dfB.withColumn("b",lit(1)),Seq("year","cid"),"left")
  .where($"b".isNull).drop($"b")

你可以用DFB做dfA的左外连接，你需要的是一个左反join@RameshMaharjan似乎有点过激不？spark 1.6中是否有反连接部分？@user3741859 noSorry我刚注意到我在标题中没有提到它我在spark 1.6上编辑问题它不应该为null你想删除所有匹配的内容吗？@user3741859您想要所有不显示在dfB中的行，对吗？在这种情况下，您必须保留来自dfA的在dfB中不匹配的数据，即b为空。。。
dfA.join(dfB,Seq("year","cid"),"leftanti")

dfA.join(dfB.withColumn("b",lit(1)),Seq("year","cid"),"left")
  .where($"b".isNull).drop($"b")




[dataframe]相关文章推荐



                                                        
Dataframe 在同一反应函数中创建多个数据帧并分别输出
dataframeshiny 
Dataframe R中的反应对象和函数
dataframe 
Dataframe 尝试刮取文本时rvest中出现错误
dataframe 
Dataframe 如何封送、解组或转换对象，同时避免在golang中键入
dataframego 
Dataframe 如何从字典中创建数据框，其中每个项都是PySpark中的一列
dataframepyspark 
Dataframe 使用ID文件替换数据集中的某些列，然后打印整个数据集
dataframeawksed 
Dataframe PySpark中的拆分列：如何确保输出是int数组，但对于某个字符使用空数组
dataframeapache-sparkpyspark 
Dataframe 有没有一种方法可以选择性地应用这个stringr函数？
dataframestringrregex 
Dataframe spark数据框比较并仅显示不同的值
dataframeapache-spark 
Dataframe 是否可以使用unix删除包含特定值的列？
dataframeunixawk 
Spark scala更改dataframe中列的数据类型
dataframeapache-spark 
Dataframe 在pyspark数据帧中生成序列，以便在null之后找到值时该序列递增
dataframepyspark 
（Julia）将DataFrame列总和分配给新列
dataframejulia 
Dataframe Pyspark-使用startswith from列表创建一个新列
dataframeapache-sparkpyspark 
Dataframe 保存的数据帧与加载的数据帧不同
dataframecsv 
                                       





随机文章推荐



                                                        
Algorithm 昂立夫：它是如何工作的？
algorithmoptimizationcompressioncloud 
Algorithm 使用数组实现4堆
algorithm 
Algorithm 文件比较策略
algorithmvideohash 
Algorithm 快速排序决策树
algorithm 
Algorithm 最小边交点算法
algorithm 
Algorithm 何时使用Paxos（真正的实际用例）？
algorithm 
Algorithm 优化房间的3D布局？
algorithmoptimizationlanguage-agnostic 
Algorithm CUDA最大约简算法不工作
algorithmcudaparallel-processing 
Algorithm 概念上简单的线性时间后缀树构造
algorithmdata-structures 
Algorithm 聚类数目未知的无监督聚类
algorithmmathartificial-intelligencemachine-learning 
Algorithm 小数基的整数对数
algorithm 
Algorithm 算法分析
algorithm 
Algorithm 贝尔曼福特可视示例
algorithm 
Algorithm 难题：仅在10或12的倍数上找到最小资源分配
algorithmmath 
Algorithm 欧氏算法的时间复杂度
algorithm 
Algorithm 具有逆运算且无重复的超大集合的动态伪随机置换
algorithmrandom 
Algorithm 对于矩阵中的所有标记区域，找到它们的顶点以进行绘制
algorithmmatrixgraphics 
Algorithm 像Reelgood这样的流媒体聚合器如何进行搜索和启动？
algorithmsearchstreaming 
Algorithm 基于时间间隔重叠、权重约束和距离最小化的组合分组优化问题
algorithmoptimization 
Algorithm 如何使用第一个映射的值检索嵌套映射的值？
algorithmscalafunctional-programming


                                        

                                        
                                        


                                                
                                                        [apache spark]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Eclipse Plugin
Sharepoint 2007
Module
Kernel
Redirect
Google Chrome Extension
Mapreduce
Linkedin
Url Rewriting
Fullcalendar
Parameters
Ethereum
Push Notification
Kentico
Pytorch
Mod Rewrite
Visual Studio 2017
Session
Inheritance
Plot
Pyspark
Cron
D3.js
Gulp
Protocol Buffers
Azure Service Fabric
Log4net
Apache Zookeeper
Octave
Gnuplot
Appium
Cryptography
Nhibernate
Dojo
Computer Science
Sdk
Django Rest Framework
Log4j
File
Antlr4
Version Control
Gwt
Menu
Graphviz
Input
Twilio
Parallel Processing
Vim
.htaccess
Autodesk Forge
Bluetooth
Actions On Google
Processing
Tcl
Openerp
Bazel
Imagemagick
Web Scraping
Outlook
Sublimetext2
Sip
Awk
Dynamics Crm 2011
Phpunit
Wicket
Sed
Compilation
Nunit
Dependency Injection
Jquery Plugins
Syntax
Erlang
Winapi
Domain Driven Design
Tridion
Ravendb
Memory
Socket.io
Redux
Mapbox
C# 3.0
Quickbooks
Github
Operating System
Reactjs
Active Directory
Bash
Discord.py
Razor
Neo4j
Mdx
Google Maps
Selenium
Localization
Apache
Grafana
Linux
Exception Handling
Ms Access
Service
Solr
Discord
Flutter
Dom
Twitter Bootstrap 3
Random
Xamarin.android
Debian
Drools
Weblogic
Mips
Android Emulator
Printing
Server
Formatting
Hyperlink
Python Sphinx
Svg
Antlr
Ip
.net
Transactions
Entity Framework
Plone
Mvvm
Responsive Design
Directx
Iphone
Xslt
Powerbi
Dialogflow Es
Laravel
Path
Coffeescript
Django
EmptyTag
Nservicebus
Jakarta Ee
Sql Server 2005
Msbuild
Validation
Material Ui
Datatables
Google Maps Api 3
Macos
Ignite
Google Colaboratory
Z3
Stream
Akka
Ipad
Text
Tcp
Fluent Nhibernate
Google Chrome
Jira
Docker Compose
Hyperledger Fabric
Continuous Integration
Logstash
Silverstripe
Ffmpeg
Mysql
Opengl Es
Moodle
Spring Integration
Ios8
Exception
Xna
Vbscript
Sbt
Windows Mobile
Dotnetnuke
Zend Framework
Com
Javafx 2
Vagrant
Jekyll
Tkinter
Variables
Asp.net Mvc 5
Ibm Mobilefirst
Chef Infra
Qt
Visual Studio Code
Scikit Learn
Asp.net
Cors
Libgdx
Grails
Function
Amazon Web Services
Netsuite
Xmpp
Arm
Database Design
Orchardcms
Matrix
Ssrs 2008
Architecture
Batch File


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网