Python 如何使用列表有条件地从数据帧中删除重复项_Python_Pandas_Dataframe_Duplicates - Fatal编程技术网

Python 如何使用列表有条件地从数据帧中删除重复项

python pandas dataframe

Python 如何使用列表有条件地从数据帧中删除重复项,python,pandas,dataframe,duplicates,Python,Pandas,Dataframe,Duplicates,我有一个df，想删除ID上的所有重复项 Name Symbol ID 0 ZOO INC Remove 88579Y101 1 Zoo Inc ZZZ 88579Y101 2 A Inc AAA 90138A103 3 a inc. Remove 90138A103 4 2U Inc TWUO 90214J101 5 Keep Remove 11

我有一个

df

，想删除

ID

上的所有重复项

       Name     Symbol         ID
0   ZOO INC     Remove  88579Y101
1   Zoo Inc        ZZZ  88579Y101
2     A Inc        AAA  90138A103
3     a inc.    Remove  90138A103
4    2U Inc       TWUO  90214J101
5      Keep     Remove  111111111

但是我只想删除重复的行，其中

Symbol==“remove”

。输出应该如下所示：

       Name     Symbol         ID
0   Zoo Inc        ZZZ  88579Y101
1     A Inc        AAA  90138A103
2    2U Inc       TWUO  90214J101
3      Keep     Remove  111111111

我不能使用

result\u df=df.drop\u重复项（subset=['ID']，keep='first'）

（或

keep='last'

），因为数据集没有特定的模式。而且先按字母顺序排序也没用

虽然我知道我可以用

NaN

替换所有

Remove

，然后使用提供的解决方案，但我正在寻找另一种解决方案，因为我最终可能需要传递字符串列表

Pandas是否支持以下内容：

result\u df=df.drop\u duplicates（subset=['ID']，keep=（df['Symbol']！='Remove'））

与

keep=False

一起使用，用于所有复制品，并通过比较链连接

移除，通过
按位链接在一起或
，通过~
反转掩码：
m1 = df['ID'].duplicated(keep=False)
m2 = (df['Symbol'] == 'Remove')

df = df[~(m1 & m2)]

print (df)
      Name Symbol         ID
1  Zoo Inc    ZZZ  88579Y101
2    A Inc    AAA  90138A103
4   2U Inc   TWUO  90214J101
5     Keep     Remove  111111111

df[~df['Symbol'].eq（'Remove'）]
？但这不会删除Symbol=='Remove
中的所有行吗？我只想删除那些在ID
上重复的内容。我会更新这个问题，让它更清楚。这不会反映在你的样本数据上。但是再一次，耶兹雷尔的答案已经涵盖了这一点。你可能还想在你的输出中显示ID 111111（这是一个棘手的部分，但是你的技术产生了这个，所以我的投票）。




[pandas]相关文章推荐



                                                        
Pandas 滚动平均的行为
pandas 
Pandas 使用日期列填充缺少的行
pandas 
Pandas 通过复制现有记录的一部分将记录添加到dataframe
pandas 
Pandas 如何将x轴标签设置为带有数据帧时间戳列的熊猫图中的日期
pandas 
Pandas 删除多重索引中的重复索引，而不考虑顺序
pandasindexing 
Pandas 如何将索引拆分为两列？
pandas 
Pandas 嵌套循环超出列表范围
pandasloops 
Pandas 如何按ID和时间段分组日期时间数据？
pandas 
Pandas 逻辑操作：从数据帧中的列中选择两个值
pandasdataframeindexing 
Pandas 将对象更改为日期时间格式
pandasdatetimetime 
如何在Pandas、Python中按此数据分组？
pandas 
Pandas 在matplotlib所在的位置，无法正确加载图表
pandasmatplotlib 
Pandas Groupby两列之和，并在中创建新的数据帧
pandas 
Pandas 在groupby中以字典格式存储值频率的最快方法
pandas 
Pandas 真假匹配
pandasdataframe 
Pandas 数据帧列赢得'；t用空字符串替换NAN
pandasdataframe 
Pandas xarray，多维坐标索引
pandasnumpy 
Pandas 通过对数Himal binning创建二维图像
pandasdataframe 
Pandas 熊猫表正在破坏我在azure Synapse中的会话
pandas 
Pandas python数据帧中的日期转换中出现断言错误
pandasdatetime 
                                       





随机文章推荐



                                                        
Hash 哈希到底是什么？
hash 
Hash 使用graphviz（点）为节点创建顶部标签
hashgraphviz 
Hash 如何使用jstraverse更新
hashnode.js 
Hash 散列任意对象的正确方法
hashgo 
Hash MD5散列中可以只有数字还是只有字母？
hash 
Hash 正确实现Zobrist哈希
hash 
Hash SHA1暴力程序
hash 
Hash 保留哈希值
hash 
Hash 番石榴版本之间的哈希问题
hash 
Hash 文件修改次数是否是确定文件是否已更改的可靠方法？
hashcompilation 
Hash MariaDB虚拟列-可以给我一个哈希吗？
hashmariadb 
Hash md5sum返回不同的值，并带有“quot；“相同”；串
hashterminal 
Hash Whatsapp消息在散列后中断
hash 
Hash Docker如何计算每层的哈希值？它是确定性的吗？
hashdocker 
Hash 哈希表-插入、搜索和删除的复杂性
hashbig-o 
Hash 如何在arduino中使用哈希函数
hasharduino 
Hash 复制时通过saltstack检查文件的哈希
hash 
Hash 是否将SHA256哈希解密为原始字符串？
hash 
Hash 对于多项式表达式，什么样的散列函数才算合适？
hash


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
在python中，将错误从类传递到呈现html的正确方法是什么
									Python
							 
Python 以智能方式引用BLOB
									Python
							 									Image
							 									Google App Engine
							 
Python Django带龙卷风
									Python
							 									Django
							 
Python getpass.getpass（）错误，没有其他代码吗？
									Python
							 									Python 3.x
							 
在Python中循环类的某些变量
									Python
							 									Class
							 									Variables
							 									Loops
							 
可以用python连接MySQL数据库吗
									Python
							 									Mysql
							 
Python 如何在Django上使用GeoIP查询ASN的已知IP地址？
									Python
							 									Django
							 									Gis
							 
Python 如何让两个脚本引用同一个模块
									Python
							 									Python 2.7
							 
Python 正则表达式字匹配
									Python
							 									Regex
							 
如何在Python中命名实例变量？
									Python
							 									Python 3.x
							 
Python 我想使用matplotlib绘制日期图表，但它显示了浮点（）的nvalid文本：2014-12-10
									Python
							 									Datetime
							 									Numpy
							 									Pandas
							 									Matplotlib
							 
Python 类型错误：不支持的类型<；类型'；列表'&燃气轮机；书面形式
									Python
							 									Excel
							 									Pandas
							 									Dataframe
							 
Python请求头未正确设置
									Python
							 
在python列表中打印海龟图形
									Python
							 									List
							 
Python Keras中的Theano图形打印
									Python
							 									Keras
							 
Python 如何从数据库获取电子邮件配置？
									Python
							 									Django
							 
Python 从OpenCV到PyQt获取网络摄像头镜头
									Python
							 									Opencv
							 
Python Django:过滤查询集，然后计数
									Python
							 									Django
							 
使用try和，除了在python中使用google api搜索URL
									Python
							 									Google Api
							 
提取物<；a>；Python中的内容，Selenium Webdriver
									Python
							 									Selenium
							 									Selenium Webdriver
							 
Python 使用DataFrame.apply在Pandas中使用特定列创建新列
									Python
							 									Pandas
							 									Dataframe
							 
Python itertools组合定制
									Python
							 
Python 将JSON值中的双引号转换为单引号？
									Python
							 									Json
							 									List
							 									Replace
							 
Python TensorFlow无法为Tensor'；占位符：0'；
									Python
							 									Tensorflow
							 									Deep Learning
							 
在Python中如何按日期对数据帧排序？
									Python
							 									Python 3.x
							 									Pandas
							 									Sorting
							 									Dataframe
							 
Python scikit学习DecisionTreeClassifier.tree_u2;.value做什么？
									Python
							 									Machine Learning
							 									Scikit Learn
							 
Python 首次使用时初始化属性
									Python
							 									Python 3.x
							 
Python 根据条件获取NumPy数组的连续元素组
									Python
							 									Arrays
							 									Numpy
							 
在python中向列表中的字符串添加数字
									Python
							 
Python 如何使用在应用程序中返回numpy数组的函数
									Python
							 									Pandas
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Network Programming
Quickbooks
Openstack
Listview
D3.js
Ios
Com
Drop Down Menu
Shopify
Push Notification
Scrapy
Oracle
Sharepoint
Udp
.net Core
Sugarcrm
Opengl Es
Reflection
Devexpress
Resharper
Database Design
Mapreduce
Image Processing
Compression
Gwt
Debugging
Jsf
Clearcase
Azure Devops
Docker
Sql Server 2005
Nunit
Mule
Content Management System
Perforce
Recursion
Ftp
Log4j
Javascript
Command Line
Yii2
Snmp
Templates
Pentaho
Ravendb
Html5 Canvas
Couchbase
Ios8
Apache Nifi
Actionscript 3
Passwords
Arangodb
Terminal
Unit Testing
Codenameone
Drupal 7
Animation
Ipad
Swift2
Junit
Mapping
Firefox Addon
Primefaces
Jqgrid
Inheritance
Replace
Sip
Utf 8
Azure Ad B2c
Loops
Mongoose
Encoding
C++ Cli
Core Data
Yocto
Types
For Loop
Orientdb
Plone
If Statement
Safari
Libgdx
Mfc
Swing
Properties
Sencha Touch 2
Rest
Stata
Sparql
Jakarta Ee
Logic
Mongodb
Acumatica
Stanford Nlp
C++
Http
Sql Server
Sbt
Eclipse
Excel Formula
Matplotlib
Azure Cosmosdb
Coldfusion
Verilog
Keycloak
Seo
Database
Charts
Generics
Windows 8
Swiftui
Latex
Audio
Routing
Usb
Google Sheets
Cluster Computing
Programming Languages
Jestjs
Puppet
3d
Grid
Isabelle
Markdown
Aws Lambda
Unity3d
Clang
Twitter Bootstrap
Bison
Service
Ibm Mobilefirst
Office365
Eclipse Rcp
Google Drive Api
Sqlalchemy
Jasper Reports
Doctrine Orm
Cuda
Electron
Qt
Macros
Odoo
Silverstripe
Phantomjs
Sapui5
Antlr
Redis
Notepad++
Google Cloud Firestore
Sharepoint 2007
Project Management
Mips
D
Printing
Spotify
Mvvm
Sprite Kit
Spring Mvc
Iphone
Snowflake Cloud Data Platform
Formatting
Robotframework
Terraform
Parsing
Ms Access
EmptyTag
Ldap
Erlang
Rspec
Tableau Api
Download
Lua
Pandas
Openerp
Cookies
Geometry
Model View Controller
Design Patterns
File Upload
Networking
List
Ssh
Pascal
Ssas
Typo3
Math
Air
Cygwin
Amazon Ec2
Wso2
Arduino
Mercurial
Module
Fluent Nhibernate
Active Directory
Syntax
Gradle
Visual Studio 2013
Ip
Vim
Openssl


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网