Python pandas: filtering a very large DataFrame using regular expressions from another DataFrame

python pandas dataframe

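I have a huge CSV (df1) with almost 3 million rows, and another CSV (df2) with 15k rows. I want to get the rows of df1 that satisfy the conditions listed in df2. I wrote something, but it takes a very long time to finish. What the code does:

The code adds a "Reason" column to df1 for the rows that satisfy a condition in df2, and it attaches the indexes of those df1 rows to the corresponding condition in df2.

Please help me reduce the execution time. Changing the algorithm to gain speed is also welcome. Example (full code below):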



Thanks,
Krism
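Your code doesn't seem to produce what you expect. After running it, I get df1['Reason'] = 'sure' where Pin == 'aaa'.

Hi @QuangHoang, :) Pin == 'aaa' satisfies the conditions of both [5, "a+", "Im"] and [2, ".*", "sure"], and the most recently applied "Reason" is the one assigned to df1["Reason"].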
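import pandas as pd

def waived(row, df):
    # Rows of df whose Violation is below this rule's Limit and whose Pin matches the rule's regex
    temp = (df.Violation < float(row["Limit"])) & (df.Pin.str.contains(row["Pin"]))
    if temp.any():
        # A later rule overwrites the Reason written by an earlier one
        df.loc[temp, "Reason"] = row["Reason"]
        return df[temp].index.tolist()

df1 = pd.DataFrame({'Violation': [0.5, 1, 2, 3, 4, 5, 6], 'Pin': ["kkk", "aaa", "bbb", "ccc", "abc", "xyz", "abcdef"]}, index=[0, 1, 2, 3, 4, 5, 6])
df2 = pd.DataFrame({'Limit': [5, 3, 2], 'Pin': ["a+", "bb*", ".*"], "Reason": ["Im", "not", "sure"]}, index=[0, 1, 2])
# Each rule receives the list of df1 indexes it matched (None if it matched nothing)
df2["Indexes"] = df2.apply(lambda row: waived(row, df1), axis=1)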
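One possible direction for the speed-up asked about above, offered as a rough sketch rather than a tested solution: if the Pin values in df1 repeat a lot (an assumption, the question does not say), each regex can be evaluated once per distinct Pin and then mapped back to the rows with isin, instead of being run against all 3 million rows. The function name apply_rules and the variables reason and matched are made up for this illustration, and nothing here has been benchmarked on data of the size described.

```python
import pandas as pd

def apply_rules(df1, df2):
    """Hypothetical helper: same outcome as calling `waived` rule by rule,
    but each regex is run only on the distinct Pin values of df1."""
    unique_pins = pd.Series(df1["Pin"].unique())
    reason = pd.Series(index=df1.index, dtype=object)  # NaN until a rule matches
    matched = []
    for rule in df2.itertuples(index=False):
        # Regex work is limited to the (assumed much smaller) set of unique pins
        hits = unique_pins[unique_pins.str.contains(rule.Pin, na=False)]
        mask = df1["Pin"].isin(hits) & (df1["Violation"] < float(rule.Limit))
        if mask.any():
            # As in the original code, a later rule overwrites an earlier one
            reason.loc[mask] = rule.Reason
            matched.append(df1.index[mask].tolist())
        else:
            matched.append(None)
    return reason, matched

# Sample data copied from the question
df1 = pd.DataFrame(
    {"Violation": [0.5, 1, 2, 3, 4, 5, 6],
     "Pin": ["kkk", "aaa", "bbb", "ccc", "abc", "xyz", "abcdef"]},
    index=[0, 1, 2, 3, 4, 5, 6],
)
df2 = pd.DataFrame(
    {"Limit": [5, 3, 2], "Pin": ["a+", "bb*", ".*"], "Reason": ["Im", "not", "sure"]},
    index=[0, 1, 2],
)

reason, matched = apply_rules(df1, df2)
df1["Reason"] = reason
df2["Indexes"] = matched
```

If nearly every Pin in df1 is distinct, this gains little, because the regex still has to be checked against almost as many strings as before; in that case the 15k-rule loop itself is the cost to attack.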



