Python：DF中高效的拆分列_Python_Performance_Pandas_Split - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/performance/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python：DF中高效的拆分列_Python_Performance_Pandas_Split - Fatal编程技术网

Python：DF中高效的拆分列

python performance pandas

Python：DF中高效的拆分列,python,performance,pandas,split,Python,Performance,Pandas,Split,假设我有一个DF，它包含一个表单的列 0 A.1 1 A.2 2 B.3 3 4.C 假设我只想使用“.”后面的元素，按“.”拆分这些列。一个天真的方法是 for i in range(len(tbl)): tbl['column_name'].iloc[i] = tbl['column_name'].iloc[i].split('.',1)[1] 这很有效。而且对于大桌子来说速度很慢。有人知道如何加快这个过程吗？我可以在DF中使用新列，因此我不局限于更

假设我有一个DF，它包含一个表单的列

0     A.1
1     A.2
2     B.3
3     4.C

假设我只想使用“.”后面的元素，按“.”拆分这些列。一个天真的方法是

for i in range(len(tbl)):
  tbl['column_name'].iloc[i] = tbl['column_name'].iloc[i].split('.',1)[1]

这很有效。而且对于大桌子来说速度很慢。有人知道如何加快这个过程吗？我可以在DF中使用新列，因此我不局限于更改源列（因为我在示例中重用了它）。

谢谢

对于大型数据帧，使用

map

而不是For循环必须更快：

%timeit df['newcol']  = df.column_name.map(lambda x: x.split('.')[1])
100 loops, best of 3: 10.7 ms per loop

%timeit for i in range(len(df)): df['newcol'].iloc[i] = df['column_name'].iloc[i].split('.',1)[1]
1 loops, best of 3: 7.63 s per loop

pandas

具有字符串方法，可以在不使用循环的情况下高效地执行这些操作（这会降低性能）。在这种情况下，您可以使用：

阿美-塔沃里像奇迹一样工作（缓慢的奇迹，但不是2小时的奇迹：-）@谢谢你的回答。它比本地大熊猫慢一些，但比常规的“for”循环快得多！可能在某些情况下，这种情况甚至比熊猫解析更好。非常感谢你们两位。
>> import pandas as pd >> df = pd.DataFrame({'a': ['A.1', 'A.2', 'B.3', 'C.4']}) >> df a 0 A.1 1 A.2 2 B.3 3 C.4 >> df.a.str.split('.').apply(pd.Series) 0 1 0 A 1 1 A 2 2 B 3 3 C 4

[performance]相关文章推荐

随机文章推荐

Azure ad b2c Azure AD B2C重置用户'；密码 azure-ad-b2c

Azure ad b2c Azure active directory根据电子邮件地址池或预注册帐户限制注册 azure-ad-b2c

Azure ad b2c Azure B2C新手澄清 azure-ad-b2c

Azure ad b2c 嵌套JSON作为REST API的输入/输出'；s使用Azure AD B2C自定义策略 azure-ad-b2c

Azure ad b2c 从令牌提供程序返回的身份验证令牌是否与系统令牌相同？ azure-ad-b2c

Azure ad b2c 有没有一种方法可以在本地测试Azure B2C自定义策略，而无需将XML文件上载到Azure门户？ azure-ad-b2c

[python]相关推荐

python中的unittest：忽略我要测试的代码的导入
Python Unit Testing

Python 同时打开几个文件指针，好吗？
Python File

Python 如何使用cx\U Oracle运行非查询sql命令？
Python Sql

Python中的二维列表？
Python List

Python 对列表中的一列求和（后续）
Python

Python 向服务中添加在sud中WSDL中缺少的其他元素
Python

Python 检索完整号码
Python

在创建python生成器之后更新它
Python Python 2.7

Python-使用test进行列表编译以避免被零除
Python List Numpy

Python Can'；在Django Flatpage中不使用{MEDIA_URL}}？
Python Html Django

Python 将文档的每一行拆分为n组
Python Python 2.7

Python浮点小数
Python

通过python uno接口将文件（内存中）转换为pdf
Python Pdf

Python中基于时间的for循环
Python For Loop

Python WSGI中间件是否可以修改请求主体，然后将其传递？
Python Python 3.x

Python 更新ttk.Notebook中tab开关上的框架
Python Python 2.7 Tabs Tkinter

Python 产生无限结果的字谜代码
Python Algorithm Recursion

Python 如何在return语句中将字符串计算为布尔值？
Python

Python Can'；t在单元测试中设置会话变量
Python Unit Testing Session Flask

Python 在使用线程进行数据加载培训时，在验证集上保存性能最佳的TensorFlow模型的最有效方法
Python Multithreading Tensorflow

Python 将时间序列数据输入Tensorflow进行LSTM分类器训练
Python Numpy Machine Learning Tensorflow Neural Network

Python 为什么赢了'；不要让我删除这个字符串或将其打包
Python String Canvas Time Tkinter

Python 后面只有数字的正则表达式
Python Regex

Python 基于dataframes VLookup样式的列合并两个excel文件
Python Sql Pandas

Python re.findall返回带有不需要的字符串的链接
Python

Python 基于成人收入数据集的神经网络训练精度低
Python Machine Learning Tensorflow Neural Network Deep Learning

Dataframe命令在IPython中工作，但在脚本中不工作
Python Pandas Ipython

Python 如何使用ElementMaker向元素添加属性？
Python Xml

Python 如何从powershell'；s 2线输出
Python Powershell

Python 返回Gensim Word2vec中单词的排名
Python

Tags

Acumatica Pagination Webstorm Ruby On Rails Bash Formatting Layout Elixir Streaming Indexing Sharepoint Log4net Silverlight Ms Office Visual Studio 2010 Sublimetext3 Gnuplot Teamcity Opencart Weblogic Ecmascript 6 Dialogflow Es Pentaho Jwt Tomcat Data Structures Bison Workflow Iis Stored Procedures Mqtt Html5 Canvas Kernel Ruby On Rails 3.1 Twitter Bootstrap Tsql Symfony1 Chef Infra Drupal 6 Model Forms Configuration Azure Database Design Websocket Data Binding Vagrant Google Maps 3d Jar Download Sugarcrm Scala Time Charts Sdk Internet Explorer 8 Jhipster Navigation Silverstripe Leaflet Sharepoint 2007 R Jasper Reports Unity3d Wicket Nuget Yaml Spring Boot Kubernetes Interface Mdx Android Studio Dask Cassandra Xslt Tinymce Big O Regex Antlr Statistics Yocto Z3 Collections Mapreduce Twig Discord.js Sass Open Source Ignite Postman Oauth Web Services Listview Asterisk Flash Entity Framework 4 Git Netty Plone Google Drive Api Appium Sencha Touch Stripe Payments Common Lisp Lua Testng Azure Functions Video Streaming Firefox Sails.js Imagemagick C++ Content Management System Ag Grid Sapui5 Ios6 Generics Multithreading Glsl Fullcalendar Domain Driven Design Ibm Mobilefirst Sml Swift2 Algorithm Image Javafx Go Monitoring Python 3.x Sharepoint 2013 Windows Mobile Synchronization Recursion Shell Openerp Speech Recognition .net 4.0 Vim Nginx Jakarta Ee Select Pine Script Google Cloud Storage Apache Passwords Webrtc Stream Puppet Android Javascript Pointers Openstack .net Core Sonarqube Spotify Url Rewriting Html Compiler Errors Rx Java Struts2 Tridion Processing Tags Razor Encryption Proxy Unix Jquery Plugins Bazel Gis Oracle Apex Gdb Php Google Colaboratory Exception Handling List Windows Phone 7 Jasmine Webview Instagram Logic Ibm Mq Service Map User Interface Actionscript 3 Google Plus Cors Colors Mule Xamarin.android Mongoose Google Chrome Swiftui Zend Framework Hive Time Complexity Hyperledger Fabric C# 4.0

Copyright © 2024. All Rights Reserved by - Fatal编程技术网