Python 将订单与列中的项目合并_Python_Pandas_Dataframe_Expand - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/fsharp/3.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 将订单与列中的项目合并_Python_Pandas_Dataframe_Expand - Fatal编程技术网

Python 将订单与列中的项目合并

python pandas dataframe

Python 将订单与列中的项目合并,python,pandas,dataframe,expand,Python,Pandas,Dataframe,Expand,我有一个包含所有订单、客户和订单项信息的数据集。我不想在新列中扩展我的orderitems，但不会丢失有关客户的信息 CustomerId OrderId Item 1 1 CD 1 1 DVD 2 2 CD 结果应该是： CustomerId OrderId CD DVD 1 1 1 1 2 2 1 0 我试过了 df2 = pd.concat([df, pd.get_dummies(df

我有一个包含所有订单、客户和订单项信息的数据集。我不想在新列中扩展我的orderitems，但不会丢失有关客户的信息

CustomerId    OrderId    Item
1    1    CD
1    1    DVD
2    2    CD

结果应该是：

CustomerId    OrderId    CD    DVD
1    1    1    1
2    2    1    0

我试过了

df2 = pd.concat([df, pd.get_dummies(df.Item)], axis='columns')
df2 = df2.groupby('CustomerId')

更简单的是

或者，如果性能很重要，

df.pivot_table(index=['CustomerId', 'OrderId'], columns=['Item'], aggfunc='size', fill_value=0) Item CD DVD CustomerId OrderId 1 1 1 1 2 2 1 0

如果要使用假人，另一个选项是：

# Solution similar to @jezrael but with str.get_dummies (df.set_index(['CustomerId', 'OrderId']) .Item.str.get_dummies() .sum(level=[0, 1]) .reset_index()) CustomerId OrderId CD DVD 0 1 1 1 1 1 2 2 1 0
如果你需要指示灯

(df.set_index(['CustomerId', 'OrderId']) .Item.str.get_dummies() .max(level=[0, 1]) .reset_index())

像这样：
pd.crosstab（[df.CustomerId，df.OrderId]，df.Item.）.reset_index（）
我有点好奇-你需要计算重复的值，或者在输出中需要
0,1
？@jezrael在走上高速路之前，请记住这是重复的。我重新打开它并回答，因为您没有删除您的答案。是的，原因是我认为OP需要其他东西，但我错了，清除重复。。。
(df.set_index(['CustomerId', 'OrderId']) .Item.str.get_dummies() .max(level=[0, 1]) .reset_index())

[pandas]相关文章推荐

Pandas 什么'；将这些数据放入数据框的最简单方法是什么？ pandas

Pandas 计算值1在转换为数据帧的VCF的每一行中出现的次数 pandas dataframe

Pandas 带Bokeh的时间序列图表 pandas

Pandas 查询多索引数据帧 pandas dataframe

Pandas 为什么我的解析日期在Jupyter中不起作用 pandas

使用Pandas选择每个PeriodIndex中最早的行 pandas

Pandas 通过迭代将变量添加到数据帧 pandas dataframe

Pandas pd.级数中连续对的匹配 pandas

Pandas 基于for循环条件舍入数据帧列 pandas dataframe

Pandas 均值的T检验 pandas

Pandas 按分组并按第n行中的元素减去列中的每个元素 pandas

Pandas 第二个csv文件的长度不等于使用np.select的基本文件 pandas numpy

Pandas 熊猫：创建具有两个列表的堆叠条形图 pandas jupyter-notebook

Pandas 合并具有不相等行的数据帧 pandas dataframe merge

Pandas spark数据帧中的不匹配特征计数 pandas dataframe pyspark

Pandas Groupby shift（滞后值）模拟，仅带Numpy（无熊猫） pandas numpy

Pandas 设置的并集不正确 pandas dataframe

如何从pandas数据框中获取n个点的块来绘制它们的平均值 pandas for-loop matplotlib plot

Pandas 与熊猫一起阅读拉链 pandas windows

Pandas 将dataframe中的一列拆分为新列 pandas dataframe

随机文章推荐

[python]相关推荐

在Python列表中高效地查找索引（与MATLAB相比）
Python Matlab List Numpy

Python 芹菜和Django-没有名为'；django'；
Python Django

Python中的多线程还是串行处理？
Python C++ Multithreading

Python concat后熊猫无到NaN：bug还是故意行为？
Python Pandas

Python GUI实现方向
Python Multithreading Wxpython

在python中如何比较数组中的（x，y）点以提取矩形形状
Python Opencv Numpy

Python 基于条件创建新的numpy阵列
Python Performance Numpy

Python 如何创建没有聚合的简单交叉表？
Python Pandas

Python 为什么有些页面没有爬行？
Python Python 3.x Web Scraping Web Crawler

Python 从split（）获取最少数量的元素
Python String

如何知道安装了numba或tensorflow的python代码中每个块的最大线程数？
Python Tensorflow Cuda

python将语音转换为文本
Python

Python 有没有办法在DAG中找到任何随机任务所花费的时间？
Python Google Cloud Platform Airflow

Python 使用堆栈计算中缀表达式时，算术错误的计算和KeyDict错误；
Python

Python 刮削困难的桌子
Python Web Scraping

Python 安全地将浮点向下转换为可能的最小整数类型
Python Pandas Numpy

在Python中使用对象作为列表索引
Python Class

Python 用于从字符串中查找解析瓶大小的正则表达式（例如750ML）
Python Regex String

Python UserSchema类对象返回空dict
Python Sqlalchemy

Python Jupyter笔记本-链接两个滑块小部件，向值添加偏移量
Python Jupyter Notebook

流畅的Python书籍，示例2.15
Python

Python &引用；参数中存在域错误。”；加上；统计数据beta.rvs“；
Python Statistics

Python 如何在django rest API中为路由器提供参数？
Python Django Django Rest Framework

Python 熊猫：计算Z分数以避免；“展望未来”；偏倚
Python Pandas

Python 绘制类别变量和日期时间之间的关系
Python Matplotlib

Python Spyder和Repl.it与Visual Studio和命令行的输出差异-[34mR与字母R（带蓝色collor）
Python

Python 如何用PyGame绘制矩形轮廓（未填充）？
Python

Python 从Jupyter笔记本中的ipyWidgets通过文件上传上传的MS Word文档中提取文本
Python Jupyter Notebook

跟踪时获取python内置函数的返回值
Python Debugging

如何使用Python在imo.im中自动发送消息？我在网上找不到imo.imapi
Python Api

Tags

Ag Grid Azure Functions Windows Phone Winforms Listview Material Ui Maps Adobe Cobol Embedded Compiler Construction Log4net Ethereum Routing Certificate Google Bigquery Plot Mercurial Sqlalchemy Google App Engine Cuda File Requirejs Appium Perl Verilog Enums Webstorm Salesforce Cloud Foundry Actionscript 3 Mapping Asp.net 3d Workflow D Xampp Scripting Haskell Service Webpack Android Studio Makefile Vue.js Twitter Json Opengl Es Influxdb .net Core Sequelize.js Wxpython Stored Procedures Fullcalendar Nativescript Properties Jasper Reports Erlang Sprite Kit Angular Sml Safari Join Sas Jvm Xna Julia Flash Google Cloud Firestore Hadoop Optimization Phpstorm Mariadb Swift Google Analytics Amazon Web Services Entity Framework 4 Swiftui Apache Kafka Ssrs 2008 Primefaces Sharepoint 2013 Layout View Gstreamer Excel Glsl Sbt Ignite Blackberry Sorting Visual Studio 2010 Formatting Cucumber Netsuite Zend Framework Jhipster Character Encoding Cocos2d Iphone Search Codeigniter Node.js Blazor Jqgrid Axapta Actions On Google Wso2 Maven Apache Flex Docusignapi Facebook Python Math Tridion Visual Studio Tableau Api Loopbackjs Sencha Touch 2 Content Management System Twitter Bootstrap Compilation Activerecord Tomcat Object Wolfram Mathematica Gnuplot Swing Uitableview Activemq Netbeans Xml Azure Ad B2c Menu Validation Awk Arm Hive E Commerce Telerik Nservicebus Paypal Weblogic Timer Next.js Class Ldap Animation Qml Swift2 Dataframe Vaadin Symfony Csv Centos Jpa Apache Nifi Ide Dictionary Hyperlink Vhdl Command Line Linq To Sql Couchbase Reflection Computer Vision Sql Server Puppet Inno Setup Ssh Bluetooth Design Patterns Instagram Collections Google Maps Api 3 Logstash Memory Text Internet Explorer 8 Zurb Foundation Ubuntu Resharper Linkedin Project Management Database Replace Biztalk Python 3.x Checkbox Sitecore Github Reactjs Deployment Openlayers 3 C++ Cli Hyperledger Fabric Opengl Rust Selenium Webdriver Statistics Parallel Processing Telegram Oauth

Copyright © 2024. All Rights Reserved by - Fatal编程技术网