Python 熊猫：如何总结数据、分组、独特_Python_Pandas - Fatal编程技术网

Python 熊猫：如何总结数据、分组、独特

python pandas

Python 熊猫：如何总结数据、分组、独特,python,pandas,Python,Pandas,以下是我正在使用的数据帧： company | pc-serial | software --------+-----------+-------------------- A | 1 | Word A | 1 | Excel A | 2 | Word A | 3 | PowerPoint B | 4 | Word B | 4 |

以下是我正在使用的数据帧：

company | pc-serial | software
--------+-----------+--------------------
A       | 1         | Word
A       | 1         | Excel
A       | 2         | Word
A       | 3         | PowerPoint
B       | 4         | Word
B       | 4         | Excel
B       | 4         | Visio
B       | 5         | Word
B       | 5         | PowerPoint

我想建立一个新的数据框架，告诉我每个公司拥有的独特软件的数量，结果应该是：

company | unique_sw
--------+--------------
A       | 3
B       | 4

A有3个（Word、Excel和PowerPoint），B有4个（Word、Excel、PowerPoint和Visio）

我尝试了

df.groupby（'company'）['software'].count（）

它给出了所有软件A有4个，B有5个的计数。如果我更改

unique（）

的

count（）

，它将首次出现“软件”

因此，我不知道如何汇总这些信息。

请使用以下信息：

df.groupby('company')['software'].nunique()

改用这个：

df.groupby('company')['software'].nunique()

或者您可以通过

删除重复项来修复代码
df.drop_duplicates(['company','software']).groupby('company').software.count()
Out[690]: 
company
A    3
B    4
Name: software, dtype: int64

或者您可以通过删除重复项来修复代码
df.drop_duplicates(['company','software']).groupby('company').software.count()
Out[690]: 
company
A    3
B    4
Name: software, dtype: int64

美丽的！谢谢！美丽的！谢谢！




[pandas]相关文章推荐



                                                        
Pandas Python：使用where语句为3个标签选项中的新列设置值
pandasnumpydataframe 
使用pandas数据帧将数据集加载到python中
pandas 
使用Pandas TimeGrouper时在列上更改应用函数
pandas 
Pandas 多次过滤具有多个条件的数据帧
pandasdataframefilter 
Pandas 如何在matplotlib中绘制具有两个不同轴的数据帧
pandasmatplotlib 
Pandas 如何将系列列表转换为两列数据帧？
pandaslistdataframe 
Pandas 根据给定的上下文检索单词例如，给定一份工作描述，我需要找到与技能相关的单词
pandasmachine-learningnlpcomputer-science 
Pandas 将kdb表保存到数据帧，然后将数据帧保存到csv。输出到csv的空值和字符串值不正确？
pandaskdb 
Pandas 将数据映射到地面真相列表
pandas 
Pandas 如何从存储在数据框中的每个箱子的边缘和值生成热图？
pandasmatplotlib 
Pandas 将覆盖旧数据的部分重叠JSON文件与最新数据合并
pandasdataframejoinmerge 
Pandas 按数据帧中的分隔符将列拆分为未知数量的列
pandas 
Pandas 如何在迭代数据帧时插入列？
pandas 
Pandas 多索引交叉表
pandas 
Pandas 6百万行的性能问题
pandasperformancejupyter-notebook 
Pandas `替换为'didn'；数据帧无法正常工作

范围索引：2356个条目，0到2355
数据列（共4列）：
#列非空计数数据类型
---  ------   --------------  ----- 
0 doggo 2356非空对象
1 Floorfer 2356非空对象
2木偶2356非空对象
3 puppo 2356非空对象
数据类型：对象（4）
内存使用率：73.8+KB
pandas 
使用pandas.style.to_excel（）时，如何缩短时间？
关于熊猫的问题
pandas 
Pandas 替换为loc将引发无效的类型比较
pandas 
Pandas 错误'；预期的元组，得到str'；在数据帧连接中
pandas 
Pandas pyspark使用一列元组列表从熊猫创建数据帧
pandasdataframeapache-sparkpyspark 
                                       





随机文章推荐



                                                        
Twitter bootstrap 3 使用ASP.NET MVC 5和Bootstrap 3默认安装，为什么样式会出现403错误，字体会出现404错误？
twitter-bootstrap-3asp.net-mvc-5 
Twitter bootstrap 3 Bootstrap 3中的响应背景图像
twitter-bootstrap-3 
Twitter bootstrap 3 旋转指示器到目标幻灯片
twitter-bootstrap-3 
Twitter bootstrap 3 yeoman webapp-grunt字体和引导字体文件复制任务
twitter-bootstrap-3gruntjs 
Twitter bootstrap 3 引导导航栏下拉错误
twitter-bootstrap-3 
Twitter bootstrap 3 折叠的引导菜单一旦打开，就不会再次关闭
我正处于从Bootstrap 2.3.2转换到3.3.2的应用程序的中间。我大部分时间都在工作，但是我的导航栏的折叠版本有问题。当我在手机上或缩小桌面窗口的缩小屏幕上时，我会看到折叠的菜单按钮，如果我单击该按钮，它会打开菜单，但再次单击它不会关闭菜单
twitter-bootstrap-3 
Twitter bootstrap 3 引导表中的多选择行
twitter-bootstrap-3 
Twitter bootstrap 3 带引导模式的Laravel Javascript
twitter-bootstrap-3 
Twitter bootstrap 3 AdSense在Twitter上发布的固定大小广告3
twitter-bootstrap-3responsive-design 
Twitter bootstrap 3 旋转木马上的响应性徽标
twitter-bootstrap-3 
Twitter bootstrap 3 设置引导弹出框的宽度不工作
twitter-bootstrap-3 
Twitter bootstrap 3 引导折叠在第二次单击后未折叠
twitter-bootstrap-3 
Twitter bootstrap 3 引导3嵌套手风琴工作不正常
twitter-bootstrap-3 
Twitter bootstrap 3 Xpages的引导登录
twitter-bootstrap-3xpages 
Twitter bootstrap 3 引导日期时间选择器不工作
twitter-bootstrap-3 
Twitter bootstrap 3 引导网格系统在我的网站中不工作
twitter-bootstrap-3 
Twitter bootstrap 3 Bootstrap 3等高缩略图（类似于Bootstrap 4中的等高卡）
twitter-bootstrap-3 
Twitter bootstrap 3 具有8个不同高度的分区的引导栅格
twitter-bootstrap-3responsive-design 
Twitter bootstrap 3 滚动条未显示
twitter-bootstrap-3 
Twitter bootstrap 3 引导4稳定吗？它还在开发中吗？
twitter-bootstrap-3bootstrap-4


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
批次当量为；资料来源；在Windows上：如何从virtualenv运行Python脚本
									Python
							 									Windows
							 									Scripting
							 									Batch File
							 
Python测试用例中的非测试方法
									Python
							 									Unit Testing
							 
Python的cProfile无法识别函数名
									Python
							 
Python ImportError:没有名为xlwt的模块
									Python
							 
Python 从mysqldb查询中获取原始十进制值
									Python
							 									Mysql
							 
Python代码：几何布朗运动-what'；怎么了？
									Python
							 
Python 剪贴板内容更改时触发事件
									Python
							 									Multithreading
							 									Macos
							 									Events
							 
Python Django-自定义管理操作日志
									Python
							 									Django
							 
Python 如何使用django仅呈现包含数据的部分html
									Python
							 									Django
							 
Python Flask RESTful跨域问题与Angular:PUT、OPTIONS和methods
									Python
							 									Angularjs
							 									Flask
							 									Cors
							 
如何在NPM安装期间使用不同版本的python？
									Python
							 									Node.js
							 									Centos
							 									Npm
							 
如何在Python中获得完全限定的主机名？
									Python
							 
Python 如何将批量数据上载到appengine数据存储？旧方法不起作用
									Python
							 									Google App Engine
							 									Google Cloud Storage
							 
python中使用pandas dataframe与statsmodels或scipy进行方差分析？
									Python
							 									Pandas
							 
使用Python urllib/urllib2发出http POST请求以上载文件
									Python
							 									Http
							 									Post
							 
Python 超级初始化与父级初始化__
									Python
							 									Python 2.7
							 									Inheritance
							 
使用SubmiverePL运行python代码时重用选项卡
									Python
							 									Sublimetext3
							 
Python中的pip是什么？
									Python
							 									Python 3.x
							 									Module
							 
Python argparse：展平操作的结果='；追加'；
									Python
							 
Python 将Keras model.summary（）对象转换为字符串
									Python
							 									Deep Learning
							 									Keras
							 
Python 如何在数据帧中展开列
									Python
							 									Pandas
							 									Dataframe
							 
Python 从dict创建数据帧，其中键是元组
									Python
							 									Pandas
							 
Python ImportError:没有名为pandas的模块。安装了pip的熊猫
									Python
							 									Macos
							 									Pandas
							 
Python：合并多个数据帧
									Python
							 									Pandas
							 									Dataframe
							 									Merge
							 
Python 如何将numpy数组列表加载到pytorch数据集加载程序？
									Python
							 									Numpy
							 									Pytorch
							 
Python Django:datetime按日期筛选忽略时间
									Python
							 									Django
							 
Python 一边是系数的相关矩阵图，另一边是散射图，对角线上是分布图
									Python
							 									Pandas
							 
Visual Studio代码Python等待调试器连接超时
									Python
							 									Debugging
							 									Visual Studio Code
							 
Python 功能名称在xgboost中不匹配，尽管有相同的列
									Python
							 
macOS Catalina:Python意外退出错误
									Python
							 									Macos
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Linker
Android
Active Directory
Vmware
Facebook
Eclipse
Select
Sqlalchemy
Solr
Cookies
Woocommerce
Timer
Ajax
Oracle
Autocomplete
Hyperledger Fabric
Fullcalendar
Gwt
Coldfusion
Unity3d
Aurelia
Permissions
Ip
Excel Formula
Gridview
Security
Erlang
Nestjs
Sharepoint 2010
Asp.net Mvc
Compiler Errors
Ruby On Rails 4
Css
Bluetooth
File
Parse Platform
Memory
Angular6
Neo4j
Vuejs2
Amazon Web Services
Synchronization
Windows Store Apps
Node.js
Clearcase
Apache Flink
Zend Framework2
Azure Devops
Swift2
Lotus Notes
Docusignapi
Gdb
Sublimetext3
Plsql
Apache Storm
Grep
Image
Aws Lambda
Input
Asp.net Mvc 2
Compilation
Robotframework
Wordpress
Pentaho
Fortran
Nservicebus
Memory Leaks
Zend Framework
Macros
Odata
Abap
Aframe
Sparql
Elixir
Kibana
Rspec
Templates
Windows
Pandas
C# 3.0
Discord.py
Chart.js
Csv
Azure Ad B2c
Reference
Openshift
Visual Studio 2017
Reporting Services
Asp.net Core
Sublimetext2
Ssis
Exception
Twig
Calendar
Scripting
Jekyll
Xaml
Jasper Reports
Azure Sql Database
Canvas
Visual Studio 2015
Tkinter
Neural Network
Http
Memory Management
Google Sheets
Asp Classic
Wix
Search
Sprite Kit
Scikit Learn
Json
Winforms
Web Applications
Rdf
Gatsby
Symfony1
Process
Dialogflow Es
Map
Firefox Addon
Routing
Winapi
Google Analytics
Sdk
Testing
Visual Studio 2010
Terminal
Sql Server 2005
Twitter Bootstrap 3
Qml
Apache Flex
Electron
Discord.js
Batch File
Doxygen
Ibm Mq
Actionscript
Unicode
Jenkins
Uitableview
Database Design
Xslt
Artificial Intelligence
Mapreduce
Nsis
Quickbooks
Replace
Ms Word
Excel
Audio
Opencart
System Verilog
Liferay
Firebase
Arm
Amazon Ec2
Jquery Ui
Cors
Opencv
Curl
Intellij Idea
Login
Asp.net Mvc 4
Azure Cosmosdb
Windows Phone
Prolog
Server
Random
Bootstrap 4
Groovy
Cron
Fiware
Frameworks
Raspberry Pi
Amp Html
Sbt
Import
Ssh
Twilio
Google App Maker
Multithreading
Tensorflow
Combobox
File Upload
Parsing
Web
Ldap
Tags
Object
3d
Dask
Rest
Asp.net Core Mvc
Anaconda
Jdbc
Exception Handling
Workflow
Xquery
Tfs
Google Apps Script


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网