Python: cumulative count of unique IDs per date (Python, Pandas, NumPy) - Fatal编程技术网


python pandas numpy


Suppose I have the following DataFrame:

Date          ID   
2019-06-01    A
2019-06-01    B
2019-06-01    B
2019-06-02    A
2019-06-02    C
2019-06-03    C
2019-06-03    A

What is the most Pythonic way to get the cumulative count of unique IDs per date, like this:

Date          ID   
2019-06-01    2
2019-06-02    3
2019-06-03    3

I could loop over the dates with a for loop and use np.isin to find the new IDs on each date, but that performs far too poorly.

Thanks.
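For reference, the loop described in the question might look roughly like the sketch below. It is only a guess at the baseline (the question does not show its code); the function name cumulative_unique_loop and the way the sample frame is built are assumptions made for illustration.

```python
import numpy as np
import pandas as pd

# Sample data from the question.
df = pd.DataFrame({
    "Date": ["2019-06-01", "2019-06-01", "2019-06-01",
             "2019-06-02", "2019-06-02", "2019-06-03", "2019-06-03"],
    "ID": ["A", "B", "B", "A", "C", "C", "A"],
})

def cumulative_unique_loop(frame):
    """Naive baseline: visit each date and grow an array of IDs seen so far."""
    seen = np.array([], dtype=object)
    counts = {}
    for date, ids in frame.groupby("Date")["ID"]:
        day_ids = ids.drop_duplicates().to_numpy(dtype=object)
        # Keep only the IDs that have not appeared on an earlier date.
        new_ids = day_ids[~np.isin(day_ids, seen)]
        seen = np.concatenate([seen, new_ids])
        counts[date] = len(seen)
    return pd.Series(counts, name="ID")

baseline = cumulative_unique_loop(df)
print(baseline)
```

Each iteration compares the day's IDs against everything seen so far, which is likely why this gets slow with many dates and tens of thousands of IDs.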

Let's do it this way:

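# Collect the IDs of each date into a list, then concatenate the lists cumulatively
s = df.groupby('Date')['ID'].agg(list).cumsum()
# Count the distinct IDs in each cumulative list
s = s.map(lambda x : len(set(x))).reset_index()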
s
         Date  ID
0  2019-06-01   2
1  2019-06-02   3
2  2019-06-03   3
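The same idea can also be written with a running set union instead of concatenated lists, which avoids rebuilding an ever longer list for every date. This is a variant sketched here for illustration, not the answer's own code; the names daily_sets and running are assumptions.

```python
from itertools import accumulate

import pandas as pd

# Sample data from the question.
df = pd.DataFrame({
    "Date": ["2019-06-01", "2019-06-01", "2019-06-01",
             "2019-06-02", "2019-06-02", "2019-06-03", "2019-06-03"],
    "ID": ["A", "B", "B", "A", "C", "C", "A"],
})

# One set of IDs per date, in sorted date order.
daily_sets = df.groupby("Date")["ID"].apply(set)

# Running union: each element is the set of all IDs seen up to that date.
running = accumulate(daily_sets, set.union)

result = pd.Series(
    [len(ids) for ids in running],
    index=daily_sets.index,
    name="ID",
).reset_index()
print(result)
```

The cumulative sets still grow with the number of distinct IDs, so memory use is comparable to the list version; the gain is mainly that duplicates are dropped as early as possible.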


Try groupby().nunique together with cumsum().

Output:
2019-06-01    2.0
2019-06-02    3.0
2019-06-03    3.0
Freq: D, Name: ID, dtype: float64
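The code for this answer did not survive on this page, so the following is only one plausible way to combine groupby().nunique with cumsum(), not a reconstruction of the original. It assumes that each ID should be counted on the first date it appears; the names first_seen, new_per_day and all_dates are made up for the sketch.

```python
import pandas as pd

# Sample data from the question, with Date parsed as datetime.
df = pd.DataFrame({
    "Date": pd.to_datetime([
        "2019-06-01", "2019-06-01", "2019-06-01",
        "2019-06-02", "2019-06-02", "2019-06-03", "2019-06-03",
    ]),
    "ID": ["A", "B", "B", "A", "C", "C", "A"],
})

# Keep only the first (earliest) row of every ID.
first_seen = df.sort_values("Date").drop_duplicates("ID")

# Number of IDs that appear for the first time on each date.
new_per_day = first_seen.groupby("Date")["ID"].nunique()

# Dates that introduce no new ID vanish from new_per_day, so put them back
# with a count of 0 before accumulating.
all_dates = pd.Index(df["Date"].unique()).sort_values()
cumulative = new_per_day.reindex(all_dates, fill_value=0).cumsum()
print(cumulative)
```

Without the drop_duplicates step, a plain groupby('Date')['ID'].nunique().cumsum() would count an ID again on every date it shows up, which overstates the totals.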


Somehow this gives me the wrong result. For example, in my case: A = unique IDs on the first day, size 40941. B = unique IDs on the second day, size 28262. C = B[~np.isin(B, A)], size 12114, so C + A = 53055. The code, however, gave me 3519 unique IDs for the first day and 5486 for the second.
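The manual check in this comment relies on the identity |A ∪ B| = |A| + |B \ A| (here 40941 + 12114 = 53055). A small sketch of that check on the question's sample data is shown below; it assumes the same Date and ID column names, and it can be pointed at the real frame to compare any candidate solution against the expected totals for the first two dates.

```python
import numpy as np
import pandas as pd

# Sample data from the question.
df = pd.DataFrame({
    "Date": ["2019-06-01", "2019-06-01", "2019-06-01",
             "2019-06-02", "2019-06-02", "2019-06-03", "2019-06-03"],
    "ID": ["A", "B", "B", "A", "C", "C", "A"],
})

dates = sorted(df["Date"].unique())

# A: unique IDs on the first date, B: unique IDs on the second date.
A = df.loc[df["Date"] == dates[0], "ID"].drop_duplicates().to_numpy(dtype=object)
B = df.loc[df["Date"] == dates[1], "ID"].drop_duplicates().to_numpy(dtype=object)

# C: IDs of the second date that did not appear on the first date.
C = B[~np.isin(B, A)]

expected_day1 = len(A)
expected_day2 = len(A) + len(C)

# Cross-check with plain Python sets.
assert expected_day2 == len(set(A) | set(B))
print(expected_day1, expected_day2)
```

If a vectorized solution disagrees with these two numbers on the real data, it is worth checking whether the Date column contains more distinct values than expected (for example timestamps rather than calendar days), since every distinct value forms its own group.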



