Python pandas: category dtype and filter - Fatal编程技术网


Python pandas: category dtype and filter

python pandas filter


Using pandas 0.18.1, I noticed different behavior when filtering a column of dtype category. Here is a minimal example:

import pandas as pd
import numpy as np

l = np.random.randint(1, 4, 50)
df = pd.DataFrame(dict(c_type=l, i_type=l))
df['c_type'] = df.c_type.astype('category')

df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 50 entries, 0 to 49
Data columns (total 2 columns):
c_type    50 non-null category
i_type    50 non-null int64
dtypes: category(1), int64(1)
memory usage: 554.0 bytes

However, applying the same filter to the category-typed column keeps the filtered-out value as an entry:

df[df.c_type.isin([1, 2])].c_type.value_counts()

2    20
1    17
3     0
Name: c_type, dtype: int64
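The contrast with an integer column can be reproduced with a minimal sketch like the one below. The fixed list `values` is an illustrative assumption that stands in for the random data, so the counts differ from the output above; the integer-column result is included only for comparison.

```python
import pandas as pd

# Fixed values (an illustrative assumption) so the counts are reproducible.
values = [1, 1, 2, 3, 3, 1]
df = pd.DataFrame({"c_type": values, "i_type": values})
df["c_type"] = df["c_type"].astype("category")

# The same boolean mask is applied to both columns.
mask = df["i_type"].isin([1, 2])

# Integer column: the excluded value 3 does not appear in the result at all.
int_counts = df.loc[mask, "i_type"].value_counts().to_dict()

# Category column: 3 is still a declared category, so it is reported with count 0.
cat_counts = df.loc[mask, "c_type"].value_counts().to_dict()

print(int_counts)
print(cat_counts)
```

Filtering removes rows but leaves the column's declared categories untouched, which is why the category column still reports an entry for 3.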

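Although the filter works, this behavior seems unusual to me. For example, the filter might be used to exclude future columns from a pivot_table call, which then needs additional filtering when working with category columns.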

Is this the expected behavior?

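This is the expected behavior, according to the pandas documentation on categorical data:

Series methods like Series.value_counts() will use all categories, even if some categories are not present in the data.

So if you filter by 5 (a value that does not exist), you get 0 for every category: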

print(df[df.c_type.isin([5])].c_type.value_counts())

3    0
2    0
1    0
Name: c_type, dtype: int64

The documentation illustrates this with the following example:

In [100]: s = pd.Series(pd.Categorical(["a","b","c","c"], categories=["c","a","b","d"]))

In [101]: s.value_counts()
Out[101]:
c    2
b    1
a    1
d    0
dtype: int64

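If the zero-count entries are unwanted, two common ways to drop them are sketched below. This is not part of the original answer. The frame, the `row` and `value` columns, and the variable names are hypothetical, and the `observed` argument assumes a pandas release considerably newer than the 0.18.1 mentioned in the question.

```python
import pandas as pd

# Hypothetical data for illustration only.
df = pd.DataFrame({
    "row": ["a", "a", "b", "b", "a", "b"],
    "c_type": [1, 1, 2, 3, 3, 1],
    "value": [10, 20, 30, 40, 50, 60],
})
df["c_type"] = df["c_type"].astype("category")

filtered = df[df["c_type"].isin([1, 2])]

# Option 1: drop the categories that no longer occur after filtering.
trimmed = filtered["c_type"].cat.remove_unused_categories()
trimmed_counts = trimmed.value_counts().to_dict()

# Option 2: ask groupby to report only the categories actually observed.
observed_sizes = filtered.groupby("c_type", observed=True).size().to_dict()

# pivot_table takes the same argument, so category 3 gets no column.
table = pd.pivot_table(
    filtered,
    index="row",
    columns="c_type",
    values="value",
    aggfunc="sum",
    observed=True,
)
pivot_columns = list(table.columns)

print(trimmed_counts)
print(observed_sizes)
print(pivot_columns)
```

Option 1 changes the column's dtype metadata, so it suits cases where the removed categories are gone for good. Option 2 leaves the categories in place and only affects the individual aggregation.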
