Python 多项式中的分类变量_Python_Scikit Learn_Classification_Categorical Data - Fatal编程技术网

Python 多项式中的分类变量

python scikit-learn

Python 多项式中的分类变量,python,scikit-learn,classification,categorical-data,Python,Scikit Learn,Classification,Categorical Data,我是Python新手，这里有一个关于在多项式nb中设置X_列的简单问题因此，我想使用两个分类特征预测一个目标（“A1”、“A2”、“A5”）：工作日，具有7个唯一值（'Mon'、'Tue'、'Wed'、'Thu'、'Fri'、'Sat'、'Sun'），以及具有5个唯一值（'a'、'B'、'C'、'D'、'E'）的位置以下是我正在做的：使用pd.getdummies将工作日和位置转换为二进制输入向量使用LabelEncoder将目标转换为数值将数据拆分为培训/测试然后我执行以下操作

我是Python新手，这里有一个关于在多项式nb中设置X_列的简单问题

因此，我想使用两个分类特征预测一个目标（“A1”、“A2”、“A5”）：工作日，具有7个唯一值（'Mon'、'Tue'、'Wed'、'Thu'、'Fri'、'Sat'、'Sun'），以及具有5个唯一值（'a'、'B'、'C'、'D'、'E'）的位置
以下是我正在做的：

使用
pd.getdummies
将工作日和位置转换为二进制输入向量

使用
LabelEncoder
将目标转换为数值

将数据拆分为培训/测试

然后我执行以下操作（使用scikit学习）：
因此，我的问题是：

以上步骤正确吗？特别是，“获取虚拟对象”是处理分类特征的最佳方法吗？
通过执行上述操作，
```
X_列
```
将成形（N_样本，12），其中12来自7（矢量化工作日）和5（矢量化位置）。对于此问题，此设置是否正确

非常感谢

一般来说，你的步骤听起来是正确的。我确实发现

get_dummies（）

是为scikit learn准备无序分类功能的最简单方法。但是，您实际上可以删除每个分类变量的“基线级别”，将功能的数量减少到10（6+4而不是7+5）。此外，请记住，如果您有一个有序的分类功能，将其作为一个功能保留并将类别转换为“合理”的数值可能是有意义的
在的第2部分中，我将展示上述所有内容的示例

clf=MultinomialNB() clf.fit(X_train,y_train)

[scikit learn]相关文章推荐

Scikit learn 如何使用sklearn LogisticRegression启用多核处理？ scikit-learn

Scikit learn 如何在SKR中对记录进行加权？ scikit-learn

Scikit learn 如何在简历中使用scikit学习文档中提到的TimeSeriesSplit scikit-learn

Scikit learn 如何将数据分成三部分，其中一部分不使用？ scikit-learn

Scikit learn scikit学习管道：PCA后的归一化会产生不需要的随机结果 scikit-learn

Scikit learn 如何使用'；max#u功能'；与RFECV组合时是否在Gridsearch中？ scikit-learn

Scikit learn sklearn群集标签的格式是什么？ scikit-learn

Scikit learn GridSearchCv管道多输出分类器和XGBoostClassifier-如何通过提前停止和评估集？ scikit-learn

Scikit learn 将xgboost.Booster的实例转换为实现scikit学习API的模型 scikit-learn

Scikit learn 用GridSearchCV拟合三次多项式系数 scikit-learn

随机文章推荐

Push notification 使用生产证书时，APNS php无法发送给多个收件人 push-notification

Push notification 从主函数中注册两个应用程序 push-notification blackberry

Push notification 推送通知时的徽章编号？--活码 push-notification

Push notification Worklight:Android上的错误推送通知 push-notification ibm-mobilefirst

Push notification iOS和android都使用哪种推送通知方法 push-notification

Push notification 根据新的API，APNS反馈服务是否不再存在？ push-notification

Push notification 如何在Bitnami解析服务器中创建应用程序 push-notification

Push notification 在WL 7.1中按设备\用户ID订阅\更新标签订阅 push-notification tags ibm-mobilefirst

Push notification 一次将通知推送到多个设备 push-notification notifications

Push notification Ionic2推送通知页面移动 push-notification ionic2

Push notification 在iOS 10中，UNUserNotificationCenter'；用户的授权状态错误 push-notification

Push notification 仅当新邮件进入Gmail收件箱时接收推送通知 push-notification google-api

Push notification 如何向5个以上的主题发送FCM消息？ push-notification notifications

[python]相关推荐

Python 在抓取网页时调用javascript函数
Python

如何在OS X 10.8上更正Python/pip配置？
Python Macos Unix Pip

Python 如何使用matplotlib轻松启用/禁用共享轴
Python Matplotlib Plot

Python 使用类似jsbin的散列处理URL
Python Regex Django Url

记忆一个函数，使其不'；在Python中重新运行文件时，我不会重置
Python

Python 多维度情绪分析API，即积极性、情绪性等
Python Machine Learning Nlp

Python 如何将timeit与包含EOL字符的命令一起使用？
Python Windows Cmd

Python 错误：没有名为gtk.glade的模块
Python Gtk

Python django:需要修改已安装包中的视图
Python Django

为什么'；调用守护进程时是否返回Python check_output（）？我有一个Python V3.4应用程序，使用 CuffiOutPuxor（）/Cuffe >调用C++应用程序，调用原代码> For（）/，原始进程退出，子进程继续。似乎check_output（）也在等待子进程，而不是在主进程返回并表示守护进程已成功启动后返回我需要改变C++中的代码> > FoK（）/Cyto>或者Python < Cudio> CuthOutOutlook（）/Cuth>调用需
Python C++

Python 使用RDFib创建数据转储并将数据添加到图形中而无需迭代
Python Sparql

Python 错误：尚未支持命名空间包：跳过包'；pywintypes'；
Python Windows

Python django同步id'；他来自同一张桌子
Python Json Django

如何设计python过滤器函数，使其能够使用addition属性进行排序？
Python Python 2.7

Python 读取源代码后，运行Sphinx html make会被卡住
Python Pycharm Python Sphinx

Python 熊猫合并5个csv文件，其中只有1个不同的列名
Python Csv Pandas Merge

Python 在线性权重索引Tensorflow中查找字符串到散列桶的索引
Python Tensorflow

Python 将SQL查询转换为数据帧
Python Pandas

Python Unicode错误-'；utf-8'；can'；不能解码.py文件中的字节，但可以在交互环境中解码
Python Visual Studio Python 3.x Unicode Visual Studio 2015

分裂列python
Python Pandas

Python：计算坐标相对于周围坐标的百分比
Python Arrays Numpy Dictionary

Python 熊猫数据透视表和Matplotlib栏
Python Pandas Matplotlib

Python Lua：动态确定对象是否为；“类”；或；“实例”；
Python Lua

Python 想要一个aws Dynamodb的计数器吗
Python Amazon Web Services Lambda Amazon Dynamodb Aws Lambda

导入cntk在VS2017中的工作”；python环境“；但不在“中”；python项目“；
Python Visual Studio 2017

Python 如何从EB响应中删除标头
Python Amazon Web Services Docker Flask

Python 将字符串转换为列表，但在满足某些条件时连接元素
Python String Python 3.x List

Python 如何两次分解（或弹出）JSON数组
Python Pandas Dataframe

Python 尝试从引用数据帧返回数据帧中的记录
Python Pandas

Python 多键按键游戏
Python

Tags

Sms Mercurial Google App Maker Xml Sip Rxjs Vhdl Imagemagick Hybris Eclipse Rcp Paypal Jdbc Google Maps Api 3 Vuejs2 Javafx Dataframe Eclipse Jetty Joomla Pagination Parameters Notepad++ Virtual Machine Linq To Sql Protocol Buffers Itext Wcf Floating Point Filter Reporting Services Highcharts Frameworks Tree Amazon Redshift Boost Jqgrid Polymer Debian Atom Editor Ios7 Octave D Playframework 2.0 Google Chrome Extension Clearcase Jquery Plugins Teamcity Microsoft Graph Api Sql Server Compression Path Woocommerce Wpf Drupal 6 Haskell Silverlight Iframe Machine Learning Cassandra Delphi Mqtt Symfony1 C Big O Angular Material Firefox Addon Three.js Autodesk Forge Hbase Mariadb Jira Pip Activemq Air Aurelia Asp.net Mvc 5 Asp.net Socket.io Dask C++ Teradata Methods Azure Sql Database Rest Android Fragments Docker Compose Database Design Openshift Redis Web Openid Wicket Passwords Loops Apache Spark Types Bash Blackberry Xampp Docusignapi Azure Data Factory Visual Studio 2010 Geolocation Facebook Wolfram Mathematica Google Sheets Office365 Keyboard Plot Racket Arrays Ajax Typescript Ssrs 2008 Ftp Syntax Scroll Directory Macros Couchbase Iis 7 Emacs Internet Explorer Github Angular Sql Server 2005 Templates Twig Content Management System Sql String Logic Jsf Antlr4 Datatables Asynchronous Twitter Google Cloud Firestore Mfc Ethereum Ocaml Netty Talend Deep Learning Replace Tcl Spring Cloud Glsl Windows 7 Ruby On Rails 4 Prometheus Iphone Xsd Nest Yii Numpy Documentation List Ibm Midrange Silverstripe Mapping Printing Assembly Matlab Youtube Maven Shell Artificial Intelligence Udp Fiware Pine Script Checkbox Cron Spring Security File Gulp Testing Ravendb Bluetooth Javascript Amazon S3 Gps Terraform Directx Bots Cobol Shiny Testng Eclipse Plugin Python Sphinx Iis Smalltalk Tkinter Nginx Lotus Notes Django Ffmpeg Sass Karate Fullcalendar Mvvm

Copyright © 2024. All Rights Reserved by - Fatal编程技术网