Machine learning 数据挖掘中的离群点处理_Machine Learning_Data Mining_Missing Data_Outliers - Fatal编程技术网

Machine learning 数据挖掘中的离群点处理

machine-learning

Machine learning 数据挖掘中的离群点处理,machine-learning,data-mining,missing-data,outliers,Machine Learning,Data Mining,Missing Data,Outliers,我在体重指数栏中有一个更突出的数据，它与其他数据相去甚远。第二个最大值为38.1，而异常值为294。它实际上是29.4，在收集数据时出错。我不想删除该行，因为我的数据数量有限。有谁能告诉我解决这个问题的最佳技术方法吗？将该值视为缺失值并应用期望最大化插补或贝叶斯多重插补等方法是否是一种好方法？请帮我解决这个问题。感谢检测不良数据，如有必要，用您喜欢的任何数据插补技术替换它当然，如果你能留下不好的数据，并设计出足够稳健的整体方法来处理这个问题，那就更好了。是的，如果它真的是一个异常值，你可以删除

我在体重指数栏中有一个更突出的数据，它与其他数据相去甚远。第二个最大值为38.1，而异常值为294。它实际上是29.4，在收集数据时出错。我不想删除该行，因为我的数据数量有限。有谁能告诉我解决这个问题的最佳技术方法吗？将该值视为缺失值并应用期望最大化插补或贝叶斯多重插补等方法是否是一种好方法？请帮我解决这个问题。感谢

检测不良数据，如有必要，用您喜欢的任何数据插补技术替换它

当然，如果你能留下不好的数据，并设计出足够稳健的整体方法来处理这个问题，那就更好了。
是的，如果它真的是一个异常值，你可以删除它并使用插补技术来替换它
在使用多重插补之前，请确保您理解多重插补的概念。如果要正确使用MI，还必须在插补本身之后更改处理步骤。（如果您使用的是WARE，您可以查看mice软件包）
如果您不想处理多个插补数据集，基于EM的插补算法是一个可靠的选择。（如果您使用的是R，您可以查看VIM或imputeR软件包）

[deployment]相关文章推荐

Deployment msbuild的SFTP任务？ deployment msbuild

Deployment Nginx位置、别名、重写、根 deployment nginx

Deployment 什么'；从DVCS克隆运行实时站点有什么不对？ deployment

Deployment 部署时如何排除属性文件 deployment maven-2 jar

Deployment 使用pscp将war文件复制到远程服务器 deployment

Deployment 什么'；使用EC2可用性区域的最佳实践是什么？ deployment amazon-ec2

Deployment 如何检查我的UML部署图是否符合UML？ deployment uml

Deployment Maven 3站点描述符问题：部署工件不工作或站点未构建 deployment maven

Deployment 在不安装JRE的情况下运行Java可执行文件？ deployment java

Deployment 管理mvc应用程序的发布配置文件 deployment asp.net-mvc-4 visual-studio-2012

Deployment Teamcity MSBuild将生成输出复制到新文件夹 deployment msbuild teamcity

Deployment 如何从jetty:run war切换到jetty:run？ deployment intellij-idea jetty

Deployment activeadmin菜单链接在部署到根以外的命名空间时失败？ deployment menu routes

Deployment 如何在高可用性的蓝绿色部署场景中执行RavenDB索引更改？ deployment ravendb

Deployment 如何在没有发布的情况下部署和更新erlang应用程序 deployment erlang

Deployment 从STS直接将Appfuse-Spring MVC项目部署到Tomcat-错误 deployment

Deployment 持续整合和；“X作为代码”； deployment version-control continuous-integration

Deployment 卡拉夫部署目录 deployment

Deployment Firebase部署404 can'；找不到index.html deployment

Deployment 如何在weblogic server中部署solr war文件 deployment solr weblogic

随机文章推荐

Linux 当使用锁文件来避免一个脚本同时运行的两个实例时，如何避免竞争条件？ linux bash shell

Linux 为什么在find命令中使用dirname会给每个匹配点？ linux bash unix

Linux grep用于中间带有通配符的文本 linux sed awk grep

Linux 如何安装qemu修补版？ linux ubuntu

Linux Postfix邮件发送问题？ linux

Linux 查找由nohup命令运行的进程 linux process

Linux 致命：在/home/trx/.gitconfig中的错误配置文件第1行 linux git ubuntu

在puppet中管理linux的用户密码 linux puppet

Linux 管道ls输出到scp命令 linux

Linux 如何重命名共享库以避免同名冲突？ linux linker

Linux 减去小时和分钟 linux perl

Linux 需要对一个或另一个参数使用getopts linux bash parameters

Linux 是为32位还是64位机器编译共享对象？ linux

Linux Makefile目标颜色输出 linux bash colors makefile

Linux 返回通过ssh发送到主机的命令的状态 linux bash shell

Linux gnuplot:X轴上未显示XTIC linux graph gnuplot

Linux 在Ubuntu mmap中有什么解决方案来区分读写吗？ linux

Linux 如何覆盖另一个.bbappend linux yocto

Linux sed-在模式前后插入文本 linux bash sed

Linux 如何为sed转义某些字符？ linux sed

[machine learning]相关推荐

Machine learning 在机器学习中，您可以做些什么来限制所需训练样本的数量？
Machine Learning

Machine learning 选择分类算法对标称数据和数字数据混合进行分类？
Machine Learning

Machine learning 支持向量机中的决策边界
Machine Learning

Machine learning 动态数据的聚类算法
Machine Learning Artificial Intelligence

Machine learning 超平面和平面的区别是什么？为什么超平面用方程w^T+来表示；b=0？
Machine Learning

Machine learning 朴素贝叶斯分类中的未知词
Machine Learning

Machine learning 将决策表转换为决策树
Machine Learning

Machine learning 如果信号随时间显著下降，则有效提高信号的方法
Machine Learning Statistics

Machine learning 如何加快前馈、基于梯度的反向传播神经网络的学习速度
Machine Learning Neural Network Artificial Intelligence

Machine learning 非正统使用深度学习来发现隐藏模式
Machine Learning Deep Learning

Machine learning 使用Theano进行卷积时出现内存不足错误
Machine Learning Neural Network

Machine learning 对于凸函数，梯度下降算法是否保证在无限次迭代后收敛？
Machine Learning

Machine learning 优化部署的Tensorflow图
Machine Learning Neural Network Tensorflow

Machine learning 空间衰减越大，mse越低是否合理？
Machine Learning Deep Learning Keras

Machine learning 认知服务API
Machine Learning

Machine learning 如何在sklearn中检查预测值的特征值
Machine Learning

Machine learning 什么'；s a"；“好”；像yolo这样的DL模型的损失函数的值？
Machine Learning Neural Network Deep Learning

Machine learning 为什么不'；t max池层在解决回归问题时破坏CNN性能？
Machine Learning Computer Vision

Machine learning 预测分析-“预测分析”；为什么；因子&；模型解释性
Machine Learning

Machine learning 如何在Python中为一维向量数据构建二进制分类器/预测器
Machine Learning

Machine learning 图里创造：在黑魔法eGPU上缓慢的训练表现
Machine Learning

Machine learning 分类变量'；降维
Machine Learning

Machine learning 如何量化给定训练数据样本的偏差和方差
Machine Learning Statistics

Machine learning 处理不平衡的分类数据？
Machine Learning

Machine learning 如何计算单个CNN层中的权重数和偏差值？
Machine Learning Neural Network

Machine learning 通常如何对RNN/LSTM的序列数据执行批处理
Machine Learning Neural Network Pytorch

Machine learning 使用文本和数字列构建机器学习模型
Machine Learning Nlp

Machine learning 在NLP中，混合模型何时比纯ML模型更有效？
Machine Learning Nlp

Machine learning AlphaVantage API技术指标：是否仅使用过去的信息？
Machine Learning

Machine learning 手动图像标记和文件夹复制工具
Machine Learning Computer Vision

Tags

Biztalk Microservices Jestjs Jboss Fluent Nhibernate Ibm Mq Requirejs Charts Stream Silverlight 4.0 Apache Kafka Ruby Netsuite Date Redis Protocol Buffers Pascal Cryptography Java 8 Vector Map Keyboard Macos Stanford Nlp Ftp Stm32 Gcc Lucene Eclipse Plugin Actions On Google Polymer Cmake Reference Socket.io Blockchain Yii2 Cassandra Drupal String Ios4 Stripe Payments Scikit Learn Junit Service Opengl Vbscript Xcode Certificate Jquery Mobile Windbg Hash If Statement Apache Flex Hyperlink Silverstripe Facebook Graph Api Jasper Reports Project Management Firefox Addon Spring Boot Datatables Sharepoint 2013 Floating Point Automation Sapui5 Couchdb Ldap Joomla Sharepoint 2010 Webview Encoding Codeigniter Ckeditor Loops Configuration Activemq Redirect Twilio For Loop Collections Google Plus Discord.py Sonarqube Julia Menu Shell Algorithm Python Sphinx Automated Tests Big O Knockout.js Transactions Iis Regex Entity Framework 4 Random Python 2.7 Jpa Browser Path Ipad Clearcase Breeze Design Patterns Highcharts Windows Phone 8.1 Leaflet Common Lisp Zurb Foundation Push Notification Scheme Actionscript Tinymce Swing Calendar Puppet Flash Kubernetes Sip Azure Service Fabric Arduino Multithreading Grafana Django Ravendb Racket Wso2 Layout Arm Ant Wicket Nunit Domain Driven Design Asterisk Delphi Compression Oop Zend Framework2 Coding Style Asp.net Web Api Tsql Jquery Plugins Groovy Editor Azure Interface Uml Com Select Javafx Web Applications Syntax Identityserver4 Directx Asp.net Mvc Apache Storm Appium Angular Material Bazel Oracle10g Ms Word Applescript Jenkins Php Material Ui Iframe Networking Nosql Terminal Npm Api Llvm Rust Jwt Sencha Touch 2 Apache Flink Vim Documentation Airflow Cocoa Markdown Instagram Dynamic Rx Java Omnet++ Asp Classic Outlook Asp.net Core Ansible Mule Twitter Bootstrap 3 Selenium Webdriver Qml Amazon Web Services Meteor Smtp Microsoft Graph Api Tree Itext Safari Visual Studio 2012

Copyright © 2024. All Rights Reserved by - Fatal编程技术网