Machine learning 如何进一步调整超参数与以下趋势线的训练损失和验证损失？_Machine Learning_Deep Learning_Training Data_Hyperparameters - Fatal编程技术网

Machine learning 如何进一步调整超参数与以下趋势线的训练损失和验证损失？

machine-learning deep-learning

Machine learning 如何进一步调整超参数与以下趋势线的训练损失和验证损失？,machine-learning,deep-learning,training-data,hyperparameters,Machine Learning,Deep Learning,Training Data,Hyperparameters,我正在训练一个变分编码器（VAE），其MSE损失为100个历元，我得到了以下训练和验证损失。我的问题是：如果损失的改善很小，我是否应该继续训练学习率合适吗？我在这个例子中使用了学习速率衰减，我尝试了不同的初始值来计算学习速率，但得到了相似的结果我计划为测试选择列车和有效损耗之和最小的历元。对吗我可以调整哪些其他参数以使其更好？还有其他建议吗？谢谢看起来不错，训练和验证损失与低损失几乎相同。低损失意味着模型预测的置信度非常高（与损失相反）。从这一点上获得进一步的改进将花费大量时间调整

我正在训练一个变分编码器（VAE），其MSE损失为100个历元，我得到了以下训练和验证损失。我的问题是：

如果损失的改善很小，我是否应该继续训练

学习率合适吗？我在这个例子中使用了学习速率衰减，我尝试了不同的初始值来计算学习速率，但得到了相似的结果

我计划为测试选择列车和有效损耗之和最小的历元。对吗

我可以调整哪些其他参数以使其更好？还有其他建议吗？谢谢

看起来不错，训练和验证损失与低损失几乎相同。低损失意味着模型预测的置信度非常高（与损失相反）。从这一点上获得进一步的改进将花费大量时间调整所有超参数。@yudhiesh感谢您的评论。这很有帮助。假设我使用这个作为我的最终版本，用min（火车损失+有效损失）来选择纪元是否合理？或者我只使用上一个历元？您可以在每个历元使用
keras.callbacks.ModelCheckpoint
重新训练并保存模型，然后从中选择最佳的一个（最小值损失和列车损失）。

[deep learning]相关文章推荐

Deep learning Caffe总是在CPU模式下使用单核 deep-learning

Deep learning 这个图形化的可视化是正确的吗？ deep-learning

Deep learning Nvidia数字：损失保险（val和train）直接归零编辑：找到问题。我在创建数据集时忘记添加自定义类。业余爱好者犯了错误，但将此留给犯类似错误的其他人。 deep-learning

Deep learning 掩模RCNN的损失函数是什么？ deep-learning

Deep learning mnist&x27；s的深层特征可视化什么都不是 deep-learning

Deep learning 学习率对模型训练的影响 deep-learning keras

Deep learning eval（）模式下的训练模型在PyTorch中提供更好的结果？ deep-learning pytorch

Deep learning 如何正确设计用于信号处理的人工神经网络？ deep-learning neural-network

Deep learning '；顺序'；对象没有属性'；特点'；在提取vgg19 pytorch特征时 deep-learning pytorch

Deep learning 如何通过查询获取基于Pytorch dataloader的数据集中特定项的ID？ deep-learning pytorch

Deep learning 为什么我的tensorflow发生器的功能这么慢？ deep-learning

Deep learning ResNet-101特征图形状 deep-learning

Deep learning “目标检测”；一致性“；在逐帧处理视频时 deep-learning computer-vision

Deep learning pytorch:basic operations实现的实例规范与torch.nn.InstanceNorm2d的结果不同 deep-learning pytorch

Deep learning 用于实例分割的三维标注 deep-learning computer-vision

Deep learning 可变周期 deep-learning

Deep learning 滞后两天的预测值 deep-learning

Deep learning 我做错了什么？ deep-learning

Deep learning CNN池层与自动编码器中的编码器有何不同？ deep-learning

Deep learning 对inception v3模型使用32*32*3图像大小 deep-learning

随机文章推荐

Video 观看dsf视频文件 video

在angular.js指令中使用video.js video angularjs

Video 帧提取期间的ffmpeg视频扫描进度 video ffmpeg

Video 如何使用plupload将无限大小的文件上传到AmazonS3？ video file-upload amazon-s3

Video 流媒体视频 video web video-streaming

Video 使用Simplepie从YouTube提要获取视频视图 video youtube rss

Video IP摄像头-读取实时ASF视频流 video camera ffmpeg

Video 是否可以在M3U8播放列表中播放M3U8播放列表？ video ffmpeg video-streaming

Video Mediainfo-从自定义时间开始分析媒体曲目 video

Video FFmpeg标志，用于生成与视频长度相关的固定数量的图像 video ffmpeg

Video 使用Youtube dl从Youtube的每个搜索结果下载第一个视频？ video cmd youtube

Video MP4的时间戳信息 video ffmpeg

Video 可以在树莓皮上播放360度视频吗？ video raspberry-pi

Video 具有从左到右转换的Ffmpeg图像覆盖 video filter ffmpeg

Video 为什么建议在视频中添加链接，而不是嵌入链接，以符合ADA要求？ video

Video 旋转90并将视频与ffmpeg连接 video ffmpeg

Video ffmpeg avcodec_接收_数据包返回-11 video ffmpeg

Video 更改像素纵横比对播放没有影响 video

Video 如何在我的嵌入式页面中播放视频？ video iframe

Video FFmpeg屏幕混合模式将图像变成粉红色 video ffmpeg

[machine learning]相关推荐

Machine learning 文件路径名或URL分析
Machine Learning

Machine learning 在Weka中使用HMM
Machine Learning

Machine learning 向量x的概率
Machine Learning

Machine learning 一个直观的马尔可夫网络（MRFs）模型？
Machine Learning Artificial Intelligence

Machine learning 用于文本数据分类的朴素贝叶斯与支持向量机
Machine Learning Scikit Learn

Machine learning 虚拟机和Ubuntu 14.04中GoogLeNet模型的不同输出
Machine Learning Neural Network Computer Vision Deep Learning

Machine learning 回归问题的维数/降噪技术
Machine Learning Scikit Learn

Machine learning Sklearn GridSearchCV是否会检查估计器的所有可能默认选项'；s参数？
Machine Learning Scikit Learn

Machine learning 为什么我们需要纪元？
Machine Learning

Machine learning 基于上下文无监督聚类的Encog递归自组织映射
Machine Learning Neural Network Artificial Intelligence

Machine learning BLEU评分的变化
Machine Learning

Machine learning 当我使用文本文件输入时，syntaxnet demo.sh挂起
Machine Learning Nlp

Machine learning 基于迁移学习的Google InceptionV-3慢速预测
Machine Learning Tensorflow Computer Vision

Machine learning 如何确保Caffe分段网络输出大小与输入大小相同？
Machine Learning Deep Learning Computer Vision

Machine learning MNIST培训，检测数字序列？
Machine Learning Computer Vision

Machine learning 训练准确度积极提高，测试准确度稳定
Machine Learning Tensorflow

Machine learning 为什么一开始神经网络的验证损失和准确性会波动？
Machine Learning Neural Network

Machine learning 我应该使用什么机器学习模型？
Machine Learning Neural Network

Machine learning 每个状态都是终端的强化学习
Machine Learning

Machine learning 如何在数据帧上查找值（%）？
Machine Learning

Machine learning 随机森林多类多输出分数？
Machine Learning Scikit Learn

Machine learning 如何在决策树分类器的导出树图像上显示分类值？
Machine Learning

Machine learning 基于序列数据的流失率预测
Machine Learning Deep Learning Pytorch

Machine learning 为什么谷歌视觉要花这么多时间来训练这个模型？
Machine Learning Google Cloud Platform Computer Vision

Machine learning 机器学习中的宏观平均和加权平均之间有区别吗？
Machine Learning Scikit Learn

Machine learning 创建数据集的最佳实践
Machine Learning Deep Learning Computer Vision

Machine learning 类型错误：'<'；在'；元组'；和'；int'；在支持向量机中
Machine Learning

Machine learning 如何从代码中删除model.fit_generator（）函数导致的ValueError？
Machine Learning Model Artificial Intelligence

Machine learning 使用scikit在Databricks上学习
Machine Learning Scikit Learn

Machine learning LSTM不是只输出一个数字而不是一个数字向量吗
Machine Learning Deep Learning

Tags

Keycloak Dialogflow Es Apache Nifi Streaming Glsl Windows Phone 8.1 Language Agnostic Paypal Opengl Es Sharepoint 2010 Google Sheets Tkinter Zend Framework2 Visual Studio 2015 Intellij Idea Sbt Boost C# 4.0 Iis Web Login Pytorch Api Autohotkey Ember.js Actions On Google Requirejs Tcp Windows Phone 7 Properties Tree Dart Module File Upload Jersey Twilio Playframework 2.0 Vue.js Memory Leaks Sharepoint 2007 Umbraco Asp.net Web Api Odoo Cygwin 3d Sencha Touch Erlang Antlr4 Alfresco Routes Rdf Wix Aws Lambda Lambda Next.js Pagination Validation Service Phantomjs Binary Content Management System Exchange Server Matrix Windows Services Calendar Clearcase Stm32 Timer Outlook Google Cloud Storage Mediawiki Google Colaboratory Itext Zsh Java Me Google App Engine Forms Join Nest Maps Kotlin Performance Postman C# 3.0 Regex Docker Protractor Grid Oracle10g Apache Storm Wordpress Random Dojo Qt Docker Compose C Npm Internationalization C++ Shiny Web Scraping Ftp Google Compute Engine Html5 Canvas Gruntjs Amazon S3 Csv If Statement Sharepoint Mapping Yii Twitter Bootstrap 3 Sml Webstorm Google Maps Api 3 Wicket Ios5 Mod Rewrite Struts2 Mongodb Caching Abap Google Maps Microsoft Graph Api Sublimetext2 Ubuntu Project Management Doctrine Orm Google Chrome Devtools Virtual Machine Spring Cloud Sql Server 2008 R2 Sas Ruby On Rails 3 Linkedin Vuejs2 Proxy Keras Asp.net Core Blackberry Tfs Knockout.js Combobox Lua Audio Parameters Wpf Leaflet Dynamic Yaml Azure Cosmosdb C# Junit Windows 7 Ruby On Rails 3.1 Version Control Ruby On Rails 3.2 Youtube Api Influxdb Openssl Less Jetty Google Bigquery Jasmine Numpy Ios4 Methods Logging Smalltalk Report Windows Store Apps Mips Perforce Jakarta Ee Syntax Z3 Search Visual Studio 2010 Zurb Foundation Spring Integration Hadoop Mongoose Sparql Facebook .htaccess Extjs4 Tcl Ssl Sms Github Latex Time Complexity Hybris Embedded Coldfusion Optimization Stanford Nlp Openlayers Express Https Identityserver4

Copyright © 2024. All Rights Reserved by - Fatal编程技术网