Deep learning: ReLU does not work, but sigmoid does

Tags: deep-learning, mobilenet

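When I use sigmoid in the output layer, the network trains well, which puzzles me, because if I set the output activation to ReLU instead, the network does not converge: the training loss stops decreasing after the first epoch. Can anyone explain this?

The network's input is an image, with pixels rescaled to 0-1. The output is a single value between 0 and 1.

Thanks.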

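From your question I understand that the model does not converge when you use ReLU as the final activation.

If that is the case, the answer lies in the ReLU function itself. ReLU does not scale the network output to [0, 1]; it returns max(0, x), which is not what you want when your output/ground truth is scaled to [0, 1]. Sigmoid does squash the network output into [0, 1], which matches your ground truth.

To see this more clearly, think of the final layer of your network as producing a probability in [0, 1]. Sigmoid achieves this, but ReLU cannot, by its definition.

So, to compute the loss, your ground truth and your network output should be in the same range, which sigmoid provides. That is why, in your case, the model converges with sigmoid.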
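
To make the range argument concrete, here is a minimal sketch in plain Python (the function names are illustrative and not taken from the question's code). It evaluates both activations on a few pre-activation values. It also prints their derivatives, which the answer above does not discuss: ReLU's derivative is 0 for any negative input, so a single output unit that lands there receives no gradient, which may be one reason the loss stays flat.

```python
import math


def sigmoid(z: float) -> float:
    """Squash any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def sigmoid_grad(z: float) -> float:
    """Derivative of sigmoid; strictly positive for every finite z."""
    s = sigmoid(z)
    return s * (1.0 - s)


def relu(z: float) -> float:
    """Return max(0, z); the result is unbounded above."""
    return max(0.0, z)


def relu_grad(z: float) -> float:
    """Derivative of ReLU; exactly 0 when z is not positive."""
    return 1.0 if z > 0 else 0.0


# Hypothetical pre-activation values of the single output unit.
for z in [-3.0, -0.5, 0.5, 3.0]:
    print(
        f"z={z:+.1f}  "
        f"sigmoid={sigmoid(z):.3f} (grad {sigmoid_grad(z):.3f})  "
        f"relu={relu(z):.3f} (grad {relu_grad(z):.1f})"
    )
```

For every input the sigmoid output stays inside (0, 1), while ReLU returns 3.0 for z = 3.0 (outside the target range) and 0 with zero gradient for the negative inputs.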

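I had a similar problem, and it is now solved. My neural network had only 3 layers and was trained on MNIST data. Sigmoid activation worked but ReLU did not, with everything else the same. I reduced the learning rate from 3 to 0.1, and then ReLU started working.
I got the idea from here: [link not preserved]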
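
The learning-rate effect described above can be reproduced on a toy problem. The sketch below is an assumption-laden illustration, not the poster's MNIST network: it fits a single unit y = act(w*x + b) to one hypothetical data point (x = 1.0, target = 0.5) with squared error and plain gradient descent, using the two learning rates mentioned in the answer.

```python
import math


def relu(z):
    return max(0.0, z)


def relu_grad(z):
    return 1.0 if z > 0 else 0.0


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sigmoid_grad(z):
    s = sigmoid(z)
    return s * (1.0 - s)


def train(act, act_grad, lr, steps=50):
    """Fit y = act(w*x + b) to one point and return the final squared error."""
    x, target = 1.0, 0.5   # hypothetical single training example
    w, b = 1.0, 0.0        # hypothetical initial parameters
    for _ in range(steps):
        z = w * x + b
        y = act(z)
        # Chain rule: d(loss)/dz = 2 * (y - target) * act'(z)
        dloss_dz = 2.0 * (y - target) * act_grad(z)
        w -= lr * dloss_dz * x
        b -= lr * dloss_dz
    return (act(w * x + b) - target) ** 2


for name, act, grad in [("relu", relu, relu_grad), ("sigmoid", sigmoid, sigmoid_grad)]:
    for lr in (3.0, 0.1):
        print(f"{name:8s} lr={lr:<4} final loss={train(act, grad, lr):.6f}")
```

In this toy setup the first ReLU step with a learning rate of 3 pushes the pre-activation to a negative value, after which the gradient is exactly 0 and the loss never moves again. With a learning rate of 0.1 the same unit converges. The sigmoid unit still has a nonzero gradient everywhere, so it recovers even with the large step.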

What loss function are you using?

Both relu and sigmoid have been tested. I described my problem in detail in this link; could you give me some advice? Thanks a lot.

relu and sigmoid are activation functions, not loss functions. Loss functions are things like "cross-entropy" or "mse". In fact, using relu as the activation in the last layer does not make much sense, because in general you have one of three task types: regression, binary classification, or multi-class classification. In the first case it is best to use no activation with mse as the loss; in the second, sigmoid with binary cross-entropy; in the last, softmax with categorical cross-entropy.

Thanks for your answer. I have another question. I trained a regression network using resnet50 as the backbone. The input is an image (224*224*3, pixels rescaled to 0-1) and the output is a single value (0-1). Whether I use relu or sigmoid as the output activation, the network does not converge. But when I use VGG16 as the backbone with sigmoid as the output activation, the network converges. Could you give me some advice?

1- Check where the loss starts for your model and for VGG16, and check what value the loss reaches. 2- Check how the model behaves with a higher learning rate; in your case, try a high LR for resnet50 (maybe 1e-2) and see whether the loss changes at all. One reason the loss is not improving could be that it is stuck on a plateau.
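
The pairing of task type, output activation, and loss recommended in the comments can be written out as a small plain-Python sketch. All names here (`OUTPUT_HEADS`, the helper functions) are hypothetical and chosen for illustration; a real project would normally use the equivalents from its deep learning framework.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def softmax(zs):
    """Turn a list of scores into probabilities that sum to 1."""
    m = max(zs)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def mse(y_pred, y_true):
    return (y_pred - y_true) ** 2


def binary_cross_entropy(p, y_true, eps=1e-12):
    p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
    return -(y_true * math.log(p) + (1 - y_true) * math.log(1.0 - p))


def categorical_cross_entropy(probs, true_index, eps=1e-12):
    return -math.log(max(probs[true_index], eps))


# Task type -> (output activation, loss), as suggested in the comments.
OUTPUT_HEADS = {
    "regression": ("none (linear)", "mse"),
    "binary classification": ("sigmoid", "binary cross-entropy"),
    "multi-class classification": ("softmax", "categorical cross-entropy"),
}

for task, (activation, loss) in OUTPUT_HEADS.items():
    print(f"{task}: activation={activation}, loss={loss}")

# One made-up example per task, using raw final-layer scores.
print("regression loss:", mse(0.7, 0.5))
print("binary loss:", binary_cross_entropy(sigmoid(2.0), 1))
print("multi-class loss:", categorical_cross_entropy(softmax([1.0, 2.0, 3.0]), 2))
```

For a regression target confined to 0-1, as in the question, a sigmoid output with mse is also a common choice; the table above only restates the comment's general recommendation.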
