Deep learning 如何选择Q值最高的动作_Deep Learning_Action_Reinforcement Learning_Q Learning - Fatal编程技术网

Deep learning 如何选择Q值最高的动作

deep-learning

Deep learning 如何选择Q值最高的动作,deep-learning,action,reinforcement-learning,q-learning,Deep Learning,Action,Reinforcement Learning,Q Learning,我已经用经验回放实现了DQN。输入是50x50x1。批量大小为4时，输入将变为（4,50,50,1）。总输出操作数为10。如果批量大小为4，则输出为（4,10）。我想知道如何从这个（4,10）向量中选择最大q值。提前感谢这可能就是您想要的这将返回给定张量X的单个最大值在DQN的上下文中，批处理大小为4（4行），您需要选择4个最大Q值，每行一个。您可以通过以下方式执行此操作： X_max = tf.reduce_max(X, axis=1) 其中X是包含具有形状（4,10）的Q值的数据结构。

我已经用经验回放实现了DQN。输入是50x50x1。批量大小为4时，输入将变为（4,50,50,1）。总输出操作数为10。如果批量大小为4，则输出为（4,10）。我想知道如何从这个（4,10）向量中选择最大q值。提前感谢

这可能就是您想要的

这将返回给定张量X的单个最大值

在DQN的上下文中，批处理大小为4（4行），您需要选择4个最大Q值，每行一个。您可以通过以下方式执行此操作：

X_max = tf.reduce_max(X, axis=1)
其中X是包含具有形状（4,10）的Q值的数据结构。
这将在单个张量X_max中返回4个最大Q值，输出形状为（4,1）。
这可能就是您要查找的
这将返回给定张量X的单个最大值
在DQN的上下文中，批处理大小为4（4行），您需要选择4个最大Q值，每行一个。您可以通过以下方式执行此操作：

X_max = tf.reduce_max(X, axis=1)
其中X是包含具有形状（4,10）的Q值的数据结构。这将返回具有输出形状（4,1）的单个张量X_max中的4个最大Q值

[automated tests]相关文章推荐

Automated tests Perfecto-UFT集成 automated-tests

Automated tests nightwatchJS如何在表上使用assert.containsText？ automated-tests

Automated tests 在下拉菜单中单击，并在Capybara中滚动 automated-tests

Automated tests 是否有与Appium for windows应用程序（非mobile）等效的工具？ automated-tests appium robotframework

Automated tests 如何避免减少和修改违规？ automated-tests

Automated tests 如何在testcafe中减小鼠标指针的大小 automated-tests

Automated tests 空手道#regex ;；无法验证部分匹配 automated-tests karate

Automated tests Botium是否有松弛的接头？ automated-tests bots

Automated tests 柏树与普罗米修斯相容吗？ automated-tests prometheus cypress

Automated tests Cypress在单击注销时重定向到另一个url超时 automated-tests cypress

Automated tests 用木偶机进行多用户e2e测试 automated-tests

Automated tests Cy.click失败，因为目标元素已禁用或其部分可见 automated-tests cypress

随机文章推荐

.net 4.0 Windows工作流4.0 InstancePersistenceCommand错误 .net-4.0

.net 4.0 创建版本不同于父应用程序的.NET子应用程序 .net-4.0

.net 4.0 安装.net framework 4.0是否需要.net framework 3.5？ .net-4.0

.net 4.0 net 4上WixSharp的system.design参考问题 .net-4.0

[deep learning]相关推荐

Deep learning Keras中的卷积二维暹罗网络
Deep Learning Keras

Deep learning 如何准备用于caffe输入的灰度图像数据
Deep Learning

Deep learning 如何设置MXNET\u CUDNN\u自动调谐\u默认值？
Deep Learning

Deep learning 如何使openpose在不支持cuda的情况下使用caffe
Deep Learning Neural Network

Deep learning 是否有可能实现一个损失函数，将正确答案的优先级排在前k个概率中？
Deep Learning

Deep learning 在MXNet中使用高级API时，如何向网络中输入额外数据
Deep Learning

Deep learning 作为YOLOv3输入的非平方图像
Deep Learning Computer Vision Pytorch

Deep learning YOLO是否在完全连接层之前重新缩放锚？没有ROI池的解决方案是什么？
Deep Learning Computer Vision

Deep learning 如何从Pytork中的优化器获取/打印正则化损耗/l2损耗/重量衰减值？
Deep Learning Pytorch

Deep learning 时间序列模型参数问题
Deep Learning

Deep learning 无法导入名称'；打开'；从'；智能开放'；
Deep Learning Nlp

Deep learning DNN网络要求输入数据为2D，但我的训练数据为rgb（3D）
Deep Learning Pytorch

Deep learning MCEP能否用于音频分类'；什么样？
Deep Learning Pytorch

Deep learning Pytorch型号的CPU和GPU内存不足，can’；我不明白我是什么’；我做错了
Deep Learning Pytorch

Deep learning Pytorch为3x3,32 conv2d层和2x2 maxpool层添加超参数
Deep Learning Pytorch

Deep learning 带中间标签的多标签问题
Deep Learning Pytorch

Deep learning 错误：无法对像对象情感分析这样的字节使用字符串模式
Deep Learning Nlp

Tags

Keycloak Dialogflow Es Apache Nifi Streaming Glsl Windows Phone 8.1 Language Agnostic Paypal Opengl Es Sharepoint 2010 Google Sheets Tkinter Zend Framework2 Visual Studio 2015 Intellij Idea Sbt Boost C# 4.0 Iis Web Login Pytorch Api Autohotkey Ember.js Actions On Google Requirejs Tcp Windows Phone 7 Properties Tree Dart Module File Upload Jersey Twilio Playframework 2.0 Vue.js Memory Leaks Sharepoint 2007 Umbraco Asp.net Web Api Odoo Cygwin 3d Sencha Touch Erlang Antlr4 Alfresco Routes Rdf Wix Aws Lambda Lambda Next.js Pagination Validation Service Phantomjs Binary Content Management System Exchange Server Matrix Windows Services Calendar Clearcase Stm32 Timer Outlook Google Cloud Storage Mediawiki Google Colaboratory Itext Zsh Java Me Google App Engine Forms Join Nest Maps Kotlin Performance Postman C# 3.0 Regex Docker Protractor Grid Oracle10g Apache Storm Wordpress Random Dojo Qt Docker Compose C Npm Internationalization C++ Shiny Web Scraping Ftp Google Compute Engine Html5 Canvas Gruntjs Amazon S3 Csv If Statement Sharepoint Mapping Yii Twitter Bootstrap 3 Sml Webstorm Google Maps Api 3 Wicket Ios5 Mod Rewrite Struts2 Mongodb Caching Abap Google Maps Microsoft Graph Api Sublimetext2 Ubuntu Project Management Doctrine Orm Google Chrome Devtools Virtual Machine Spring Cloud Sql Server 2008 R2 Sas Ruby On Rails 3 Linkedin Vuejs2 Proxy Keras Asp.net Core Blackberry Tfs Knockout.js Combobox Lua Audio Parameters Wpf Leaflet Dynamic Yaml Azure Cosmosdb C# Junit Windows 7 Ruby On Rails 3.1 Version Control Ruby On Rails 3.2 Youtube Api Influxdb Openssl Less Jetty Google Bigquery Jasmine Numpy Ios4 Methods Logging Smalltalk Report Windows Store Apps Mips Perforce Jakarta Ee Syntax Z3 Search Visual Studio 2010 Zurb Foundation Spring Integration Hadoop Mongoose Sparql Facebook .htaccess Extjs4 Tcl Ssl Sms Github Latex Time Complexity Hybris Embedded Coldfusion Optimization Stanford Nlp Openlayers Express Https Identityserver4

Copyright © 2024. All Rights Reserved by - Fatal编程技术网