Machine learning: How to design a reward function in RL that depends on external random conditions? | Machine Learning, Reinforcement Learning, Reward - Fatal编程技术网


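For example, I want to use RL to train a system so that it is robust against any attack. However, the attacker can choose any point in my system, so the reward naturally depends on which attack point is chosen.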

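Because the range of possible attack points is very large, I cannot enumerate every choice when computing the reward. Can I instead randomly sample some choices and use their average or maximum reward as the reward function? Or is there a better way to handle this situation?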
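The sampling idea described in the question could be sketched roughly as follows. This is only an illustration under stated assumptions, not a recommended design: `sampled_reward`, `reward_fn`, `attack_points`, and the `mode` names are hypothetical and do not come from any library. It also assumes that a higher reward is better, so the "worst case" over the sampled attacks is taken as the minimum reward (which corresponds to the maximum damage).

```python
import random
from statistics import mean


def sampled_reward(reward_fn, attack_points, k, mode="mean", rng=None):
    """Estimate a reward from k randomly sampled attack points.

    reward_fn     -- hypothetical callable: attack point -> reward (higher is better)
    attack_points -- hypothetical finite pool of candidate attack points
    mode          -- "mean" for the average case, "worst" for the worst sampled case
    """
    rng = rng or random.Random()
    # Sample without replacement; use the whole pool if k exceeds its size.
    sample = rng.sample(attack_points, min(k, len(attack_points)))
    rewards = [reward_fn(p) for p in sample]
    if mode == "mean":
        return mean(rewards)
    if mode == "worst":
        # Lowest reward among the sampled attacks (assumption: higher is better).
        return min(rewards)
    raise ValueError(f"unknown mode: {mode}")


# Toy usage: one attack point in ten hurts the system.
points = list(range(100))
toy_reward = lambda p: -1.0 if p % 10 == 0 else 0.0
avg_estimate = sampled_reward(toy_reward, points, k=20, mode="mean")
worst_estimate = sampled_reward(toy_reward, points, k=20, mode="worst")
```

Note that the worst case over a sample can never be lower than the worst case over the full set of attack points, so the `"worst"` estimate is optimistic whenever the sample misses the most damaging attacks.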
