A neural reinforcement learning model for tasks with unknown time delays - 代码天地

A neural reinforcement learning model for tasks with unknown time delays

其他 2020-05-23 20:38:11 阅读次数: 0

郑重声明：原文参见标题，如有侵权，请联系作者，将会撤销发布！

Abstract

　　我们提出了一个基于生物学的神经模型，能够在复杂的任务中执行强化学习。该模型的独特之处在于，它能够在一个行动、状态转换和奖励之间存在未知和可变时间延迟的环境中，解决需要智能体执行一系列未经奖励的操作以达到目标的任务。具体来说，这是第一个能够在半马尔可夫决策过程（Semi-Markov Decision Process，SMDP）框架内发挥作用的强化学习神经模型。我们认为，当前建模工作的这种扩展为人类决策的日益复杂的模型奠定了基础。

Keywords: 强化学习；神经模型；SMDP

1. Introduction

2. Background

3. Methods

3.1 Model architecture

3.2 Representing and computing with neural activities

3.3 Learning

3.4 Error calculation

4. Results

5. Discussion

猜你喜欢

转载自www.cnblogs.com/lucifer1997/p/12944231.html

A neural reinforcement learning model for tasks with unknown time delays

论文笔记12:Building Adaptive Tutoring Model using Artificial Neural Networks and Reinforcement Learning

Deep Reinforcement Learning is a waste of time

[Reinforcement Learning] Model-Free Prediction

Time Delays and deferred work

论文笔记系列-Neural Architecture Search With Reinforcement Learning

2017-ICLR-Neural Architecture Search with Reinforcement Learning 论文阅读

NAS：NEURAL ARCHITECTURE SEARCH WITH REINFORCEMENT LEARNING NAS开山之作

Neural Network Dynamics for Model-Based Deep Reinforcement Learniing with Model-Free Fine-Tuning

Reinforcement Learning强化学习系列之一：model-based learning

Linux Kernel Programming - Time,Delays,and Deferred Work

LDD-Time, Delays, and Deferred Work

CAPES:Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning

网络结构搜索（1）—— NAS（Neural architecture search with reinforcement learning）论文笔记

《Graph Representation Learning》【5】——The Graph Neural Network Model

Reinforcement Learning(001)

Introduction to Reinforcement Learning

reinforcement-learning-1

Reinforcement Learning——MDP

Tutorials on Inverse Reinforcement Learning

A Distributional Perspective on Reinforcement Learning

Reinforcement Learning 增强学习

Robust Adversarial Reinforcement Learning

Control of a Quadrotor with Reinforcement Learning

Reinforcement Learning NOTE

Policy in Reinforcement Learning

Reinforcement Learning Cheatsheet

【ML】Reinforcement Learning

Reinforcement Learning 笔记（1）

Reinforcement Learning 笔记（3）

今日推荐

周排行

深度学习------Lingvo框架下的加速通道GPipe

webjars管理静态资源

C专家编程_2.2

mysql 源码安装

json文件操作

123231432

注解的实现

Spring MVC 控制器

《人月神话》读后感二

C#使用HttpWebRequest和HttpWebResponse上传文件示例

每日归档

更多

2024-09-08(0)

2024-09-07(0)

2024-09-06(0)

2024-09-05(0)

2024-09-04(0)

2024-09-03(0)

2024-09-02(0)

2024-09-01(0)

2024-08-31(0)

2024-08-30(0)