Control of a Quadrotor with Reinforcement Learning

Category: Other | 2019-03-18 01:14:36

Goal

Control a quadrotor with a neural network trained using reinforcement learning.
The policy network is a function that maps a state directly to rotor thrusts.

Related Work

Guided Policy Search with an MPC controller.
That work uses a policy that maps raw sensor data to rotor velocities.

Contribution

Proposes a deterministic on-policy method that uses zero-bias, zero-variance samples.
Uses a small number of high-quality samples, so the burden on the neural network is small.

Network Structure
Input: orientation (3x3 rotation matrix, 9 values), position (3), angular velocity (3), linear velocity (3) --> 18-dimensional state vector
Output: 4-dimensional action vector (the rotor thrusts)
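
To make the dimensions concrete, here is a minimal NumPy sketch of how the 18-dimensional state could be assembled and passed through a small policy network. The function names, the two hidden layers of 64 tanh units, and the random initialization are assumptions chosen for illustration; the notes above only fix the input size (18) and output size (4).

```python
import numpy as np

def build_state(rotation, position, angular_velocity, linear_velocity):
    """Concatenate the four quantities into one 18-dimensional vector."""
    rotation = np.asarray(rotation, dtype=float)
    assert rotation.shape == (3, 3), "orientation must be a 3x3 rotation matrix"
    return np.concatenate([
        rotation.ravel(),                           # 9 values
        np.asarray(position, dtype=float),          # 3 values
        np.asarray(angular_velocity, dtype=float),  # 3 values
        np.asarray(linear_velocity, dtype=float),   # 3 values
    ])

def init_policy(rng, sizes=(18, 64, 64, 4)):
    """Random weights for a small MLP. Hidden sizes are an assumption."""
    return [(rng.normal(0.0, 0.1, size=(m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def policy(params, state):
    """Deterministic policy: state (18,) -> action (4,)."""
    x = state
    for w, b in params[:-1]:
        x = np.tanh(x @ w + b)   # hidden layers with tanh activation
    w, b = params[-1]
    return x @ w + b             # linear output layer, one value per rotor

rng = np.random.default_rng(0)
params = init_policy(rng)

# Hovering pose: identity orientation, 1 m above the origin, at rest.
state = build_state(np.eye(3), [0.0, 0.0, 1.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0])
action = policy(params, state)
print(state.shape, action.shape)
```

The weights here are untrained, so the resulting action is meaningless as a control command; the sketch only shows the shape of the mapping from state to rotor thrusts.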

Exploration Strategy
TRPO

Reposted from blog.csdn.net/weixin_42018112/article/details/88350713
