论文笔记：残差网络 Deep Residual Learning for Image Recognition

其他 2020-08-04 20:53:53 阅读次数: 0

论文笔记: Deep Residual Learning for Image Recognition

目标：利用残差网络是的训练更加简单

面对的问题：

degradation Problem：当添加的网络层次变多，精确度逐渐饱和，网络层次将趋近饱和。

Intuition：

文中的想法是将堆叠的感知器学习原有输出的残差。

具体表示是：假设 $\mathcal{H}(\mathbf{x})$ 为表示某几层感知器的， $x$ 是输入。那么我们令 $\mathcal{F}(\mathbf{x}):=\mathcal{H}(\mathbf{x})-\mathbf{x}$

那么原来的输出将变成： $\mathcal{F}(\mathbf{x})+\mathbf{x} =\mathcal{H}(\mathbf{x})$ , 让感知层用来学习 $\mathcal{F}$

尽管两者将同时迭代到需要的方程，但两者的难易程度不同。

我们假设后者比前者容易。

在这里插入图片描述

如图中所示，
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+\mathbf{x}$
其中 $\mathcal{F}=W_{2} \sigma\left(W_{1} \mathbf{x}\right)$ ， $\sigma$ 便是ReLU

如果 $\mathbf{x}$ and $\mathcal{F}$ 不是相同的纬度，那遍让 $\mathbf{x}$ 乘以一个矩阵 $W_s$ :
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+W_{s} \mathbf{x}$
即使纬度相同，也可以乘以一个方阵。但是作者认为单位矩阵就足够解决degradation problem.

关于 $\mathcal{F}$ 的选取，作者认为可以是各种样子，可以是很多层，但如果是只有一层，就与线性层没有差别，并不能看出什么优势。

网络结构

普通结构

Baseline 由VCG网络而来。大部分的过滤器由3x3构成，而且都服从以下两条规则：

(i) 如果输出的特征图相同，那么过滤器数量不变

(ii)如果输出的特征图纬度减半，那么过滤器数量翻倍

残差网络

在普通结构上使用残差结构。如果输入输出纬度相同，那我们直接使用单位矩阵。
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+\mathbf{x}$
如果维度（图中虚线部分）缩减，将有两种原则：

（A）捷径的部分（x）仍然使用单位矩阵。缺失的部分添加0来解决，这种方式没有增加多余的参数

（B）用（2）式中的方式，将 $\mathbf{x}$ 映射到相同纬度上来

对于两种方法，如果残差网络经过了纬度变换的时候，他们的步长为2（也就是说每种维度的特征图各取一层）

在这里插入图片描述

最后的实验证明两种方式（A）（B）并没有太多表现上的区别

猜你喜欢

转载自blog.csdn.net/ArchibaldChain/article/details/107747029

论文笔记：残差网络 Deep Residual Learning for Image Recognition

《Deep Residual Learning for Image Recognition》残差网络 -- 解析笔记

残差网络(Deep Residual Learning for Image Recognition)

论文-Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition笔记

Deep Residual Learning for Image Recognition 笔记

Deep Residual Learning for Image Recognition 论文笔记

《ResNet-Deep Residual Learning for Image Recognition》论文笔记

论文笔记：Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition

[深度学习]Deep Residual Learning for Image Recognition(ResNet,残差网络)阅读笔记

论文阅读(二)ResNet(Deep Residual Learning for Image Recognition)笔记

ResNet论文阅读---《Deep Residual Learning for Image Recognition》

[论文理解]Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition 论文学习

Deep Residual Learning for Image Recognition----ResNet论文阅读

ResNet来源论文《Deep Residual Learning for Image Recognition》读后总结

ResNet论文详解：《Deep Residual Learning for Image Recognition》

论文阅读——ResNet，Deep Residual Learning for Image Recognition

论文阅读|ResNet：Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition论文翻译（非google翻译）

Deep Residual Learning for Image Recognition （ResNet）论文详细解读

ResNet(Deep Residual Learning for Image Recognition)

翻译：Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition（译）

ResNet: Deep Residual Learning for Image Recognition详解

Deep Residual Learning for Image Recognition（ResNet）阅读

Deep Residual Learning for Image Recognition(ResNet)

ResNet-Deep Residual Learning for Image Recognition

Paper | Deep Residual Learning for Image Recognition

今日推荐

周排行

成为C++高手之宏与枚举

在CAD二次开发中使用进度条

Js插件ECharts，HighCharts学习网址整理

Celery提交任务出错(on windows.)

cephfs内核客户端性能追踪

thinkphp中PHPExcel用法

EntityFramework动态组合多排序字段

汇编语言（八）实验9 根据材料编程

安装ubuntu后必须做的事情（对我而言）

JS函数式编程

每日归档

更多

2024-10-22(0)

2024-10-21(0)

2024-10-20(0)

2024-10-19(0)

2024-10-18(0)

2024-10-17(0)

2024-10-16(0)

2024-10-15(0)

2024-10-14(0)

2024-10-13(0)