18-Learning Deep ResNet Blocks Sequentially using Boosting Theory

Category: Other · 2018-12-23 10:51:08

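The paper proposes a method, based on the principle of boosting, for training a deep residual neural network one layer at a time, and gives theoretical proofs of its performance and generalization ability.
Reference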

Method

Framework

Residual network:

$g_{t+1}(x)=f(g_{t}(x))+g_{t}(x)$
hypothesis module:

$o_{t}(x)=\mathrm{softmax}(W_{t}^{T}\cdot g_{t}(x))\in \mathbb{R}^{C}$, where $C$ is the number of classes in the classification task. In other words, this is a linear classifier (logistic regression).
weak module classifier:

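$h_{t}(x)=\alpha_{t+1}o_{t+1}(x)-\alpha_{t}o_{t}(x)\in \mathbb{R}^{C}$, where $\alpha$ is a scalar; that is, $h$ is a linear combination of the hypotheses of two adjacent layers. The first layer has no lower layer, so it can be treated as having a virtual lower layer with $\alpha_{0}=0$ and $o_{0}(x)=0$.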
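The three definitions above can be sketched in a few lines of NumPy. This is only an illustration: the block function `f(g) = tanh(g V)`, the random weights, the helper names, and the dimensions are all assumptions made here, not taken from the paper.

```python
import numpy as np

def softmax(z):
    """Row-wise softmax with max-subtraction for numerical stability."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def residual_step(g, V):
    """g_{t+1}(x) = f(g_t(x)) + g_t(x); f(g) = tanh(g V) is an assumed stand-in."""
    return np.tanh(g @ V) + g

def hypothesis(g, W):
    """o_t(x) = softmax(W_t^T g_t(x)); g is (n, d), W is (d, C), result is (n, C)."""
    return softmax(g @ W)

def weak_module(alpha_next, o_next, alpha_cur, o_cur):
    """h_t(x) = alpha_{t+1} o_{t+1}(x) - alpha_t o_t(x)."""
    return alpha_next * o_next - alpha_cur * o_cur

rng = np.random.default_rng(0)
n, d, C, T = 4, 8, 3, 5            # batch size, feature width, classes, blocks (all hypothetical)
g = rng.normal(size=(n, d))        # g_0(x): the input representation
alphas = [0.0] + list(rng.uniform(0.5, 1.5, size=T))   # alpha_0 = 0
outputs = [np.zeros((n, C))]       # o_0(x) = 0, the virtual lower layer
for t in range(T):
    g = residual_step(g, 0.1 * rng.normal(size=(d, d)))      # g_{t+1}
    outputs.append(hypothesis(g, rng.normal(size=(d, C))))   # o_{t+1}

weak = [weak_module(alphas[t + 1], outputs[t + 1], alphas[t], outputs[t])
        for t in range(T)]
print(weak[0].shape)  # (4, 3)
```

Note that each $o_t$ is a probability vector over the $C$ classes, while $h_t$ is a difference of scaled probability vectors and can contain negative entries.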
Expressing the residual network explicitly as an ensemble:

Let $F(x)$ denote the final output of the residual network. Combining this with the definitions above, we clearly have:
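From the definitions above, the sum of the weak module classifiers telescopes. Writing $T$ for the number of residual blocks (a symbol introduced here for illustration) and taking $F(x)$ to be the top hypothesis $o_{T}(x)$ (an assumption based on the definitions above, not a quotation from the paper), the identity can be sketched as:

$$
\sum_{t=0}^{T-1} h_{t}(x)
= \sum_{t=0}^{T-1}\bigl(\alpha_{t+1}o_{t+1}(x)-\alpha_{t}o_{t}(x)\bigr)
= \alpha_{T}o_{T}(x)-\alpha_{0}o_{0}(x)
= \alpha_{T}o_{T}(x)
$$

so that, whenever $\alpha_{T}\neq 0$,

$$
F(x)=o_{T}(x)=\frac{1}{\alpha_{T}}\sum_{t=0}^{T-1}h_{t}(x).
$$

Every intermediate term $\alpha_{t}o_{t}(x)$ with $0<t<T$ appears once with a plus sign and once with a minus sign, and the boundary term vanishes because $\alpha_{0}=0$ and $o_{0}(x)=0$.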

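We only need to train the residual network stage by stage (one residual block at a time), which is equivalent in effect to training an ensemble of weak classifiers. Besides the weights of the residual network itself, some auxiliary parameters also have to be trained, namely the $\alpha$ and $W$ of each layer (these can be discarded once training is finished).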
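The stage-by-stage idea can be illustrated with a minimal NumPy sketch. It is not the paper's algorithm: it uses plain greedy cross-entropy training of each block together with an auxiliary linear classifier, and leaves out the $\alpha$ weights and the boosting-specific steps. The function `train_block`, the block form `tanh(g V)`, the synthetic data, and all hyperparameters are assumptions made for illustration.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_block(g, y_onehot, rng, steps=300, lr=0.1):
    """Fit one residual block V and its auxiliary classifier W on frozen inputs g."""
    n, d = g.shape
    C = y_onehot.shape[1]
    V = 0.01 * rng.normal(size=(d, d))   # block weights; f(g) = tanh(g V) is an assumed form
    W = 0.01 * rng.normal(size=(d, C))   # auxiliary linear classifier
    for _ in range(steps):
        f = np.tanh(g @ V)
        g_next = f + g                             # g_{t+1} = f(g_t) + g_t
        p = softmax(g_next @ W)                    # o_{t+1}
        d_logits = (p - y_onehot) / n              # gradient of mean cross-entropy w.r.t. logits
        d_W = g_next.T @ d_logits
        d_a = (d_logits @ W.T) * (1.0 - f ** 2)    # backpropagate through tanh
        d_V = g.T @ d_a                            # lower blocks stay frozen: no gradient to g
        W -= lr * d_W
        V -= lr * d_V
    g_next = np.tanh(g @ V) + g
    p = softmax(g_next @ W)
    loss = -np.mean(np.log(np.sum(p * y_onehot, axis=1) + 1e-12))
    return V, W, g_next, loss

rng = np.random.default_rng(0)
n, d, C = 300, 6, 3                                # hypothetical sizes
X = rng.normal(size=(n, d))
y = (X[:, 0] * X[:, 1] > 0).astype(int) + (X[:, 2] > 1).astype(int)   # synthetic labels in {0, 1, 2}
Y = np.eye(C)[y]

blocks, losses = [], []
g = X
for t in range(4):
    V, W, g, loss = train_block(g, Y, rng)
    blocks.append(V)       # keep the block; the auxiliary W of earlier stages can be dropped
    losses.append(loss)
    print(f"block {t + 1}: training loss {loss:.4f}")
```

Each stage only sees the cached output `g` of the already trained blocks, so at any time only one shallow model is being optimized.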

Telescoping Sum Boosting

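The main text of the paper develops the method using binary classification as the example. We care more about the multi-class problem, whose algorithms are given in the appendix. The pseudocode in the paper is quite clear and is copied directly below: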

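Theory

The authors prove that BoostResNet retains the advantages of boosting algorithms:
1. The error decreases exponentially with network depth (that is, with the number of weak classifiers);
2. Resistance to overfitting: model complexity grows only linearly with network depth. See the paper for details.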
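To get a feel for what "exponentially in depth" means, the following sketch evaluates a bound of the generic form $\exp(-c\,T\,\gamma^{2})$, where $T$ is the number of blocks and $\gamma$ measures how much better than chance each weak classifier is. The functional form is the usual shape of boosting bounds; the constant `c = 0.5`, the value of `gamma`, and the helper names are illustrative assumptions, and the exact statement should be taken from the paper.

```python
import math

def bound(T, gamma, c=0.5):
    """Illustrative error bound exp(-c * T * gamma^2); c = 0.5 is an assumed constant."""
    return math.exp(-c * T * gamma ** 2)

def blocks_needed(eps, gamma, c=0.5):
    """Smallest T for which exp(-c * T * gamma^2) <= eps."""
    return math.ceil(math.log(1.0 / eps) / (c * gamma ** 2))

for T in (10, 100, 1000):
    print(T, bound(T, gamma=0.1))

print(blocks_needed(0.01, gamma=0.1))  # 922
```

Under these assumed numbers, halving $\gamma$ roughly quadruples the depth needed to reach the same bound, since $T$ scales with $1/\gamma^{2}$.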

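Discussion

The defining feature of BoostResNet is layer-by-layer training, which brings a number of benefits:
- Lower memory usage (memory efficient), which makes it possible to train large, deep networks. (At present we can only train thousand-layer residual networks on CIFAR, just for the fun of it.)
- Less computation (computationally efficient), since each stage trains only a shallow model.
- Because only shallow models need to be trained, there is more freedom in the choice of optimization method (including non-SGD methods).
- In addition, the number of layers can be determined dynamically according to how training is going.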

Reposted from blog.csdn.net/u010067397/article/details/84929520
