系列论文阅读之知识蒸馏（二）《FitNets : Hints for Thin Deep Nets》

其他 2020-03-15 14:11:17 阅读次数: 0

本文成果：

从一个wide and deep的网路蒸馏成一个thin and deeper的网络。

主要的方法如下图所示：

实际上是在KD的基础上，增加了一个中间层的知识蒸馏。

以下是KD的主要方法：

训练要点：

两个loss function:

（1）Teacher网络的某一中间层的权值为Wt=Whint，Student网络的某一中间层的权值为Ws=Wguided。使用一个映射函数Wr来使得Wguided的维度匹配Whint，得到Ws'。其中对于Wr的训练使用MSEloss：

扫描二维码关注公众号，回复： 9830319 查看本文章

(2) 另外一个是改造的softmax loss（具体见Hinton的论文）:

liqiming100

发布了61 篇原创文章 · 获赞 12 · 访问量 6万+

私信关注

猜你喜欢

转载自blog.csdn.net/liqiming100/article/details/88935353

系列论文阅读之知识蒸馏（二）《FitNets : Hints for Thin Deep Nets》

知识蒸馏（Distillation）相关论文阅读（3）—— FitNets : Hints for Thin Deep Nets

FitNets: Hints for thin deep nets论文笔记

FitNets: Hints for Thin Deep Nets 原理与代码解析

Distillation论文总结（1）Do Deep Nets Really Need to be Deep?

A Fast Learning Algorithm for Deep Belief Nets - 论文学习

Do Deep Nets Really Need to be Deep?

LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks 论文阅读

文章阅读：Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

深度学习论文（九）---DeepLabV2-Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution,

深度学习论文（八）---DeepLabV1-SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED C

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval 论文笔记

论文：SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

论文笔记-DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification

论文笔记：DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution,and......

SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS 论文精读

Training Deep Nets with Sublinear Memory Cost

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

论文阅读-位姿估计-SE3-Nets Learning Rigid Body Motion using Deep Neural Networks

论文阅读笔记十：DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs (DeepLabv2)

Hints

论文阅读——《Generative Adversarial Nets》

Deep Belief Nets in C++ and CUDA C: Volume 3: Convolutional Nets 免积分下载

【deeplab】Semantic Image Segmentation with Deep Convolutional Nets and Fully

【Deep Learning】SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Oracle 表连接之Hints

论文阅读——《Conditional Generative Adversarial Nets》

Python type hints 之 Optional，Union

Generative Adversarial Nets (GAN) 阅读笔记

CGAN论文解读：Conditional Generative Adversarial Nets

今日推荐

周排行

(BIND最佳实践)Linux运维最佳实践

makefile ifeq之坑: 1. syntax error near unexpected token 2. *** missing separator. Stop.

easyui datagrid操作栏内置图片按钮

SQLyog连接MySQL时出现的2058错误解决方法

linux音频开发

hashcode方法简析

SpringBoot中使用Transaction注解遇到的坑

逆战-CSS中子元素在父元素中的4种水平垂直居中方法

Expression.Blend.4 Chapter 图片和视频的使用

springMVC返回void值

每日归档

2024-09-17(0)

2024-09-16(0)

2024-09-15(0)

2024-09-14(0)

2024-09-13(0)

2024-09-12(0)

2024-09-11(0)

2024-09-10(0)

2024-09-09(0)

2024-09-08(0)