[Knowledge Distillation] A Brief Introduction to Data Efficient Stagewise Knowledge Distillation

Enterprise Development 2023-10-03 02:53:33

Table of Contents

Introduction
Method

Paper: https://arxiv.org/pdf/1911.06786v3.pdf
Source code: https://github.com/IvLabs/stagewise-knowledge-distillation

Introduction

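The knowledge distillation method in this paper (SKD) is an intermediate-layer distillation method. What sets it apart is stagewise training: when the student network is trained, only one part (block) is trained at a time while all other blocks are frozen. With this approach, the student can achieve a good accuracy improvement even with a small amount of training data.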

Method

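The student model is trained with an MSE loss, where $y_{\theta}(i,j)$ and $y_{\phi}(i,j)$ are the feature maps of the teacher model and the student model, respectively, at the $i$-th layer.
$L(y_{\theta},y_{\phi})=\sum\limits_{j=1}^{M}\lVert y_{\theta}(i,j)-y_{\phi}(i,j)\rVert_{2}$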
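
As a rough illustration of the per-block loss, the sketch below evaluates the sum of L2 distances between paired teacher and student feature maps in plain Python. Several things here are assumptions rather than details from the post: each feature map is treated as an already flattened list of floats, the index $j$ is read as running over the $M$ paired feature maps, and the names `l2_distance` and `stage_loss` are hypothetical. The code follows the formula as written (an un-squared norm); a conventional MSE would instead square the differences and average them.

```python
import math


def l2_distance(a, b):
    """Euclidean (L2) norm of the element-wise difference of two flat vectors."""
    if len(a) != len(b):
        raise ValueError("feature maps must have the same number of elements")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def stage_loss(teacher_feats, student_feats):
    """Sum of L2 distances over the M paired feature maps of one block.

    teacher_feats / student_feats: lists of M flattened feature maps
    (assumed layout, chosen only for this illustration).
    """
    if len(teacher_feats) != len(student_feats):
        raise ValueError("teacher and student must provide the same number of maps")
    return sum(l2_distance(t, s) for t, s in zip(teacher_feats, student_feats))


# Toy data: M = 2 feature maps with two elements each.
teacher = [[1.0, 2.0], [0.0, 0.0]]
student = [[1.0, 0.0], [3.0, 4.0]]
loss = stage_loss(teacher, student)
print(loss)
```

In a real training loop the same quantity would normally be computed on tensors by the deep learning framework so that gradients can flow into the student block being trained.
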
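For classification tasks, every block except the final classifier block is trained one at a time in this way. The classifier block is trained separately: it does not use the teacher model and relies only on the label information.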
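
The stagewise schedule described here can be sketched as a simple plan that records, for each stage, which block is trainable, which blocks are frozen, and where the training signal comes from. This is only an illustrative sketch: the block names, the `Stage` record, and `build_stage_plan` are hypothetical and do not come from the paper or its repository, and no actual model is built or trained.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    trainable: str    # the single student block updated in this stage
    frozen: tuple     # every other student block, kept fixed
    supervision: str  # "teacher_features" or "labels"


def build_stage_plan(feature_blocks, classifier="classifier"):
    """Return one Stage per block, with the classifier trained last on labels only."""
    blocks = list(feature_blocks) + [classifier]
    plan = []
    for name in blocks:
        frozen = tuple(b for b in blocks if b != name)
        # Feature blocks imitate the teacher's feature maps;
        # the classifier block sees only the ground-truth labels.
        supervision = "labels" if name == classifier else "teacher_features"
        plan.append(Stage(trainable=name, frozen=frozen, supervision=supervision))
    return plan


# Hypothetical student with three feature blocks and one classifier block.
plan = build_stage_plan(["block1", "block2", "block3"])
for stage in plan:
    print(stage.trainable, stage.supervision, stage.frozen)
```

In a framework such as PyTorch, freezing a block typically amounts to disabling gradient updates for its parameters while the current stage's block is optimized.
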

Reposted from blog.csdn.net/qgh1223/article/details/112758930