DAFL: Data-Free Learning of Student Networks

Category: Other · 2020-04-04 14:42:32

Paper: https://arxiv.org/pdf/1904.01186.pdf
Code: https://github.com/huawei-noah/Data-Efficient-Model-Compression/tree/master/DAFL

Goal: compressing deep models without training data.
Method: a data-free teacher-student paradigm that exploits a GAN.

The overall structure of the method is shown in the figure:
[Figure: overall framework of DAFL]

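Two-stage training

1. The trained teacher network serves as a fixed discriminator. A batch of random vectors is fed to the generator G to produce images, and G is then optimized through the teacher network using the loss $L_{Total}$.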

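The parameters of the original network D are fixed while training G.
Here D = T, but the two play different roles:
D (in a vanilla GAN): judges whether a generated image is real or fake
T: predicts the class of an image
The vanilla GAN loss therefore does not apply, and the paper combines the following three losses into $L_{Total}$.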

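one-hot loss function
Let $y_T^i$ be the teacher network's output for the $i$-th generated image and $t^i = \arg\max_j (y_T^i)_j$ its pseudo-label; the loss is the average cross-entropy between them, $L_{oh} = \frac{1}{n}\sum_i H_{cross}(y_T^i, t^i)$. If the images produced by G follow the same distribution as the teacher's training data, the teacher's outputs on them should also resemble its outputs on the training data. The one-hot loss therefore pushes the teacher's outputs on generated images toward one-hot-like vectors. In other words, the goal is synthetic images that are fully compatible with the teacher network, rather than generic realistic images suited to any scenario.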
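The one-hot loss described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the repository's implementation: the helper names `softmax` and `one_hot_loss` and the toy logits are assumptions made for this example, and the teacher outputs are treated as raw logits that are passed through a softmax first.

```python
import math

def softmax(logits):
    """Numerically stable softmax over one list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def one_hot_loss(teacher_logits):
    """Mean cross-entropy between teacher outputs and their own argmax pseudo-labels."""
    losses = []
    for logits in teacher_logits:
        probs = softmax(logits)
        # Pseudo-label: the class the teacher itself finds most likely.
        pseudo_label = max(range(len(probs)), key=probs.__getitem__)
        losses.append(-math.log(probs[pseudo_label]))
    return sum(losses) / len(losses)

# Hypothetical teacher logits for two generated images each.
confident = [[8.0, 0.0, 0.0], [0.0, 9.0, 0.0]]
uncertain = [[0.1, 0.0, 0.1], [0.0, 0.1, 0.0]]
print(one_hot_loss(confident) < one_hot_loss(uncertain))
```

Images on which the teacher is confident give a loss near zero, so minimizing this term steers the generator toward images the teacher classifies decisively.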
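activation value loss
When the input is a real image rather than a random vector, the feature maps tend to receive higher activations. With $f_T^i$ denoting the features the teacher extracts from the $i$-th generated image, the loss rewards large activations:

$L_a = -\frac{1}{n}\sum_i \|f_T^i\|_1$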
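As a rough sketch of the activation value loss, the snippet below takes the negative mean L1 norm of the teacher's feature vectors. The function name `activation_loss` and the toy feature values are assumptions for illustration; real feature maps would be flattened tensors from a layer of the teacher network.

```python
def activation_loss(features):
    """Negative mean L1 norm of teacher feature vectors (one flat list per image)."""
    l1_norms = [sum(abs(v) for v in f) for f in features]
    return -sum(l1_norms) / len(l1_norms)

# Hypothetical flattened features for two images each.
weak = [[0.1, 0.0, 0.2], [0.0, 0.1, 0.0]]
strong = [[2.0, 1.5, 0.0], [1.0, 0.0, 3.0]]
print(activation_loss(weak), activation_loss(strong))
```

Stronger activations make the loss more negative, so minimizing it encourages generated images that excite the teacher's filters the way real images would.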

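entropy loss
The classes in the training data are roughly balanced, so an entropy loss is used to measure the class balance of the generated images. The information entropy $H_{info}(p) = -\frac{1}{k}\sum_j p_j \log p_j$ of a probability vector $p$ over $k$ classes reaches its maximum when every entry equals $1/k$. When the loss is at its minimum, every element of $\frac{1}{n}\sum_i y_T^i$ equals $\frac{1}{k}$, meaning G generates images of each class with roughly equal probability. Minimizing $L_{ie}$ therefore yields a set of generated samples with balanced classes.

$L_{ie} = -H_{info}\left(\frac{1}{n}\sum_i y_T^i\right)$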
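The entropy loss can be illustrated with a small sketch: average the teacher's class probabilities over the batch, compute the entropy of that average, and negate it. The name `information_entropy_loss`, the natural logarithm, and the $1/k$ scaling are choices made for this example; the scaling and log base only change the loss by a constant factor, not where its minimum lies.

```python
import math

def information_entropy_loss(teacher_probs):
    """Negative entropy of the batch-mean prediction; lowest when classes are balanced."""
    n = len(teacher_probs)
    k = len(teacher_probs[0])
    # Average predicted probability of each class over the batch.
    mean_probs = [sum(p[j] for p in teacher_probs) / n for j in range(k)]
    # Terms with zero probability contribute nothing to the entropy.
    entropy = -sum(q * math.log(q) for q in mean_probs if q > 0) / k
    return -entropy

# Hypothetical teacher probabilities for two generated images each.
balanced = [[1.0, 0.0], [0.0, 1.0]]   # one image per class
collapsed = [[1.0, 0.0], [1.0, 0.0]]  # every image lands in class 0
print(information_entropy_loss(balanced), information_entropy_loss(collapsed))
```

The balanced batch gets the lower loss, which is why minimizing this term discourages the generator from collapsing onto a single class.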

$L_{Total} = L_{oh} + \alpha L_a + \beta L_{ie}$
$\alpha$ and $\beta$ are hyperparameters that balance the three terms.

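2. Knowledge distillation is used to transfer knowledge from the teacher network to the student network, using the KD loss, where $y_S^i$ and $y_T^i$ are the student and teacher outputs on the $i$-th generated image:

$L_{KD} = \frac{1}{n}\sum_i H_{cross}(y_S^i, y_T^i)$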
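A minimal sketch of the distillation step's loss follows, assuming the KD loss is the mean cross-entropy of the student's predicted distribution against the teacher's soft targets on the same generated images. The helper names `log_softmax` and `kd_loss` and the toy logits are illustrative assumptions; a framework implementation may use an equivalent KL-divergence form instead.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_z for z in logits]

def kd_loss(student_logits, teacher_logits):
    """Mean cross-entropy of student predictions against teacher soft targets."""
    total = 0.0
    for s, t in zip(student_logits, teacher_logits):
        log_p_student = log_softmax(s)
        p_teacher = [math.exp(v) for v in log_softmax(t)]
        total -= sum(q * lp for q, lp in zip(p_teacher, log_p_student))
    return total / len(student_logits)

# Hypothetical logits for one generated image.
teacher = [[2.0, 0.5, -1.0]]
matching = [[2.0, 0.5, -1.0]]
mismatched = [[-1.0, 0.5, 2.0]]
print(kd_loss(matching, teacher) < kd_loss(mismatched, teacher))
```

For a fixed teacher distribution the cross-entropy is smallest when the student reproduces it, so minimizing this loss pulls the student's outputs toward the teacher's.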

Algorithm
[Figure: algorithm listing from the paper]

Experiments

[Figures: experimental results from the paper]

Author: Schnee_y


Reposted from blog.csdn.net/sinat_34686158/article/details/104253947
