19-Self-supervised-Visual-Feature-Learning-with-Deep-Neural-Networks-A-Survey - 代码天地

19-Self-supervised-Visual-Feature-Learning-with-Deep-Neural-Networks-A-Survey

编程语言 2019-05-05 20:21:04 阅读次数: 0

who

Longlong Jing and Yingli Tian ∗ , Fellow, IEEE
2019-

what

为了避免收集和注释大规模数据集的大量成本，作为无监督学习方法的子集，提出了自我监督学习方法，以从大规模未标记数据中学习一般图像和视频特征，而无需使用任何人工标注的标签。

一些术语

1. Pseudo label:

伪标签是基于pretext tasks的数据属性自动生成的标签。

2. Pretext Task

Pretext tasks 是网络要解决的预先设计的任务，通过学习Pretext tasks 的目标函数来学习视觉特征。

3. Downstream Task

用于评估自我监督学习所学习的特征的质量。
需要人工标注的标签来解决Downstream Task。
在某些应用程序中，Downstream Task可以与Pretext tasks 一样不使用任何人工注释标签。

4. Self-supervised Learning

无监督学习方法的一个子集。
学习方法，其中使用自动生成的标签明确训练ConvNets；

本综述仅关注视觉特征的自我监督学习方法

where

动机

1. 经过预先训练的模型，并针对其他任务进行了调整，主要有两个原因

从大规模不同数据集中学习的参数提供了一个很好的起点，因此，对其他任务的网络训练可以更快地收敛；
在大规模数据集上训练的网络已经学习了层次结构特征，这有助于减少其他任务训练期间的过拟合问题，特别是当其他任务的数据集很小或者训练标签很少时。

2. 要从未标记的数据中学习视觉特征

为了避免耗时且昂贵的数据标注；
一种流行的解决方案是提出网络要解决的各种pretext tasks，同时通过学习pretext tasks的目标函数来训练网络，并通过该过程学习特征。

3. pretext tasks共享两个共同属性

ConvNets需要捕获图像或视频的视觉特征来解决pretext tasks，
可以基于图像或视频的属性自动生成用于pretext tasks的伪标签。

整体思路框架

创新

据我们所知，这是第一个关于深度ConvNets的自我监督视觉特征学习的全面调查，这将有助于该领域的研究人员。
深入审查最近开发的自我监督学习方法和数据集。
提供了定量性能分析和现有方法的比较。

不同学习方法的函数

1. 监督学习函数

2. 半监督学习函数

3. 弱监督学习函数

4. Self-supervised Learning

自我监督学习也用数据 $X _{i}$ 及其伪标签 $p_{i}$ 训练，而 $p_{i}$ 是为预先定义的Pretext tasks自动生成的，不涉及任何人类注释。
伪标签 $p_{i}$ 可以通过使用图像或视频的属性来生成，例如图像的上下文，或者通过传统的手工设计方法。

how

从Pretext任务学习视觉特征

整体架构

步骤
1. ConvNets和视觉特征可以通过完成这个pretext task来学习到。
2. 可以在没有人类标注的情况下自动生成用于pretext task的伪标签P.
3. 通过最小化ConvNet O和伪标签P的预测之间的误差来优化ConvNet；
4. 在完成pretext task的训练之后，获得可以捕获图像或视频的视觉特征的ConvNet模型。

一般的pretext task

1. 基于生成的方法

Visual features are learned through the process of image generation tasks.
This type of methods includes
- image colorization [18],
- image super resolution [15],
- image inpainting
- image generation with Generative Adversarial Networks (GANs)

2. Context-based pretext tasks

Context Similarity
- image clusteringbased methods
- graph constraint-based methods
Spatial Context Structure
- image jigsaw puzzle
- context prediction
- geometric transformation recognition

Commonly Used Downstream Tasks for Evaluation

为了通过自我监督方法评估学习图像或视频特征的质量，采用自我监督学习的学习参数作为预训练模型，然后对Downstream Tasks进行调整，如图像分类，语义分割，

1. 选择图像分类作为Downstream Tasks来评估从自我监督学习方法中学习的图像特征的质量

自我监督学习模型应用于每个图像以提取特征，
然后用于训练分类器，如支持向量机（SVM）

2. e.g. image colorizaion任务

将灰度图像着色为彩色图像的任务。
the data X is the 通过RGB图像线性变换得来的gray-scale images；
pseudo label P is the RGB image itself.
对于图像分类任务的学习过程

IMAGE FEATURE LEARNING

1. Generation-based Image Feature Learning

Image Generation with GAN

Image Generation with Inpainting

2. Context-Based Image Feature Learning

簇在特征空间中具有较小的距离，并且来自不同簇的图像在特征空间中具有较大的距离。
可以训练ConvNet使用群集分配作为伪类标签对数据进行分类。

Performance of Image Feature Learning

训练pretext task，得到网络的特征：
- 使用AlexNet作为基础网络训练ImageNet数据集，而不使用类别标签。
处理down stream任务得到评估结果；
- 在ImageNet的训练中，在ConvNet的不同冻结卷积层上训练线性分类器；

得到三个结论
1. 来自不同层次的特征总是受益于自我监督的前期任务训练。自我监督学习方法的表现总是优于从头开始训练的模型的表现。
2. 所有自我监督的方法都能很好地利用conv3和conv4层的特性，同时使用conv1，conv2和conv5层的特性表现更差。这可能是因为浅层捕获了一般的低级特征，而深层捕获了与任务相关的特征。
3. 当用于pretext task训练的数据集与down stream的数据集之间存在域差距时，自监督学习方法能够与使用ImageNet标签训练的模型达到相当的性能。

猜你喜欢

转载自blog.csdn.net/u010067397/article/details/89846790

19-Self-supervised-Visual-Feature-Learning-with-Deep-Neural-Networks-A-Survey

Neural Networks and Deep Learning

A Survey of the Recent Architectures of Deep Convolutional Neural Networks

标签噪声：综述 Learning from Noisy Labels with Deep Neural Networks: A Survey

[论文阅读笔记58]Learning from Noisy Labels with Deep Neural Networks：A Survey

《Neural networks and deep learning》概览

Neural Networks and Deep Learning(1)

Neural networks and deep learning 概览

Neural Networks and Deep Learning 整理

Neural Networks and Deep Learning 笔记

Deep learning - Introduction to Neural Networks

《graph self- supervised learning：a survey》论文阅读

【Deep Learning】Sequence to Sequence Learning with Neural Networks

DySAT: Deep Neural Representation Learning on Dynamic Graph via Self-Attention Networks

《Deep Learning for Visual Tracking: A Comprehensive Survey》

NEURAL NETWORKS（neural networks and deep learning by Charu C. Aggarwa）

Neural Networks and Deep Learning (Week 3)——Shallow neural networks

Neural Networks and Deep Learning (Week 2)——Neural Networks Basics

Neural Networks and Deep Learning-引论

neural networks and deep learning 学习笔记

Neural Networks and Deep Learning 整理（二）

Neural Networks and Deep Learning 整理（三）

Neural Networks and Deep Learning A Textbook 2018.8

neural networks and deep learning 笔记（一）

读论文：Deep Neural Networks with Multitask Learning

Neural Networks and Deep Learning.2018.8

COMP9444 Neural Networks and Deep Learning

《Neural Networks and Deep Learning》课程笔记

【论文阅读笔记】---《A Survey of Model Compression and Acceleration for Deep Neural Networks》

Coursera, Deep Learning 1, Neural Networks and Deep Learning - week4, Deep Neural Networks

今日推荐

周排行

四大线程池详解

如何高效使用Vim

Mogodb的常用操作总结

Spyder默认页面布局调整

SAR日志分析

OAuth是一个关于授权（authorization）的开放网络标准，在全世界得到广泛应用，目前的版本是2.0版。本文对OAuth 2.0的设计思路和运行流程，做一个简明通俗的解释，主要参考材料为R

WebService中注解开发，CXF，Spring整合，Rest风格

2019考研英语一 Text1分析

windows下安装docker详细步骤

CentOS 7/6系统升级内核版本到5.2.2

每日归档

更多

2024-08-05(0)

2024-08-04(0)

2024-08-03(0)

2024-08-02(0)

2024-08-01(0)

2024-07-31(0)

2024-07-30(0)

2024-07-29(0)

2024-07-28(0)

2024-07-27(0)