CVPR2018】论文整理(收藏这一篇就够了)

CVPR 2018

CVPR作为CV界最受关注的三大顶会之一，每一个CVer都应该好好关注CVPR的论文。CVPR2018在今年6月18日-22日在美国盐湖城举行。

先介绍一下CVPR2018的一些数据：

今年一共收到3309篇文章，其中979篇被录用。投录比约为29.5%。
收录论文按专家评分，分为三个层次：Poster, Spotlight, Oral。
Spotlight(亮点论文)一共有224篇，占收录论文(224/979)的22.88%。
Oral(演示论文)一共有70篇，占收录论文(70/979)的7.1%。

用一张韦恩图表示收录文章占比：

所以说，不光中篇CVPR难，中篇spotlight更难，中篇oral基本可以说是灰常难了。就这么说吧，今年国内所有高校加起来中的CVPR oral是个位数。

当然，最牛的还是Best paper 和best student paper，只会分别选出1篇。

今年的best paper给了来自Stanford和Berkeley的合作论文，论文标题为：

Taskonomy: Disentangling Task Transfer Learning

下载地址为：https://arxiv.org/abs/1804.08328

最佳学生论文来自CMU，标题为：

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

下载地址为：https://arxiv.org/abs/1801.01615v1

当然，就像奥斯卡颁奖一样，最佳论文奖提名也可以突出文章质量很高。今年四篇最佳论文提名奖如下：

标题 
第一单位 
下载地址


 Deep_Learning_of_Graph_Matching 
 Lund University

http://openaccess.thecvf.com
/content_cvpr_2018
/CameraReady/1830.pdf

SPLATNet: Sparse Lattice Networks for Point Cloud Processing 
UMass Amherst
 https://arxiv.org/pdf/1802.08275.pdf


 CodeSLAM-learning a Compact, Optimisable Representation for Dense Visual SLAM 
 帝国理工
  https://arxiv.org/pdf/1804.00874.pdf 


Efficient Optimization for Rank-based Loss Functions 
IIIT Hyderabad
 https://arxiv.org/pdf/1604.08269.pdf

所以，客观认为的论文含金量是：
best paper (2篇) > honorable mention(提名奖 4篇) > Oral (70篇) > Spotlight(224篇) > poster(其他)

CVPR2018虽好，可不要贪杯，一共有979篇，每天看1篇也得看3年，待你看完之日也是算法过时之时。所以，给各位CVer(包括自己)一些建议：

从高质量论文开始看，至少优先看spotlight或者oral论文。
在自己的领域找论文看，别想做什么CVPR的集大成者，如果你是CVPR oral大神，那么当我这条没说过。
哪里有CVPR论文分享会就去听，听原作者自己讲一个小时，比自己看一礼拜更管用。如果没有现场版，看看视频也是好的。

最后

附上68篇oral论文标题：（文末有下载链接）

DensePose: Multi-Person Dense Human Pose Estimation In The Wild

Context Encoding for Semantic Segmentation

Augmented Skeleton Space Transfer for Depth-based Hand Pose Estimation

Semi-parametric Image Synthesis

Practical Block-wise Neural Network Architecture Generation

Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

Illuminant Spectra-based Source Separation Using Flash Photography

SPLATNet: Sparse Lattice Networks for Point Cloud Processing

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

Deep Layer Aggregation

Left-Right Comparative Recurrent Model for Stereo Matching

Analytic Expressions for Probabilistic Moments of PL-DNN with Gaussian Input

An Analysis of Scale Invariance in Object Detection - SNIP

Finding Tiny Faces in the Wild with Generative Adversarial Network

Taskonomy: Disentangling Task Transfer Learning

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Finding “It”: Weakly-Supervised Reference-Aware Visual Grounding in Instructional Video

Unsupervised Discovery of Object Landmarks as Structural Representations

Rotation Averaging and Strong Duality

Im2Flow: Motion Hallucination from Static Images for Action Recognition

Group Consistent Similarity Learning via Deep CRFs for Person Re-Identification

3D-RCNN: Instance-level 3D Scene Understanding via Render-and-Compare

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation

Squeeze-and-Excitation Networks

DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Single Depth Sensor

Learning to Find Good Correspondences

Actor and Action Video Segmentation from a Sentence

Maximum Classifier Discrepancy for Unsupervised Domain Adaptation

Detail-Preserving Pooling in Deep Networks

Convolutional Neural Networks with Alternately Updated Clique

Deep Learning of Graph Matching

Synthesizing Images of Humans in Unseen Poses

Neural Inverse Kinematics for Unsupervised Motion Retargetting

Direction-aware Spatial Context Features for Shadow Detection

Density Adaptive Point Set Registration

Hybrid Camera Pose Estimation

Relation Networks for Object Detection

Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Salient Objects

Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View

Polarimetric Dense Monocular SLAM

Wasserstein Introspective Neural Networks

The Perception-Distortion Tradeoff

Discriminative Learning of Latent Features for Zero-Shot Recognition

Photometric Stereo in Participating Media Considering Shape-Dependent Forward Scatter

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net

Trapping Light for Time of Flight

Feature Space Transfer for Data Augmentation

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250Hz

CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM

FlipDial: A Generative Model for Two-Way Visual Dialogue

OATM: Occlusion Aware Template Matching by Consensus Set Maximization

Surface Networks

VirtualHome: Simulating Household Activities via Programs

Egocentric Activity Recognition on a Budget

Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering

Efficient Optimization for Rank-based Loss Functions

MakeupGAN: Makeup Transfer via Cycle-Consistent Adversarial Networks

Revisiting Deep Intrinsic Image Decompositions

StarGAN: Unified Generative Adversarial Networks for Controllable Multi-Domain Image-to-Image Translation

Ordinal Depth Supervision for 3D Human Pose Estimation

Multi-Cell Classification by Convolutional Dictionary Learning with Class Proportion Priors

Accurate and Diverse Sampling of Sequences based on a “Best of Many” Sample Objective

MapNet: An Allocentric Spatial Memory for Mapping Environments

A Globally Optimal Solution to the Non-Minimal Relative Pose Problem

A Volumetric Descriptive Network for 3D Object Synthesis

Learning Face Age Progression: A Pyramid Architecture of GANs

我已经整理出所有oral文章，想打包下载的可以点击part1、part2

CVPR2018】论文整理(收藏这一篇就够了)

猜你喜欢