CVPR 2021: The Latest Top Papers, Updated Continuously (with Paper Links)

Computer Vision Research Institute column

Author: Edison_G

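This year's CVPR is gradually coming into view, and the list of accepted papers has been released, so readers can dig into the areas that interest them. CVPR is one of the three top conferences in computer vision. CVPR 2021 has now published the IDs of all accepted papers: 1,663 papers were accepted, for an acceptance rate of 23.7%. Although the acceptance rate is higher than last year's, competition remains fierce.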

We first list some of the better papers from previous years that we have covered, then share this year's newest top papers.

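CVPR Highlights | ATSS: State-of-the-Art Object Detection (source code download at the end of the post)

CVPR 2020 Best Detection | Few-Shot Object Detection Network with Attention-RPN and Multi-Relation Detector (source code and data available for download)

Code in Practice | CVPR 2020: Porting AdderNet (Adder Network) to a Detection Network (code shared)

CVPR 2020 Best New Architecture | Large-Scale Facial Expression Recognition (with source code)

CVPR 2020 | Self-Training with a Noisy Student Network to Improve ImageNet Classification

CVPR 2020 | Face Recognition Based on Universal Representation Learning (download link at the end of the post)

CVPR 2020 | A Lightweight Network That Surpasses MobileNetV3 (paper download at the end of the post)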

CVPR 2021

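The conference is devoted to computer vision and pattern recognition, including color detection, tracking, motion, object recognition, audio, and object detection.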

Image Object Detection

Instance Localization for Self-supervised Detection Pretraining

Multiple Instance Active Learning for Object Detection

Open-world object detection

Positive-Unlabeled Data Purification in the Wild for Object Detection

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Image-to-image Translation via Hierarchical Style Disentanglement
Xinyang Li, Shengchuan Zhang, Jie Hu, Liujuan Cao, Xiaopeng Hong, Xudong Mao, Feiyue Huang, Yongjian Wu, Rongrong Ji
https://arxiv.org/abs/2103.01456
https://github.com/imlixinyang/HiSD

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
https://arxiv.org/pdf/2012.08512.pdf
https://tarun005.github.io/FLAVR/Code
https://tarun005.github.io/FLAVR/

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Stephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer
https://arxiv.org/abs/2103.01486

Depth from Camera Motion and Object Detection
Brent A. Griffin, Jason J. Corso
https://arxiv.org/abs/2103.01468

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
https://arxiv.org/pdf/2011.09094.pdf

Multi-Stage Progressive Image Restoration
https://arxiv.org/abs/2102.02808
https://github.com/swz30/MPRNet

Weakly Supervised Learning of Rigid 3D Scene Flow
https://arxiv.org/pdf/2102.08945.pdf
https://3dsceneflow.github.io/

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning
Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
https://arxiv.org/abs/2103.01315

Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels
https://arxiv.org/abs/2101.05022
https://github.com/naver-ai/relabel_imagenet

Rethinking Channel Dimensions for Efficient Model Design
https://arxiv.org/abs/2007.00992
https://github.com/clovaai/rexnet

Coarse-Fine Networks for Temporal Activity Detection in Videos
Kumara Kahatapitiya, Michael S. Ryoo
https://arxiv.org/abs/2103.01302

A Deep Emulator for Secondary Motion of 3D Characters
Mianlun Zheng, Yi Zhou, Duygu Ceylan, Jernej Barbic
https://arxiv.org/abs/2103.01261

Fair Attribute Classification through Latent Space De-biasing
https://arxiv.org/abs/2012.01469
https://github.com/princetonvisualai/gan-debiasing
https://princetonvisualai.github.io/gan-debiasing/

Auto-Exposure Fusion for Single-Image Shadow Removal
Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang
https://arxiv.org/abs/2103.01255

Less is More: CLIPBERT for Video-and-Language Learning via Sparse Sampling
https://arxiv.org/pdf/2102.06183.pdf
https://github.com/jayleicn/ClipBERT

MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing
Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan
https://arxiv.org/abs/2103.01786

GAN/Generative/Adversarial

Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

A 3D GAN for Improved Large-pose Facial Recognition

AttentiveNAS: Improving Neural Architecture Search via Attentive
https://arxiv.org/pdf/2011.09011.pdf

Diffusion Probabilistic Models for 3D Point Cloud Generation
Shitong Luo, Wei Hu
https://arxiv.org/abs/2103.01458

There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Francisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada
https://arxiv.org/abs/2103.01353
http://rl.uni-freiburg.de/research/multimodal-distill

Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
https://arxiv.org/abs/2008.00951
https://github.com/eladrich/pixel2style2pixel
https://eladrich.github.io/pixel2style2pixel/

Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Xin Ye, Yezhou Yang
https://arxiv.org/abs/2103.01350

RepVGG: Making VGG-style ConvNets Great Again
https://arxiv.org/abs/2101.03697
https://github.com/megvii-model/RepVGG

Transformer Interpretability Beyond Attention Visualization
https://arxiv.org/pdf/2012.09838.pdf
https://github.com/hila-chefer/Transformer-Explainability

PREDATOR: Registration of 3D Point Clouds with Low Overlap
https://arxiv.org/pdf/2011.13005.pdf
https://github.com/ShengyuH/OverlapPredator
https://overlappredator.github.io/
