What is global average pooling? The global average pooling layer

2023-05-08 22:12:58

一顆行走的大白菜

Quoting the explanation given in the Network in Network paper:

Instead of adopting the traditional fully connected layers for classification in CNN, we directly output the spatial average of the feature maps from the last mlpconv layer as the confidence of categories via a global average pooling layer, and then the resulting vector is fed into the softmax layer. In traditional CNN, it is difficult to interpret how the category level information from the objective cost layer is passed back to the previous convolution layer due to the fully connected layers which act as a black box in between. In contrast, global average pooling is more meaningful and interpretable as it enforces correspondence between feature maps and categories, which is made possible by a stronger local modeling using the micro network. Furthermore, the fully connected layers are prone to overfitting and heavily depend on dropout regularization [4] [5], while global average pooling is itself a structural regularizer, which natively prevents overfitting for the overall structure.
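To make the overfitting remark concrete, here is a rough parameter count for the two kinds of classification head. It is only a sketch: the sizes (10 categories, one 8x8 feature map per category) and both function names are assumptions chosen for illustration, not figures from the paper.

```python
def fully_connected_params(num_maps, height, width, num_classes):
    """Weights plus biases of one dense layer applied to the flattened feature maps."""
    flattened_inputs = num_maps * height * width
    return flattened_inputs * num_classes + num_classes


def global_average_pooling_params():
    """Averaging has nothing to learn, so the count is always zero."""
    return 0


# Hypothetical sizes, not taken from the paper: 10 categories, one 8x8 map each.
NUM_CLASSES = 10
fc_count = fully_connected_params(
    num_maps=NUM_CLASSES, height=8, width=8, num_classes=NUM_CLASSES
)
gap_count = global_average_pooling_params()

print("fully connected head:", fc_count)
print("global average pooling head:", gap_count)
```

Even at this small size the dense head carries thousands of trainable weights, while the pooling head carries none, which is the sense in which it acts as a structural regularizer.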

In this paper, we propose another strategy called global average pooling to replace the traditional fully connected layers in CNN. The idea is to generate one feature map for each corresponding category of the classification task in the last mlpconv layer. Instead of adding fully connected layers on top of the feature maps, we take the average of each feature map, and the resulting vector is fed directly into the softmax layer. One advantage of global average pooling over the fully connected layers is that it is more native to the convolution structure by enforcing correspondences between feature maps and categories. Thus the feature maps can be easily interpreted as categories confidence maps. Another advantage is that there is no parameter to optimize in the global average pooling thus overfitting is avoided at this layer. Furthermore, global average pooling sums out the spatial information, thus it is more robust to spatial translations of the input.
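The claim about spatial translations can be checked on a toy feature map. The sketch below is an idealized case, assuming a circular shift so that no activation leaves the frame; with a real translation, content that moves past the border would change the average slightly. The helper names and the 4x4 map are made up for the illustration.

```python
import math


def global_average(fmap):
    """Mean of every value in a 2-D feature map given as a list of rows."""
    values = [v for row in fmap for v in row]
    return sum(values) / len(values)


def circular_shift(fmap, dy, dx):
    """Move the map content down by dy and right by dx, wrapping at the borders."""
    h, w = len(fmap), len(fmap[0])
    return [[fmap[(y - dy) % h][(x - dx) % w] for x in range(w)] for y in range(h)]


def flatten(fmap):
    """The position-dependent vector a fully connected layer would receive."""
    return [v for row in fmap for v in row]


# Hypothetical 4x4 feature map with one small activated region.
fmap = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 9.0, 1.0, 0.0],
    [0.0, 2.0, 3.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
shifted = circular_shift(fmap, dy=1, dx=1)

same_average = math.isclose(global_average(fmap), global_average(shifted))
same_flat_vector = flatten(fmap) == flatten(shifted)

print("pooled value unchanged:", same_average)
print("flattened vector unchanged:", same_flat_vector)
```

The pooled value stays the same because averaging ignores where the activation sits, whereas the flattened vector that a fully connected layer would see is rearranged by the shift.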

We can see global average pooling as a structural regularizer that explicitly enforces feature maps to be confidence maps of concepts (categories). This is made possible by the mlpconv layers, as they make better approximations to the confidence maps than GLMs.

This concept comes from the Network in Network paper.

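Global average pooling is mainly used to address the problems of fully connected layers. It average-pools each feature map of the last layer over the entire map, reducing that map to a single feature value, and these values are assembled into the final feature vector that is passed to softmax. For example, suppose the last layer outputs 1000 feature maps of size 224*224. Global average pooling computes the mean over all pixels of each feature map and outputs one value per map, so the 1000 feature maps produce 1000 values. These values form a 1000-dimensional vector, which can then be fed into the softmax classifier.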
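The example can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not a reference implementation: the function names are made up, the feature maps are filled with random numbers, and the maps are shrunk to 8x8 (instead of 224*224) so the pure-Python loops stay fast. The pooling step itself does not depend on the spatial size.

```python
import math
import random


def global_average_pooling(feature_maps):
    """Reduce each 2-D feature map (a list of rows) to the mean of all its values."""
    pooled = []
    for fmap in feature_maps:
        total = sum(sum(row) for row in fmap)
        count = sum(len(row) for row in fmap)
        pooled.append(total / count)
    return pooled


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    norm = sum(exps)
    return [e / norm for e in exps]


# Assumed sizes: 1000 maps as in the text, but 8x8 rather than 224x224.
NUM_MAPS, HEIGHT, WIDTH = 1000, 8, 8
rng = random.Random(0)
feature_maps = [
    [[rng.gauss(0.0, 1.0) for _ in range(WIDTH)] for _ in range(HEIGHT)]
    for _ in range(NUM_MAPS)
]

pooled_vector = global_average_pooling(feature_maps)  # one value per feature map
class_probabilities = softmax(pooled_vector)          # scores over the 1000 categories

print("vector length:", len(pooled_vector))
print("probabilities sum to one:", math.isclose(sum(class_probabilities), 1.0))
```

Each feature map contributes exactly one entry to the vector, so the number of feature maps in the last layer has to equal the number of categories for the result to go straight into softmax.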
