簡單了解反向注意力(Reverse Attention)機制

2023-06-21 02:01:09

反向注意力(Reverse Attention)機制由《Reverse Attention for Salient Object Detection》一文提出。其核心思想為，在顯著目标檢測(二分割)網絡中，對象的大緻全局位置資訊在網絡的深層便可以獲得，是以Decoder的淺層部分隻需要關注對象的局部細節即可。具體做法則是，将decoder深層的輸出給取反，那麼網絡關注的位置即為對象以外的邊緣部分，進而使得最終結果局部細節更加出色。

Reverse Attention的結構如下圖所示：

代碼(取自原文github倉庫)如下：

class RA(nn.Module):
    def __init__(self, in_channel, out_channel):
        super(RA, self).__init__()
        self.convert = nn.Conv2d(in_channel, out_channel, 1)
        self.convs = nn.Sequential(
            nn.Conv2d(out_channel, out_channel, 3, padding=1), nn.ReLU(True),
            nn.Conv2d(out_channel, out_channel, 3, padding=1), nn.ReLU(True),
            nn.Conv2d(out_channel, out_channel, 3, padding=1), nn.ReLU(True),
            nn.Conv2d(out_channel, 1, 3, padding=1),
        )
        self.channel = out_channel
	
	# x:待被施加空間注意力的淺層特征
	# y:用于計算reverse attention map的深層特征
    def forward(self, x, y):
        a = torch.sigmoid(-y)	# reverse并壓縮至0~1區間内以用作空間注意力map
        x = self.convert(x)		# 統一x, y通道數
        x = a.expand(-1, self.channel, -1, -1).mul(x)	# x, y相乘，完成空間注意力
        y = y + self.convs(x)	# 殘差連接配接(圖中未畫出)
        return y

簡單了解反向注意力(Reverse Attention)機制

繼續閱讀

【顯著性物體檢測】【CVPR2018】Progressive Attention Guided Recurrent Network for Salient Object Detection【論文筆記】

生成對抗網絡GAN損失函數loss的簡單了解

C語言中的自增自減運算符詳解，printf等函數的應用，及其源碼等前言

[論文閱讀] TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

[論文閱讀] Suggestive Annotation: A Deep Active Learning Framework for Biomedical Image Segmentation

[論文閱讀] TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation

[論文閱讀] Conformer: Local Features Coupling Global Representations for Visual Recognition

Data structure of the experimental order of a: a row of fast row（learning quick sorting）.

Pytorch nn.BCEWithLogitsLoss()的簡單了解與用法

[論文閱讀] Deep Automatic Natural Image Matting

[論文閱讀] A Late Fusion CNN for Digital Matting

簡單入門了解半監督中的Mean Teacher

[論文閱讀] Multi-Task Learning for Thyroid Nodule Segmentation with Thyroid Region Prior

檢視英偉達NVIDIA顯示卡型号

[論文閱讀] Unifying Global-Local Representations in Salient Object Detection with Transformer

Python set()函數的簡單用法