EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation-LGAG

Posted by iceeci on 2024-11-09

Paper
Code

```python
import torch
import torch.nn as nn
from functools import partial
from torch.nn.init import trunc_normal_
import math
from timm.models.helpers import named_apply

def act_layer(act, inplace=False, neg_slope=0.2, n_prelu=1):
    # Build an activation layer selected by name.
    act = act.lower()
    if act == 'relu':
        layer = nn.ReLU(inplace)
    elif act == 'relu6':
        layer = nn.ReLU6(inplace)
    elif act == 'leakyrelu':
        layer = nn.LeakyReLU(neg_slope, inplace)
    elif act == 'prelu':
        layer = nn.PReLU(num_parameters=n_prelu, init=neg_slope)
    elif act == 'gelu':
        layer = nn.GELU()
    elif act == 'hswish':
        layer = nn.Hardswish(inplace)
    else:
        raise NotImplementedError('activation layer [%s] is not found' % act)
    return layer
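# e.g. act_layer('leakyrelu', inplace=True) returns nn.LeakyReLU(0.2, inplace=True)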

def _init_weights(module, name, scheme=''):
    # Weight initialization, applied recursively via timm's named_apply.
    if isinstance(module, nn.Conv2d) or isinstance(module, nn.Conv3d):
        if scheme == 'normal':
            nn.init.normal_(module.weight, std=.02)
            if module.bias is not None:
                nn.init.zeros_(module.bias)
        elif scheme == 'trunc_normal':
            trunc_normal_(module.weight, std=.02)
            if module.bias is not None:
                nn.init.zeros_(module.bias)
        elif scheme == 'xavier_normal':
            nn.init.xavier_normal_(module.weight)
            if module.bias is not None:
                nn.init.zeros_(module.bias)
        elif scheme == 'kaiming_normal':
            nn.init.kaiming_normal_(module.weight, mode='fan_out', nonlinearity='relu')
            if module.bias is not None:
                nn.init.zeros_(module.bias)
        else:
            # EfficientNet-like fan-out initialization.
            fan_out = module.kernel_size[0] * module.kernel_size[1] * module.out_channels
            fan_out //= module.groups
            nn.init.normal_(module.weight, 0, math.sqrt(2.0 / fan_out))
            if module.bias is not None:
                nn.init.zeros_(module.bias)
    elif isinstance(module, nn.BatchNorm2d) or isinstance(module, nn.BatchNorm3d):
        nn.init.constant_(module.weight, 1)
        nn.init.constant_(module.bias, 0)
    elif isinstance(module, nn.LayerNorm):
        nn.init.constant_(module.weight, 1)
        nn.init.constant_(module.bias, 0)

```

Large-kernel grouped attention gate (LGAG)
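LGAG is an additive attention gate, but with grouped k×k convolutions (k = kernel_size, default 3) in place of the usual 1×1 projections, so the gate covers a larger local context at little extra cost. Reading off the forward pass in the code below (σ = sigmoid, φ = the chosen activation, ⊙ = element-wise multiplication):

ψ = σ(BN(Conv1×1(φ(BN(Conv_k(g)) + BN(Conv_k(x)))))),    LGAG(g, x) = x ⊙ ψ

Because ψ has a single channel, it broadcasts across the channels of x and acts as a spatial attention map.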

```python
class LGAG(nn.Module):
    '''
    Combines the feature map with attention coefficients to activate highly
    relevant features: a gating signal derived from higher-level features
    controls the information flow between different stages of the network.
    The LGAG mechanism effectively fuses information coming from the skip
    connection, capturing salient features over a larger local context at a
    lower computational cost. Use it wherever two tensors of the same shape
    need to be fused.
    '''
    def __init__(self, F_g, F_l, F_int, kernel_size=3, groups=1, activation='relu'):
        super(LGAG, self).__init__()

        if kernel_size == 1:
            groups = 1
        self.W_g = nn.Sequential(
            nn.Conv2d(F_g, F_int, kernel_size=kernel_size, stride=1, padding=kernel_size // 2, groups=groups,
                      bias=True),
            nn.BatchNorm2d(F_int)
        )
        self.W_x = nn.Sequential(
            nn.Conv2d(F_l, F_int, kernel_size=kernel_size, stride=1, padding=kernel_size // 2, groups=groups,
                      bias=True),
            nn.BatchNorm2d(F_int)
        )
        self.psi = nn.Sequential(
            nn.Conv2d(F_int, 1, kernel_size=1, stride=1, padding=0, bias=True),
            nn.BatchNorm2d(1),
            nn.Sigmoid()
        )
        self.activation = act_layer(activation, inplace=True)

        self.init_weights('normal')

    def init_weights(self, scheme=''):
        # Recursively apply the chosen initialization to every submodule.
        named_apply(partial(_init_weights, scheme=scheme), self)

    def forward(self, g, x):
        g1 = self.W_g(g)                # project the gating signal
        x1 = self.W_x(x)                # project the skip feature
        psi = self.activation(g1 + x1)  # additive attention
        psi = self.psi(psi)             # single-channel sigmoid attention map

        return x * psi                  # re-weight the skip feature

if __name__ == '__main__':
    in_dim = 128
    width = 4
    hidden_dim = in_dim // width

    device = 'cuda' if torch.cuda.is_available() else 'cpu'  # fall back to CPU
    block = LGAG(in_dim, in_dim, hidden_dim).to(device)
    g = torch.randn(3, 128, 64, 64, device=device)  # input: B C H W
    x = torch.randn(3, 128, 64, 64, device=device)  # input: B C H W
    output = block(g, x)

    print(g.size())
    print(output.size())

```
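For context, here is a minimal sketch of where such a gate typically sits in a decoder, assuming the LGAG class above is in scope; the channel sizes, group count, and upsampling choice are illustrative assumptions, not values from the paper. The gating signal g usually comes from the upsampled deeper decoder feature, and x from the encoder skip connection.

```python
import torch
import torch.nn as nn

# Hypothetical decoder stage (shapes illustrative): the deeper feature d is
# upsampled to the skip's spatial size and used as the gating signal for s.
up = nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False)
gate = LGAG(F_g=64, F_l=64, F_int=16, kernel_size=3, groups=16)

d = torch.randn(2, 64, 32, 32)   # deeper decoder feature
s = torch.randn(2, 64, 64, 64)   # encoder skip feature
attended = gate(up(d), s)

print(attended.size())           # torch.Size([2, 64, 64, 64]) -- same as s
```

The gate never changes the shape of x; the built-in demo above likewise prints torch.Size([3, 128, 64, 64]) for both g and the output.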
