An Efficient and Rapid Medical Image Segmentation Network | IEEE Journals & Magazine | IEEE Xplore

An Efficient and Rapid Medical Image Segmentation Network


Diagram of the internal structure of SHFormer and visualization of the computational flow in the Spatial-Channel Connection module.

Abstract:

Accurate medical image segmentation is an essential part of the medical image analysis process that provides detailed quantitative metrics. In recent years, extensions of...Show More

Abstract:

Accurate medical image segmentation is an essential part of the medical image analysis process that provides detailed quantitative metrics. In recent years, extensions of classical networks such as UNet have achieved state-of-the-art performance on medical image segmentation tasks. However, the high model complexity of these networks limits their applicability to devices with constrained computational resources. To alleviate this problem, we propose a shallow hierarchical Transformer for medical image segmentation, called SHFormer. By decreasing the number of transformer blocks utilized, the model complexity of SHFormer can be reduced to an acceptable level. To improve the learned attention while keeping the structure lightweight, we propose a spatial-channel connection module. This module separately learns attention in the spatial and channel dimensions of the feature while interconnecting them to produce more focused attention. To keep the decoder lightweight, the MLP-D module is proposed to progressively fuse multi-scale features in which channels are aligned using Multi-Layer Perceptron (MLP) and spatial information is fused by convolutional blocks. We first validated the performance of SHFormer on the ISIC-2018 dataset. Compared to the latest network, SHFormer exhibits comparable performance with 15 times fewer parameters, 30 times lower computational complexity and 5 times higher inference efficiency. To test the generalizability of SHFormer, we introduced the polyp dataset for additional testing. SHFormer achieves comparable segmentation accuracy to the latest network while having lower computational overhead.
Diagram of the internal structure of SHFormer and visualization of the computational flow in the Spatial-Channel Connection module.
Published in: IEEE Journal of Biomedical and Health Informatics ( Volume: 28, Issue: 5, May 2024)
Page(s): 2979 - 2990
Date of Publication: 12 March 2024

ISSN Information:

PubMed ID: 38457317

I. Introduction

As a pivotal procedure in medical image analysis, medical image segmentation (MIS) plays an important role in computer-aided diagnosis and intelligent medicine, with the goal of assisting doctors to better understand changes in patients' anatomical or pathological structures in order to make more correct judgments or develop more appropriate treatment plans [1]. Over the past decade, the application of deep learning has contributed to the booming development of MIS, with numerous excellent models emerging. However, most networks sacrifice efficiency for better performance by increasing their size, which greatly affects real-world applications.

Contact IEEE to Subscribe

References

References is not available for this document.