Loading [MathJax]/extensions/MathZoom.js
Small-Object Sensitive Segmentation Using Across Feature Map Attention | IEEE Journals & Magazine | IEEE Xplore

Small-Object Sensitive Segmentation Using Across Feature Map Attention


Abstract:

Semantic segmentation is an important step in understanding the scene for many practical applications such as autonomous driving. Although Deep Convolutional Neural Netwo...Show More

Abstract:

Semantic segmentation is an important step in understanding the scene for many practical applications such as autonomous driving. Although Deep Convolutional Neural Networks-based methods have significantly improved segmentation accuracy, small/thin objects remain challenging to segment due to convolutional and pooling operations that result in information loss, especially for small objects. This article presents a novel attention-based method called Across Feature Map Attention (AFMA) to address this challenge. It quantifies the inner-relationship between small and large objects belonging to the same category by utilizing the different feature levels of the original image. The AFMA could compensate for the loss of high-level feature information of small objects and improve the small/thin object segmentation. Our method can be used as an efficient plug-in for a wide range of existing architectures and produces much more interpretable feature representation than former studies. Extensive experiments on eight widely used segmentation methods and other existing small-object segmentation models on CamVid and Cityscapes demonstrate that our method substantially and consistently improves the segmentation of small/thin objects.
Page(s): 6289 - 6306
Date of Publication: 30 September 2022

ISSN Information:

PubMed ID: 36178991

Funding Agency:

Author image of Shengtian Sang
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Shengtian Sang received the PhD degree from the College of Computer Science and Technology, Dalian University of Technology, Dalian, China. He is currently a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. His current research interests include medical data mining, medical image computing, and machine learning. ...Show More
Shengtian Sang received the PhD degree from the College of Computer Science and Technology, Dalian University of Technology, Dalian, China. He is currently a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. His current research interests include medical data mining, medical image computing, and machine learning. ...View more
Author image of Yuyin Zhou
Department of Computer Science and Engineering, University of California, Santa Cruz, CA, USA
Yuyin Zhou received the PhD degree from the Computer Science Department, Johns Hopkins University, in 2020, and was a postdoctoral researcher with Stanford University from 2020 to 2021. She is currently an assistant professor of computer science and engineering with UC Santa Cruz. Her research interests span the fields of medical image computing, computer vision, and machine learning, especially the intersection of them. ...Show More
Yuyin Zhou received the PhD degree from the Computer Science Department, Johns Hopkins University, in 2020, and was a postdoctoral researcher with Stanford University from 2020 to 2021. She is currently an assistant professor of computer science and engineering with UC Santa Cruz. Her research interests span the fields of medical image computing, computer vision, and machine learning, especially the intersection of them. ...View more
Author image of Md Tauhidul Islam
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Md Tauhidul Islam (Student Member, IEEE) received the BSc and MSc degrees in electrical and electronic engineering from the Bangladesh University of Engineering and Technology (BUET), Dhaka, in 2011 and 2014, respectively. He is a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. In his PhD study, he worked on ult...Show More
Md Tauhidul Islam (Student Member, IEEE) received the BSc and MSc degrees in electrical and electronic engineering from the Bangladesh University of Engineering and Technology (BUET), Dhaka, in 2011 and 2014, respectively. He is a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. In his PhD study, he worked on ult...View more
Author image of Lei Xing
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Lei Xing received the PhD degree in physics from the Johns Hopkins University, Baltimore, MD, USA, in 1992. He completed the Medical Physics Training with the University of Chicago, Chicago, IL, USA. He is currently the Jacob Haimson & Sarah S. Donaldson professor of medical physics and the director of the Medical Physics Division, Radiation Oncology Department, Stanford University, Stanford, CA, USA. He also holds affili...Show More
Lei Xing received the PhD degree in physics from the Johns Hopkins University, Baltimore, MD, USA, in 1992. He completed the Medical Physics Training with the University of Chicago, Chicago, IL, USA. He is currently the Jacob Haimson & Sarah S. Donaldson professor of medical physics and the director of the Medical Physics Division, Radiation Oncology Department, Stanford University, Stanford, CA, USA. He also holds affili...View more

1 Introduction

Semantic segmentation is an important processing step in natural or medical image analysis for the detection of distinct types of objects in images [1]. In this process, a semantic label is assigned to each pixel of a given image. The breakthrough of semantic segmentation methods came when fully convolutional neural networks (FCN) were first used by [2] to perform end-to-end segmentation of images. While semantic segmentation has achieved significant improvement based on the conception of fully convolutional networks, small and thin items in the scene remain difficult to segment because the information of small objects is lost throughout the convolutional and pooling processes [3], [4], [5], [6]. For example, Fig. 1a is an image of size 800 by 1200 pixels, which contains two cars: the larger car is 160 by 220 pixels (Fig. 1b), and the smaller one is 30 by 40 (Fig. 1c). After a convolution operation with a convolution kernel of 10×10, the length and width of the image are compressed to one-tenth of the original size (as shown in Fig. 1d). Accordingly, the dimensions of the large and small cars become 16 by 22 and 3 by 4 pixels, respectively. As seen from the example, we can still see the car's features from Fig. 1e (feature map of the large car), but we can hardly see the features of the small car from the 12-pixel size Fig. 1c (feature map of the small car). This is because the high-level representation from convolutional and pooling operations generated along lowers the resolution, which often leads to the loss of the detailed information of small/thin objects [3] — as a result, recovering the car information from the coarse feature maps is difficult for segmentation models [7]. However, accurately segmenting small objects is critical in many applications, such as autonomous driving, where the segmentation and recognition of small-sized cars and pedestrians in the distance is critical [8], [9], [10], [11].

Author image of Shengtian Sang
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Shengtian Sang received the PhD degree from the College of Computer Science and Technology, Dalian University of Technology, Dalian, China. He is currently a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. His current research interests include medical data mining, medical image computing, and machine learning. In his PhD study, he worked on biomedical literature-based discovery and data mining.
Shengtian Sang received the PhD degree from the College of Computer Science and Technology, Dalian University of Technology, Dalian, China. He is currently a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. His current research interests include medical data mining, medical image computing, and machine learning. In his PhD study, he worked on biomedical literature-based discovery and data mining.View more
Author image of Yuyin Zhou
Department of Computer Science and Engineering, University of California, Santa Cruz, CA, USA
Yuyin Zhou received the PhD degree from the Computer Science Department, Johns Hopkins University, in 2020, and was a postdoctoral researcher with Stanford University from 2020 to 2021. She is currently an assistant professor of computer science and engineering with UC Santa Cruz. Her research interests span the fields of medical image computing, computer vision, and machine learning, especially the intersection of them. She has more than 20 peer-reviewed publications at top-tier conferences and journals including CVPR, ICCV, AAAI, the IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Medical Imaging, MedIA, etc. She has led the ICML 2021 workshop on Interpretable Machine Learning in Healthcare, the ICCV 2021 workshop on Computer Vision for Automated Medical Diagnosis, and co-organized ML4H 2021, the 9th CVPR MCV workshop. She served as a senior program committee for IJCAI 2021 and AAAI 2022, an area chair for MICCAI 2022, CHIL 2022.
Yuyin Zhou received the PhD degree from the Computer Science Department, Johns Hopkins University, in 2020, and was a postdoctoral researcher with Stanford University from 2020 to 2021. She is currently an assistant professor of computer science and engineering with UC Santa Cruz. Her research interests span the fields of medical image computing, computer vision, and machine learning, especially the intersection of them. She has more than 20 peer-reviewed publications at top-tier conferences and journals including CVPR, ICCV, AAAI, the IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Medical Imaging, MedIA, etc. She has led the ICML 2021 workshop on Interpretable Machine Learning in Healthcare, the ICCV 2021 workshop on Computer Vision for Automated Medical Diagnosis, and co-organized ML4H 2021, the 9th CVPR MCV workshop. She served as a senior program committee for IJCAI 2021 and AAAI 2022, an area chair for MICCAI 2022, CHIL 2022.View more
Author image of Md Tauhidul Islam
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Md Tauhidul Islam (Student Member, IEEE) received the BSc and MSc degrees in electrical and electronic engineering from the Bangladesh University of Engineering and Technology (BUET), Dhaka, in 2011 and 2014, respectively. He is a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. In his PhD study, he worked on ultrasound elastography at Ultrasound and Elasticity Imaging Laboratory, Department of Electrical Engineering, Texas A&M University. His current research interests include high dimensional medical data analysis using deep learning, manifold embedding, and interpretability of deep neural networks. His past research interests were in diverse areas of biomechanics, ultrasound imaging, elastography, and signal processing.
Md Tauhidul Islam (Student Member, IEEE) received the BSc and MSc degrees in electrical and electronic engineering from the Bangladesh University of Engineering and Technology (BUET), Dhaka, in 2011 and 2014, respectively. He is a post-doctoral scholar with the Laboratory of Artificial Intelligence in Medicine and Biomedical Physics, Department of Radiation Oncology, Stanford University. In his PhD study, he worked on ultrasound elastography at Ultrasound and Elasticity Imaging Laboratory, Department of Electrical Engineering, Texas A&M University. His current research interests include high dimensional medical data analysis using deep learning, manifold embedding, and interpretability of deep neural networks. His past research interests were in diverse areas of biomechanics, ultrasound imaging, elastography, and signal processing.View more
Author image of Lei Xing
Department of Radiation Oncology, Stanford University, Stanford, CA, USA
Lei Xing received the PhD degree in physics from the Johns Hopkins University, Baltimore, MD, USA, in 1992. He completed the Medical Physics Training with the University of Chicago, Chicago, IL, USA. He is currently the Jacob Haimson & Sarah S. Donaldson professor of medical physics and the director of the Medical Physics Division, Radiation Oncology Department, Stanford University, Stanford, CA, USA. He also holds affiliate faculty positions with the Department of Electrical Engineering, Bio-X and Molecular Imaging Program, Stanford University. He has been a member of the Radiation Oncology Faculty, Stanford University, since 1997. His current research interests include medical imaging, artificial intelligence in medicine, treatment planning, image-guided interventions, nanomedicine, and applications of molecular imaging in radiation oncology.
Lei Xing received the PhD degree in physics from the Johns Hopkins University, Baltimore, MD, USA, in 1992. He completed the Medical Physics Training with the University of Chicago, Chicago, IL, USA. He is currently the Jacob Haimson & Sarah S. Donaldson professor of medical physics and the director of the Medical Physics Division, Radiation Oncology Department, Stanford University, Stanford, CA, USA. He also holds affiliate faculty positions with the Department of Electrical Engineering, Bio-X and Molecular Imaging Program, Stanford University. He has been a member of the Radiation Oncology Faculty, Stanford University, since 1997. His current research interests include medical imaging, artificial intelligence in medicine, treatment planning, image-guided interventions, nanomedicine, and applications of molecular imaging in radiation oncology.View more
Contact IEEE to Subscribe

References

References is not available for this document.