
Graph Encoding based Hybrid Vision Transformer for Automatic Road Network Extraction


Abstract:

This paper introduces a graph encoding-based hybrid vision transformer for automatic road network extraction from high-resolution remote sensing imagery. Given that high-resolution remote sensing images cover large urban areas, traditional segmentation-based road extraction methods can usually generate good binary classification maps for simply structured road surfaces but fail in complex highway and bridge-covered areas. We introduce a graph encoding-based mechanism to address these issues, enabling the road extraction framework to extract the road segmentation features and build the graph structure map jointly. Compared to segmentation-only methods, our approach learns prior geometrical structure information from the extracted ViT feature maps and has a non-local awareness of the whole road network structure. Experimental results demonstrate that the proposed approach outperforms traditional segmentation-based methods.

Published in: IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium

Date of Conference: 16-21 July 2023

Date Added to IEEE Xplore: 20 October 2023

DOI: 10.1109/IGARSS52108.2023.10283247

Conference Location: Pasadena, CA, USA


1. INTRODUCTION

Road extraction from remote sensing imagery plays an important role in many applications, such as urban planning, navigation, autonomous driving, and geographic information updating. With the fast development of sensing platforms, high-resolution remote sensing imagery provides a promising avenue for automatic road network extraction, but the task remains challenging. Existing road network extraction methods can be categorized as segmentation-based or graph-based. Segmentation-based methods use image segmentation to classify each pixel as road or non-road [1] [2] [3]. Manual interpretation or complex post-processing is then applied to the segmentation results to generate the road network. Although existing research reports promising results using convolutional neural networks for segmentation and extraction [4] [5] [6], pixel-based road segmentation results still suffer from occlusion, noise, and the complex backgrounds of high-resolution remote sensing imagery. Graph-based methods [7] [8] use iterative graph construction for automatic road network extraction. They start from a random node and apply prior experience rules to generate the whole road network as a graph iteratively. Graph-based methods may generate accurate road network maps, but they usually require a lot of prior knowledge and human interaction, which makes them less robust to domain variations. Besides, the graph generation process is usually iterative, which makes the neural network focus more on local information. To avoid local optima, recent work [9] proposes a sequential generative model that makes the network aware of global information.
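As an illustration of why pixel-wise segmentation alone can yield a broken road network, the following minimal sketch turns a binary road mask into a pixel-adjacency graph and counts its connected components. It is not the method proposed in this paper or in the cited works; the function names (`mask_to_graph`, `count_components`) and the toy masks are hypothetical, and 4-connectivity is an assumption chosen for simplicity.

```python
from collections import deque


def mask_to_graph(mask):
    """Build a 4-connected pixel graph from a binary road mask.

    Nodes are (row, col) positions of road pixels; each node maps to the
    list of its adjacent road pixels. (Hypothetical helper for illustration.)
    """
    rows, cols = len(mask), len(mask[0])
    graph = {}
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c]:
                continue
            neighbors = []
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols and mask[nr][nc]:
                    neighbors.append((nr, nc))
            graph[(r, c)] = neighbors
    return graph


def count_components(graph):
    """Count connected components with a breadth-first search."""
    seen = set()
    components = 0
    for start in graph:
        if start in seen:
            continue
        components += 1
        seen.add(start)
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for nxt in graph[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
    return components


# Toy masks (assumed data): a T-junction, and the same junction with the
# center pixel misclassified as non-road, e.g. hidden under a bridge.
clean_mask = [
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
]
occluded_mask = [
    [0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1],
    [0, 0, 1, 0, 0],
]

clean_parts = count_components(mask_to_graph(clean_mask))
occluded_parts = count_components(mask_to_graph(occluded_mask))
print(clean_parts, occluded_parts)
```

In this toy case a single misclassified pixel splits one connected junction into three disconnected pieces, which is the kind of topological error that post-processing or a graph-aware model would need to repair.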
