
Decision-Level Fusion Classification of Ovarian CT Benign and Malignant Tumors Based on Radiomics and Deep Learning of Dual Views




Abstract:

Ovarian cancer is one of the most prevalent malignant tumors of the female reproductive system, and its early diagnosis has always posed a challenge. Computed tomography (CT) is widely used in clinical management, and computer algorithms can extract rich detail from CT images, giving it a vital role in the early diagnosis of ovarian cancer. This research aims to develop an ovarian benign-malignant classification model based on radiomics and deep learning of dual views. A retrospective analysis of CT images from 135 ovarian tumor patients was conducted, using the StratifiedKFold method (K = 5) for cross-validation. Radiomics features were extracted from the CT data and input into an automated machine learning (A-ML) framework. Meanwhile, a deep learning (DL) model called Dual-View Global Representation and Local Cross Transformer (D_GR_LCT) was proposed for ovarian tumor classification, using a global-local parallel analysis approach with end-to-end training. The radiomics results indicate the superiority of 3D input over 2D, with an average AUC-ROC of 88.35% and an average AUC-PR of 88.73%. Comparative experiments demonstrate that the chosen parameter settings enhance model performance. The DL model achieves an average AUC-ROC and AUC-PR of 88.15% and 85.17%, respectively, validated by ablation and comparative experiments. At the decision level, the fusion of the radiomics and DL models by the stacking method achieves an average AUC-ROC and AUC-PR of 91.35% and 90.20%, respectively, outperforming the individual models. Models based on radiomics and dual-view DL are therefore recommended for early identification and screening of ovarian cancer in clinical practice.
Published in: IEEE Access ( Volume: 12)
Page(s): 102381 - 102395
Date of Publication: 19 July 2024
Electronic ISSN: 2169-3536



SECTION I.

Introduction

Ovarian cancer (OC) is a prevalent gynecologic cancer, ranking behind cervical and uterine cancers [1]. The incidence and mortality rate of OC in China have both steadily increased over the past 30 years, and this trend is projected to continue for the next 30 years [2]. Most ovarian tumors are diagnosed at mid to advanced stages due to the lack of early visible symptoms and reliable screening tests. Current clinical methods for early diagnosis of OC include pelvic examination, imaging tests (such as ultrasound or CT scan of the abdomen and pelvis), blood tests to assess organ function, tumor marker tests such as CA-125, genetic tests, etc. However, these diagnostic procedures are laborious, time-consuming, costly, and require highly skilled examiners and physicians. Ultimately, patients may still fail to obtain precise diagnostic results. Therefore, the primary challenge facing OC lies in accurately and efficiently distinguishing between cancer patients and normal/benign patients in the early stages without imposing additional burdens on clinical practice and patients [3], [4].

Radiomics, an emerging research method, was first proposed by Lambin et al. [5]. This method reveals the relationship between tumor biological features, heterogeneity, and image data by extracting high-throughput image features. Doctors can utilize it to develop descriptive and predictive models that aid them in making more precise diagnoses [6]. Medical scanning imaging is widely used as a clinical management method for patients in most hospitals, which means that radiomics can be tightly integrated with clinical practice. Radiomics extracts a significant amount of analyzable data from medical images through computer algorithms to quantitatively capture features such as the shape, size, volume, and texture of a tumor or normal tissue region. Ultimately, these features are utilized to obtain valuable diagnostic or prognostic information to support clinical decision-making without burdening existing workflows significantly [7], [8].

Compared to the gold standard of pathological biopsy, radiomics offers a non-invasive alternative that reduces patient discomfort, improves work efficiency, and lowers the financial burden on patients, providing a safer means of assessing their condition [9]. Additionally, biopsies have limited ability to characterize the spatial and temporal heterogeneity of lesions, making them inferior to radiomics in this respect [10].

In recent years, several reviews have summarized the developing experience of radiomics for OC [11], [12], [13]. Additionally, numerous OC radiomics models have been proposed and applied to various medical scenarios, including predicting postoperative recurrence [14], early detection and diagnosis [15], [16], [17], assessing patient survival [18], preoperative classification [19], [20], [21], [22], cancer typing [23], [24], chemotherapy sensitivity [25], and prognosis [26].

Deep learning (DL), originally developed for image analysis, has shown remarkable performance in diverse image processing tasks, including registration, segmentation, feature extraction, and classification. It excels at extracting latent information from medical images, making it a powerful tool for lesion detection and classification in, for example, lung cancer [27], thyroid cancer [28], and breast cancer [29]. Numerous studies have established DL as the most effective method for computer-aided diagnosis (CAD) [30], [31]. However, traditional DL models typically rely on a single-view input, which may underestimate or even neglect the spatial correlation between tumor locations, particularly on small datasets. To address this limitation, recent research has shifted towards multi-view/dual-view approaches. In medical imaging, "multi-view" refers to images obtained from different angles or planes. For CT scans, the views typically include axial (transverse), coronal, and sagittal: 1) the axial view, acquired along the transverse plane, provides a detailed perspective of different anatomical levels and aids accurate localization and measurement of lesions; 2) the coronal view, reconstructed along the anterior-posterior direction, offers information about the width and thickness of organs; 3) the sagittal view, reconstructed along the left-right direction, reveals the depth and position of organs and helps determine their spatial relationships. Multi-view approaches allow DL models to learn the features of the region of interest (ROI) thoroughly, enhancing performance and accuracy in medical image analysis tasks.

For example:

  • A case study on liver cancer utilized a Deep Multi-view Comparative Learning (DMCL) approach for cancer subtype identification [32].

  • A paper published in Medical Image Analysis designed attention-enhanced deep neural networks that jointly analyze bone scintigraphy from anterior and posterior views to automatically diagnose the absence or presence of bone metastasis [33].

  • Chen et al. used local and global transformation modules to model dependencies within and between mammograms, accurately identifying lesion regions and computing features from unregistered multiple mammograms [34].

  • Chen et al. constructed a multi-view local co-occurrence and global consistency learning model using two mammographic views (main and auxiliary) as inputs, further improving the generalization of mammogram classification [35].

  • Gao et al. proposed a new method (MuVAL) for multi-view synthesis learning of CT images using an attention mechanism to accurately predict residual lesions after ovarian cystectomy [36].

Research on dual-view DL for ovarian tumors remains underdeveloped, especially for distinguishing ovarian cysts.

Radiomics features are extracted from medical imaging data without the large datasets that DL requires, and the method has strong biological and clinical interpretability [37]. However, it demands high data quality and standardization and is influenced by the acquisition equipment and parameters. DL, on the other hand, has its own advantages. First, it automatically learns and extracts features without hand-crafted algorithms. Second, it acquires multi-level representations, which aids understanding and analysis of image data. Third, it processes large-scale image data in parallel, improving efficiency. Fourth, it enables end-to-end learning, simplifying the analysis pipeline and enhancing model performance. Lastly, it generalizes well to different types and styles of image data. The deep features extracted by DL models have demonstrated powerful representational capability and robustness to interference, quantifying high-level semantic information in the data. However, DL requires large amounts of annotated data and computational resources and lacks interpretability [38]. In general, the strong interpretability of radiomics features can compensate for the shortcomings of deep features, while deep features provide deeper semantic information that supports radiomics research. Research combining radiomics and DL is therefore receiving increasing attention.

In recent years, there has been an increasing number of studies focusing on the classification of OC by combining radiomics and DL [39]. However, there is still a lack of research on the classification of benign and malignant ovarian tumors based on radiomics & dual-view DL.

The purpose of this paper is to construct a model for the precise classification of benign and malignant ovarian tumors based on radiomics and DL. We propose a model that combines radiomics and dual-view DL, using the stacking method to exploit the strengths of each approach. The results show that this strategy effectively fuses the decision levels of the radiomics and DL models, achieving precise classification of benign and malignant ovarian tumors. The innovations of this study are:

  1. Regarding radiomics, we used PyRadiomics and an improved automated machine learning (A-ML) framework to carry out experiments, successfully achieving precise differentiation of benign and malignant tumors in contrast-enhanced ovarian CT.

  2. In the field of DL, we have developed a novel model for ovarian tumor classification known as Dual-view Global Representation and Local Cross Transformer (D_GR_LCT). The model takes dual views (axial view and coronal view) as input for the first time and employs a global-local parallel analysis approach to assess the global representation and local information within ROI. To generate local information, we have developed a new Cross Attention Transformer (CAttnT) to facilitate the exchange of information between different view features.

  3. In this paper, the dual-view DL model is integrated with radiomics for the first time to achieve ovarian tumor classification. The stacking method was employed for decision-making between the two models. The final results demonstrate that the combined model outperforms the individual models (radiomics/DL).

SECTION II.

Dataset Preparation

A. Data Acquisition

This study focuses on arterial-phase contrast-enhanced CT data of patients with ovarian tumors. A retrospective analysis was conducted on clinical data from 135 patients with confirmed ovarian tumors at Nanfang Hospital in Guangzhou, Guangdong Province, between June 2011 and August 2018. The data were acquired with a Siemens SOMATOM Definition scanner using the following parameters: tube voltage of 120 kVp, tube current of 122-673 mA, collimation widths of 19.2, 28.8, and 80 mm, minimum reconstructable thicknesses of 0.6, 0.625, and 1.2 mm, data acquisition diameter of 500 mm, and exposure time of 500 ms. The slice thickness ranged from 0.6 to 5 mm, and the image resolution was $512\times512$.

B. Outlining the ROI

ITK-SNAP, an open-source medical image processing and visualization tool, is used in medical research and clinical practice to support accurate image analysis and diagnosis. In this study, imaging experts used ITK-SNAP to delineate the ovarian tumor ROI layer by layer in each patient's images. Figure 1 shows a 3D reconstruction of the delineated region.

FIGURE 1. Stereo image after 3D reconstruction.

C. Raw Data Analysis

The experimental datasets comprise raw images stored in DICOM (dcm) format and masks stored in NIfTI (nii) format. ITK-SNAP visualizes the three views (axial, coronal, sagittal) of a patient's ROI, and the ROI size varies across views. To extract information from the maximum ROI, this paper statistically analyzed the distribution of the maximum ROI across all patients, as shown in Figure 2. The results show that 73.33% of patients had their maximum ROI in the axial view, 24.45% in the coronal view, and 2.22% in the sagittal view. Therefore, this paper selected the axial slice, which has the highest proportion, as the input for 2D radiomics, and selected the maximum ROIs of the axial and coronal views as the inputs for dual-view DL.

FIGURE 2. Distribution of the maximum ROI.

SECTION III.

Research Methodology

All experiments in this research were written and executed in Python (3.10.9). The experiments were conducted on a computer with an Intel Core (TM) i5-6500 CPU running at 3.20GHz, 8GB of RAM, and the Windows 7 Professional (x64) operating system. Additionally, the experiments were also performed on a server equipped with an NVIDIA Titan X graphics card for GPU acceleration.

A. Division of the Data Set

In this study, the dataset was divided into training and independent testing sets using the StratifiedKFold (K = 5) method to preserve the class proportions in each split. Figure 3 demonstrates the application of StratifiedKFold to stratify the dataset, and the number of benign/malignant samples in each fold's training and testing sets is shown in Table 1.
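Such a split can be reproduced with scikit-learn's StratifiedKFold; the sketch below is illustrative only, with placeholder feature and label arrays (the 70/65 benign-malignant split is assumed, not the paper's actual class counts).

```python
# Minimal sketch of the stratified five-fold split described above.
import numpy as np
from sklearn.model_selection import StratifiedKFold

X = np.random.default_rng(0).normal(size=(135, 10))  # one feature row per patient
y = np.array([0] * 70 + [1] * 65)                    # 0 = benign, 1 = malignant (placeholder)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for fold, (train_idx, test_idx) in enumerate(skf.split(X, y), start=1):
    # Each fold preserves the benign/malignant ratio of the full cohort.
    print(f"fold {fold}: train={len(train_idx)}, test={len(test_idx)}, "
          f"malignant in test={int(y[test_idx].sum())}")
```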

TABLE 1. Dataset of OC patients.

FIGURE 3. Schematic diagram of StratifiedKFold for stratifying the dataset.

B. Evaluation Index

The Receiver Operating Characteristic (ROC) curve and the Precision-Recall (PR) curve are widely utilized metrics for evaluating binary classification models. The ROC curve plots the true positive rate (TPR) against the false positive rate (FPR), while the PR curve plots precision against recall. The Area Under the Curve (AUC) for ROC (AUC-ROC) and for PR (AUC-PR) serve as indicators of classification performance. A higher AUC-ROC indicates a stronger ability to differentiate positive from negative cases, whereas a higher AUC-PR indicates better performance across thresholds, effectively balancing precision and recall. Referring to [35], [40], and [41], AUC-ROC and AUC-PR were adopted as the evaluation indexes in this paper. TPR, FPR, Precision, and Recall are calculated as follows:
\begin{align*} \mathrm{TPR}&=\frac{\mathrm{TP}}{\mathrm{TP+FN}} \tag{1}\\ \mathrm{FPR}&=\frac{\mathrm{FP}}{\mathrm{FP+TN}} \tag{2}\\ \mathrm{Precision}&=\frac{\mathrm{TP}}{\mathrm{TP+FP}} \tag{3}\\ \mathrm{Recall}&=\frac{\mathrm{TP}}{\mathrm{TP+FN}} \tag{4}\end{align*}
where TP, TN, FP, and FN denote the counts of true positive, true negative, false positive, and false negative samples, respectively.
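Both areas can be computed directly from predicted malignancy probabilities; the following minimal sketch uses scikit-learn with small illustrative arrays, not results from this study.

```python
# AUC-ROC and AUC-PR (average precision) for a binary classifier.
import numpy as np
from sklearn.metrics import roc_auc_score, average_precision_score

y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])                    # ground-truth labels
y_score = np.array([0.1, 0.4, 0.8, 0.65, 0.9, 0.3, 0.55, 0.2])  # predicted probabilities

auc_roc = roc_auc_score(y_true, y_score)           # area under the TPR-FPR curve
auc_pr = average_precision_score(y_true, y_score)  # area under the precision-recall curve
print(f"AUC-ROC={auc_roc:.4f}, AUC-PR={auc_pr:.4f}")
```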

C. Radiomics

1) Radiomics Feature Extraction

Based on the distribution shown in Figure 2, we extract radiomics features from the maximum-ROI slice along the axial axis for the 2D radiomics experiment, which also serves as a comparative study. Both 3D and 2D radiomics experiments are then conducted to determine the optimal configuration.

We used the PyRadiomics toolkit [42], which supports 2D and 3D feature extraction, to extract radiomics features from each patient's CT images. For legibility, ROI is used as a general term for both the 2D region of interest and the 3D volume of interest. The extracted features encompass shape (3D/2D), first-order statistical features, gray-level co-occurrence matrix (GLCM), gray-level run-length matrix (GLRLM), gray-level size-zone matrix (GLSZM), gray-level dependence matrix (GLDM), and neighborhood gray-tone difference matrix (NGTDM) features [43]. Table 2 displays the dimensions of the extracted radiomics features.

TABLE 2. The dimension of extracted radiomics features.

To enrich the extracted features, following [44], [45], and [46], image preprocessing was applied to the original input images; specifically, Laplacian of Gaussian (LoG) and wavelet filtering were added. Image resampling uses b-spline interpolation (sitkBSpline), while mask resampling uses nearest-neighbor interpolation (sitkNearestNeighbor). Since shape descriptors are independent of gray values and are extracted from the label mask, shape features are calculated only on the original image; the other radiomics features are calculated on both the original and the derived (LoG, wavelet) images. Table 3 lists the specific settings and feature dimensions; "Other Features" refers to the radiomics features in Table 2, excluding shape features.

TABLE 3. Parameter setting for image preprocessing.

The wavelet transform is a filtering technique that decomposes an image into detail information at different scales. The wavelet basis functions available in PyRadiomics include haar, dmey, sym, db, coif, bior, and rbio. The Coiflet wavelet is smoother than the alternatives, making it suitable for extracting low-frequency information from medical images, while its good frequency localization preserves image detail; this paper therefore uses the default Coiflet basis function "coif1". Through the wavelet transform, an image is decomposed into sub-bands of different frequencies, each representing detail at a different scale. In the 2D experiments, the LL sub-band captures the overall contour and structure, HH captures fine texture and edge information, LH captures horizontal low-frequency and vertical high-frequency information, and HL captures horizontal high-frequency and vertical low-frequency information. In the 3D experiments, decomposing along all three axes yields the higher-order sub-bands LLH, LHL, LHH, HLL, HLH, HHL, and HHH (plus LLL).

In the LoG algorithm, sigma is a standard deviation parameter used to adjust the size of the Gaussian kernel to control the smoothing effect. Increasing the sigma value will enhance the smoothing effect and reduce the impact of noise, but it may also result in some loss of image details. Conversely, decreasing the sigma value can preserve more details but may not effectively remove noise. Therefore, we chose five different sigma values to obtain a more comprehensive radiomics feature set.

Finally, a total of 919-dimensional radiomics features were extracted from the 2D input, while 1288-dimensional radiomics features were extracted from the 3D data.
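A minimal sketch of this extraction setup with PyRadiomics is given below; the file paths are placeholders, and the resampled spacing and the specific sigma values are assumptions, since the paper does not state them.

```python
# Sketch of 3D radiomics extraction with LoG and coif1 wavelet derived images.
from radiomics import featureextractor

settings = {
    "interpolator": "sitkBSpline",             # image resampling, as in Table 3
    "resampledPixelSpacing": [1.0, 1.0, 1.0],  # assumed isotropic spacing
}
extractor = featureextractor.RadiomicsFeatureExtractor(**settings)
# Derived images on top of the original: LoG with five sigmas (values assumed)
# and the coif1 wavelet; masks are resampled with nearest-neighbor internally.
extractor.enableImageTypeByName("LoG", customArgs={"sigma": [1.0, 2.0, 3.0, 4.0, 5.0]})
extractor.enableImageTypeByName("Wavelet", customArgs={"wavelet": "coif1"})

features = extractor.execute("patient001_ct.nii.gz", "patient001_mask.nii.gz")
radiomic_values = {k: v for k, v in features.items() if not k.startswith("diagnostics")}
print(len(radiomic_values), "features extracted")
```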

2) Construction of Radiomics

In this paper, we updated the automated machine learning (Auto-ML) approach of [47] and established a new framework (A-ML) for radiomics feature analysis of ovarian tumors, as shown in Figure 4. The pipeline comprises preprocessing, feature selection, oversampling, and classification, ultimately achieving automatic classification of benign and malignant tumors.

FIGURE 4. The flow of A-ML.

Within A-ML, the first step involves three preprocessing operations: outlier detection, normalization, and analysis of variance (ANOVA) [48]. Next, five common feature selection methods are employed for dimensionality reduction. The Synthetic Minority Over-sampling Technique (SMOTE) [49] and the Random Over Sampling Examples (ROSE) algorithm [50] are then applied to address class imbalance, improve learning on minority samples, and stabilize model performance. Finally, Logistic Regression (LR), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN) perform the binary classification of benign and malignant tumors; the "NO" option at each stage indicates that the corresponding method is skipped. The A-ML model takes the 3D and 2D radiomics features as inputs, and a random search with 5-fold cross-validation is used to find the optimal parameters during training. The model's performance is then assessed independently on the test set.

The only difference between A-ML and Auto-ML [47] lies in the dimension used for feature selection: A-ML derives the candidate dimensions from the number of extracted radiomics features rather than using a fixed dimension. This avoids an arbitrary choice of dimension and respects differences between datasets.
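As a concrete illustration of one candidate configuration inside A-ML, the sketch below chains ANOVA-based selection, SMOTE oversampling, and an SVM, tuned by random search with 5-fold cross-validation; the stage choices, candidate dimensions, and data are assumptions for demonstration, not the framework's actual implementation.

```python
# One illustrative A-ML candidate pipeline, mirroring the stages in Figure 4.
import numpy as np
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import Pipeline
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import RandomizedSearchCV, StratifiedKFold
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(108, 200))   # placeholder radiomics feature matrix
y = rng.integers(0, 2, size=108)  # placeholder labels

pipe = Pipeline([
    ("scale", StandardScaler()),         # normalization
    ("select", SelectKBest(f_classif)),  # ANOVA-based feature selection
    ("smote", SMOTE(random_state=0)),    # minority-class oversampling (train folds only)
    ("clf", SVC()),                      # one of the LR/SVM/KNN candidates
])
param_dist = {
    "select__k": [20, 50, 100],          # candidate dimensions tied to the feature count
    "clf__C": np.logspace(-2, 2, 10),
    "clf__gamma": ["scale", "auto"],
}
search = RandomizedSearchCV(
    pipe, param_dist, n_iter=10, scoring="roc_auc",
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
    random_state=0,
)
search.fit(X, y)
print(search.best_params_, round(search.best_score_, 4))
```

Using imblearn's Pipeline ensures SMOTE is applied only when fitting on training folds, never to validation data.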

Consequently, a total of 108 ($(1+5)\times(1+2)\times3\times2$) radiomics models were developed for the experimental classification of ovarian benign and malignant tumors, encompassing both 3D and 2D inputs. The overall classification process of this radiomics experiment is depicted in Figure 5.

FIGURE 5. The flowchart of classification for radiomics.

D. DL Model

Effective early detection can reduce cancer mortality while minimizing unnecessary surgical interventions caused by false-positive screening. Artificial intelligence-assisted tools built on DL can streamline the evaluation process of OC screening for radiologists, enhancing their efficiency and diagnostic accuracy. Radiologists analyze the imaging findings of OC patients by observing different slices of CT images to identify potentially heterogeneous tumor regions and draw diagnostic conclusions together with clinical information. To mimic this process, we developed a DL model named Dual-view Global Representation and Local Cross Transformer (D_GR_LCT). It takes dual views of the same ovary as input and integrates the global representation with the local information generated by the local cross-attention transformer, enabling a global-local analysis that better distinguishes benign from malignant ovarian tumors.

1) Data Preprocessing

In Figure 6, 3D images of benign and malignant ovarian tumors are presented. There are distinct morphological differences between benign and malignant tumors, each containing different information. Furthermore, as shown in Figure 2, the largest ROI is mostly distributed in the axial and coronal planes. Therefore, this study utilizes axial and coronal views as the original inputs for the DL model.

FIGURE 6. Stereo image of ovarian benign and malignant tumors.

To extract effective features from the ROI while ignoring surrounding interference, we developed a Detection and Crop of Tumor Region (DCTR) module that detects the tumor area and extracts the ROI. The processing steps are: 1) use the findContours function in OpenCV to find contours in the mask image and obtain their coordinates; 2) crop the original ROI from the original image based on these coordinates and resize it to $512\times512$; 3) finally, enhance the brightness of the processed ROI for use as input to the DL model. Figure 7 shows an axial view of a benign case before and after processing by the DCTR module, and a sketch of these steps follows the figure.

FIGURE 7. A benign case (axial view) before and after preprocessing.
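A hedged sketch of the three DCTR steps with OpenCV follows; the exact module is not published, so the function layout and the brightness-enhancement factors are assumptions.

```python
# Illustrative DCTR steps: contour detection, crop, resize, brighten.
import cv2
import numpy as np

def dctr(image: np.ndarray, mask: np.ndarray, size: int = 512) -> np.ndarray:
    """Detect the tumor contour in `mask`, crop it from `image`, resize, and brighten."""
    contours, _ = cv2.findContours(mask.astype(np.uint8),
                                   cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    x, y, w, h = cv2.boundingRect(max(contours, key=cv2.contourArea))
    roi = image[y:y + h, x:x + w]
    roi = cv2.resize(roi, (size, size), interpolation=cv2.INTER_LINEAR)
    # Step 3: simple brightness enhancement (alpha/beta values are illustrative).
    return cv2.convertScaleAbs(roi, alpha=1.2, beta=10)
```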

The data augmentation methods employed in this paper include vertical flip, horizontal flip, affine transformation (rotate 20, translate_percent 0.1, shear 20, and scale ranging from 0.8 to 1.2), and elastic transformation (alpha 10, sigma 15), each applied with probability 0.5. Figure 8 illustrates the visual effects of eight augmentation outcomes on the training set, with the basic parameters annotated in the figure. Vertical and horizontal flips enhance data diversity; affine transformation simulates images at various angles and positions through rotation, translation, scaling, and shearing; and elastic transformation simulates deformation, making the model more robust. After applying these augmentations, the training set grows to eight times its original size.

FIGURE 8. Data augmentation process visualization.
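The listed parameters map naturally onto the Albumentations API, so a plausible sketch of the pipeline is shown below; the library choice is an assumption, as the paper does not name its augmentation toolkit.

```python
# Candidate augmentation pipeline matching the parameters listed above.
import albumentations as A

augment = A.Compose([
    A.VerticalFlip(p=0.5),
    A.HorizontalFlip(p=0.5),
    A.Affine(rotate=20, translate_percent=0.1, shear=20,
             scale=(0.8, 1.2), p=0.5),
    A.ElasticTransform(alpha=10, sigma=15, p=0.5),
])
# Usage: augmented_roi = augment(image=roi)["image"]
```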

2) Construction of DL Model

The overall structure of the proposed D_GR_LCT model is illustrated in Figure 9. The model is constructed as follows: first, the inputs are the axial-view and coronal-view images of ovarian patients processed by the DCTR module, and the features of these two views ($\mathrm{U}_{\mathrm{A}}$ and $\mathrm{U}_{\mathrm{C}}$ in Figure 9) are extracted by the backbone (convnext_nano/densenet121/resnet101/tf_efficientnetv2_s). The basic deep features extracted by the backbone ($\mathrm{U}_{\mathrm{Axial}}$ and $\mathrm{U}_{\mathrm{Coronal}}$ in Figure 10) are then processed simultaneously by the Global Representation (GR) module and the Local Cross Transformers (LCT) module to generate the global representation and local information. Finally, both are fed into a Multi-Layer Perceptron (MLP) classification layer, which outputs the final prediction.

FIGURE 9. Overall structure of the proposed model D_GR_LCT.

FIGURE 10. Structure of LCT module.

To capture the global information of the image, we developed the GR module shown in Figure 9. This module combines the features $\mathrm{U}_{\mathrm{Axial}}$ and $\mathrm{U}_{\mathrm{Coronal}}$ of the dual views extracted by the backbone and performs feature dimensionality reduction by generalized mean pooling (GMP):
\begin{align*} \mathrm{GR}\left({\mathrm{U}_{\mathrm{Axial}},\mathrm{U}_{\mathrm{Coronal}}}\right)=\mathrm{GMP}\left({\mathrm{Concat}(\mathrm{U}_{\mathrm{Axial}},\mathrm{U}_{\mathrm{Coronal}})}\right) \tag{5}\end{align*}

The global representation information produced by the GR module is derived from the entire image, addressing the limitations of the LCT module in capturing local information.
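A minimal PyTorch sketch of Eq. (5) is given below; the GeM pooling exponent p=3 is a common default and an assumption here, as the paper does not state its value.

```python
# GR module: concatenate dual-view feature maps, then generalized-mean pooling.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GeM(nn.Module):
    """Generalized-mean pooling; p=3 is a common default (assumed, not from the paper)."""
    def __init__(self, p: float = 3.0, eps: float = 1e-6):
        super().__init__()
        self.p = nn.Parameter(torch.tensor(p))
        self.eps = eps

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, C, H, W)
        x = x.clamp(min=self.eps).pow(self.p)
        return F.adaptive_avg_pool2d(x, 1).pow(1.0 / self.p).flatten(1)  # (B, C)

def gr_module(u_axial: torch.Tensor, u_coronal: torch.Tensor, gem: GeM) -> torch.Tensor:
    """Eq. (5): GMP(Concat(U_Axial, U_Coronal)) -> one global vector per sample."""
    return gem(torch.cat([u_axial, u_coronal], dim=1))  # (B, 2C)

# Toy usage with backbone-like feature maps of shape (B, C, H, W).
u_a, u_c = torch.randn(2, 64, 7, 7), torch.randn(2, 64, 7, 7)
print(gr_module(u_a, u_c, GeM()).shape)  # torch.Size([2, 128])
```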

To aid understanding, the LCT module is presented separately in Figure 10. The LCT module generates the local information exchanged between the dual views. To better learn the dependency between the views, it incorporates the concept of a Cross Transformer and adopts an attention mechanism following the Cross-shaped Window Self-attention of CSWin [51], which forms horizontal and vertical stripes of a cross-shaped window, widening each token's receptive field and enhancing its context-modeling ability. To let information flow between the two views, we design a Cross Attention Transformer (CAttnT) module that applies horizontal and vertical cross attention, respectively. Unlike the original cross-shaped window self-attention, CAttnT performs cross attention between the two LCT inputs ($\mathrm{U}_{\mathrm{Axial}}$ and $\mathrm{U}_{\mathrm{Coronal}}$): for the horizontal heads, the queries Q generated from one view's horizontal stripes are exchanged with the other view's, and the vertical heads do the same for the vertical stripes. In this way, CAttnT realizes cross attention between the dual views by exchanging the Q generated from their respective horizontal and vertical stripes.

Assuming that, after applying CAttnT, A and C represent the outputs for $U_{\mathrm{Axial}}$ and $U_{\mathrm{Coronal}}$, respectively, the output of the LCT module is defined as:
\begin{align*} \mathrm{LCT}\left({U_{\mathrm{Axial}},U_{\mathrm{Coronal}}}\right)&=\mathrm{Concat}\left({\mathrm{GAP}(\mathrm{A}),\mathrm{GAP}(\mathrm{C})}\right) \\ \mathrm{A}&=\mathrm{Concat}(A_{1},\ldots,A_{k},\ldots,A_{K})W^{O} \\ \mathrm{C}&=\mathrm{Concat}(C_{1},\ldots,C_{k},\ldots,C_{K})W^{O},\quad k=1,\ldots,K \tag{6}\end{align*}

$W^{O}\in\mathrm{R}^{C\times C}$ denotes the projection matrix, and the output dimension is set to C. K was set to 2, 4, and 8, with the final value determined by the optimal AUC. GAP stands for global average pooling, and LN stands for layer normalization.

For the cross attention of $\mathrm{U}_{\mathrm{Axial}}$ and $\mathrm{U}_{\mathrm{Coronal}}$, $\mathrm{U}_{\mathrm{Axial}}$ is uniformly divided into non-overlapping horizontal stripes of equal height $\left[\mathrm{U}_{\mathrm{Axial}}^{1},\ldots,\mathrm{U}_{\mathrm{Axial}}^{x},\ldots,\mathrm{U}_{\mathrm{Axial}}^{X}\right]$ and non-overlapping vertical stripes of equal width $\left[\mathrm{U}_{\mathrm{Axial}}^{1},\ldots,\mathrm{U}_{\mathrm{Axial}}^{y},\ldots,\mathrm{U}_{\mathrm{Axial}}^{Y}\right]$, and $\mathrm{U}_{\mathrm{Coronal}}$ is partitioned in the same way. A dynamic stripe width (SW) controls the division and adjusts the balance between the model's learning ability and computational complexity. In this paper, SW was set to 1, 2, and 4, and its final value was determined in the same way as K.
\begin{align*} \left[\mathrm{U}_{\mathrm{Axial}}^{1},\ldots,\mathrm{U}_{\mathrm{Axial}}^{x},\ldots,\mathrm{U}_{\mathrm{Axial}}^{X}\right]&=\mathrm{U}_{\mathrm{Axial}}, &\quad \mathrm{U}_{\mathrm{Axial}}^{x}&\in\mathrm{R}^{(sw\times W)\times C}, &\quad X&=H/sw \\ \left[\mathrm{U}_{\mathrm{Axial}}^{1},\ldots,\mathrm{U}_{\mathrm{Axial}}^{y},\ldots,\mathrm{U}_{\mathrm{Axial}}^{Y}\right]&=\mathrm{U}_{\mathrm{Axial}}, &\quad \mathrm{U}_{\mathrm{Axial}}^{y}&\in\mathrm{R}^{(sw\times H)\times C}, &\quad Y&=W/sw \\ \left[\mathrm{U}_{\mathrm{Coronal}}^{1},\ldots,\mathrm{U}_{\mathrm{Coronal}}^{x},\ldots,\mathrm{U}_{\mathrm{Coronal}}^{X}\right]&=\mathrm{U}_{\mathrm{Coronal}}, &\quad \mathrm{U}_{\mathrm{Coronal}}^{x}&\in\mathrm{R}^{(sw\times W)\times C}, &\quad X&=H/sw \\ \left[\mathrm{U}_{\mathrm{Coronal}}^{1},\ldots,\mathrm{U}_{\mathrm{Coronal}}^{y},\ldots,\mathrm{U}_{\mathrm{Coronal}}^{Y}\right]&=\mathrm{U}_{\mathrm{Coronal}}, &\quad \mathrm{U}_{\mathrm{Coronal}}^{y}&\in\mathrm{R}^{(sw\times H)\times C}, &\quad Y&=W/sw \tag{7}\end{align*}

Assuming that the projected query (Q), key (K), and value (V) dimensions of the $k^{\mathrm{th}}$ head are $d_{k}$, the output of the $k^{\mathrm{th}}$ head of $\mathrm{U}_{\mathrm{Axial}}$ after cross attention is defined as:
\begin{align*} A_{k}=\begin{cases} \left[A_{k}^{1},\ldots,A_{k}^{x},\ldots,A_{k}^{X}\right], & k=1,\ldots,K/2 \\ \left[A_{k}^{1},\ldots,A_{k}^{y},\ldots,A_{k}^{Y}\right], & k=K/2+1,\ldots,K \end{cases} \tag{8}\end{align*}
\begin{align*} A_{k}^{x}&=\mathrm{CAttnT}(Q_{\mathrm{Coronal}}^{x},K_{\mathrm{Axial}}^{x},V_{\mathrm{Axial}}^{x}) \\ A_{k}^{y}&=\mathrm{CAttnT}(Q_{\mathrm{Coronal}}^{y},K_{\mathrm{Axial}}^{y},V_{\mathrm{Axial}}^{y}) \\ Q_{\mathrm{Coronal}}^{x}&=U_{\mathrm{Coronal}}^{x}W_{k}^{Q},\quad K_{\mathrm{Axial}}^{x}=U_{\mathrm{Axial}}^{x}W_{k}^{K},\quad V_{\mathrm{Axial}}^{x}=U_{\mathrm{Axial}}^{x}W_{k}^{V} \\ Q_{\mathrm{Coronal}}^{y}&=U_{\mathrm{Coronal}}^{y}W_{k}^{Q},\quad K_{\mathrm{Axial}}^{y}=U_{\mathrm{Axial}}^{y}W_{k}^{K},\quad V_{\mathrm{Axial}}^{y}=U_{\mathrm{Axial}}^{y}W_{k}^{V}\end{align*}

$W_{k}^{Q}\in\mathrm{R}^{C\times d_{k}}$, $W_{k}^{K}\in\mathrm{R}^{C\times d_{k}}$, and $W_{k}^{V}\in\mathrm{R}^{C\times d_{k}}$ denote the projection matrices of Q, K, and V of the $k^{\mathrm{th}}$ head, respectively, and $d_{k}=C/K$ is the channel dimension per head. The Q, K, and V of CAttnT come from the features of different views, and CAttnT is calculated as:
\begin{align*} \mathrm{CAttnT}\left({Q_{\mathrm{Coronal}}^{x},K_{\mathrm{Axial}}^{x},V_{\mathrm{Axial}}^{x}}\right)=\mathrm{softmax}\left({\frac{Q_{\mathrm{Coronal}}^{x}\left({K_{\mathrm{Axial}}^{x}}\right)^{T}}{\sqrt{d_{k}}}}\right)V_{\mathrm{Axial}}^{x} \tag{9}\end{align*}
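The core of Eq. (9) reduces to scaled dot-product attention whose queries come from the opposite view; the PyTorch sketch below shows this for a single head and stripe, omitting the stripe partitioning and projection matrices for brevity.

```python
# Cross attention per Eq. (9): queries from one view, keys/values from the other.
import math
import torch

def cattn_t(q_other: torch.Tensor, k_self: torch.Tensor, v_self: torch.Tensor) -> torch.Tensor:
    """Scaled dot-product attention with queries taken from the opposite view."""
    d_k = q_other.size(-1)
    attn = torch.softmax(q_other @ k_self.transpose(-2, -1) / math.sqrt(d_k), dim=-1)
    return attn @ v_self

# One horizontal stripe: coronal-view queries attend over the axial view.
q_coronal = torch.randn(2, 16, 32)  # (batch, tokens in stripe, d_k)
k_axial, v_axial = torch.randn(2, 16, 32), torch.randn(2, 16, 32)
a_k_x = cattn_t(q_coronal, k_axial, v_axial)  # analogous to A_k^x in Eq. (9)
```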

We demonstrate the efficacy of the GR module and the LCT module in the ablation experiments. Figure 11 shows how the outputs after cross attention ($\mathrm{A}_{k}$ and $\mathrm{C}_{k}$) are obtained.

FIGURE 11. The diagram for calculating $\mathrm{A}_{k}$ and $\mathrm{C}_{k}$.

3) Single-View Model

To demonstrate the superior effectiveness of the double view over the single view, we compared the model performance between S_GR_LT (the single view only) and D_GR_LCT. The structure of the S_GR_LT model is illustrated in Figure 12, where the attention mechanism in the LT module does not include the cross part.

FIGURE 12. Overall architecture of S_GR_LT.

4) Implementation Details

All DL experiments in this paper were conducted using PyTorch. Each batch consisted of five samples, and the model was trained for 20 epochs. The AdamW optimizer [52] was used to minimize the binary cross-entropy (BCE) loss with a learning rate (lr) of 0.0001 and a weight decay of 0.01. OneCycleLR was employed to schedule the lr dynamically during training, with a maximum learning rate of 0.0001 and the proportion of the cycle spent increasing the lr set to 0.1.
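These settings map directly onto PyTorch's AdamW and OneCycleLR; in the sketch below, the stand-in model, the step count, and reading "proportion of lr increase" as pct_start are assumptions for illustration.

```python
# Optimization setup matching the stated hyperparameters.
import torch

model = torch.nn.Linear(16, 1)  # stand-in for D_GR_LCT
criterion = torch.nn.BCEWithLogitsLoss()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=0.01)
# 20 epochs x 27 steps/epoch (batch size 5 on ~135 samples) is illustrative.
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=1e-4, total_steps=20 * 27, pct_start=0.1)
```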

E. Fusion Model Based on Radiomics & DL Model

Sections III-C and III-D developed radiomics methods and DL technology to distinguish benign from malignant ovarian tumors. To fully leverage the advantages of both approaches, this paper integrates radiomics and the DL model. Building upon [53] and [54], we propose a decision-level fusion method that employs stacking to merge the decision layers of the radiomics and DL models. The outputs of the two models are used as inputs, optimal parameters for the LR, SVM, and KNN meta-learners are determined on the training set, and predictions for the test set are then produced by the combined model. The fusion method is illustrated in Figure 13.
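A minimal sketch of this stacking step follows; the probability arrays are placeholders for the out-of-fold outputs of the radiomics and DL models, and LR stands in for the three candidate meta-learners.

```python
# Decision-level stacking: base-model probabilities become meta-features.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
p_rad_train, p_dl_train = rng.random(108), rng.random(108)  # placeholder base outputs
y_train = rng.integers(0, 2, size=108)                      # placeholder labels

meta_X = np.column_stack([p_rad_train, p_dl_train])   # decision-level features
meta_clf = LogisticRegression().fit(meta_X, y_train)  # SVM/KNN are tuned analogously
# Test time: meta_clf.predict_proba(np.column_stack([p_rad_test, p_dl_test]))[:, 1]
```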

FIGURE 13. Fusion at the decision level.

SECTION IV.

Results and Analysis

This paper employs unified evaluation metrics, specifically AUC-ROC and AUC-PR.

A. Results Based on Radiomics Only

Table 4 presents the results of 2D and 3D radiomics tests for each fold, including mean and standard deviation values. It is evident from the table that, on average, 3D radiomics outperforms 2D radiomics. Specifically, the average AUC-ROC for 3D radiomics is 88.35% with a standard deviation of 9.7%, and the average AUC-PR is 88.73% with a standard deviation of 8.61%. Compared to 2D radiomics, there is an average improvement of 3.98% and 2.09% in AUC-ROC and AUC-PR, respectively. Therefore, the use of radiomics with 3D ovarian tumor data demonstrates superior performance, as illustrated in Figure 14, which shows the ROC and PR curves for 3D radiomics tests in each fold.

TABLE 4. Results of radiomic tests in 2D and 3D.

FIGURE 14. The test ROC and PR curves of each fold of 3D radiomics data.

1) Comparative Experiments

To assess the impact of adding LoG and Wavelet to the radiomics experiment, we used the radiomics features extracted with the Original parameter as the control. Table 5 presents the results of this control experiment. We compared the feature counts and classification performance before and after adding LoG and Wavelet, with the statistics shown in Table 6. Table 6 makes clear that introducing LoG and Wavelet significantly enriches the radiomics features: the 3D features increased from 105 to 1288, and the 2D features from 100 to 919. LoG and Wavelet also had a clear positive impact on classification: for 3D input, they increased AUC-ROC by 7.2% and AUC-PR by 3.89%.

TABLE 5. Results of radiomics comparative experiment.

TABLE 6. Analysis of results.

2) Summary Based on Radiomics

Based on the original arterial-phase contrast-enhanced CT images of ovarian tumors, this paper conducted four radiomics experiments to achieve early diagnosis and prediction of OC through quantitative analysis and mining of CT data. The experimental process was as follows: first, after analyzing the original images, we used 3D and 2D (axial) data as inputs. LoG and Wavelet image preprocessing were added to the original images to enrich the extracted multi-class radiomics features. After feature preprocessing, A-ML was used to train and test on the split dataset, automatically selecting the best combination model.

Through comparative experiments, we have confirmed the effectiveness of incorporating LoG and Wavelet. These methods not only enriched radiomics features but also further consolidated the data foundation for classifying benign and malignant ovarian tumors. At the same time, we compared the impact of different input data (3D/2D) on ovarian tumor classification when extracting radiomics features. The results showed that both 2D and 3D achieved AUC-PR values exceeding 85%, with AUC-ROC values of 84.37% for 2D and 88.35% for 3D.

In conclusion, this paper demonstrates the effectiveness of radiomics methods in classifying ovarian tumors, with results indicating that using 3D input yields better outcomes than using 2D input.

B. Results Based on DL Model Only

During the experiments, we used the bootstrap method to estimate the 95% confidence interval (CI) of AUC-ROC and AUC-PR, as described in [55]. By repeatedly resampling (2000 iterations by default) and recomputing the AUCs and CIs, this method allows us to assess the stability of classification performance. The mean, lower bound, and upper bound of the 95% CI for AUC-ROC and AUC-PR are displayed in Table 7. The test results demonstrate that our DL model (D_GR_LCT) significantly outperforms the single-view model (S_GR_LT): on the test set, D_GR_LCT achieves an average AUC-ROC of 88.15% and an average AUC-PR of 85.17%, versus 81.61% and 77.72% for S_GR_LT.

TABLE 7. Test results of single/double-view models.

Therefore, the test results show that our dual-view model D_GR_LCT significantly outperforms the single-view model S_GR_LT on both metrics, with average improvements of 6.54% in AUC-ROC and 7.45% in AUC-PR. Figure 15 illustrates the test ROC and PR curves for each fold of the DL model (D_GR_LCT).

FIGURE 15. The test ROC and PR curves of each fold of the DL model (D_GR_LCT).

1) Ablation Experiments

To assess the effectiveness of each component of the proposed dual-view model (D_GR_LCT), four ablation experiments were conducted: ① S-Backbone, ② D-GR, ③ D-LT, and ④ D-LCT, each evaluated on the classification of benign and malignant ovarian tumors. For reference, the single-view and dual-view models (⑤ S_GR_LT and ⑥ D_GR_LCT) are also included, as shown in Table 8. The ablation results indicate:

  1. The GR and LT modules in the single-view model (S_GR_LT) increased AUC-ROC by an average of 3.87% and AUC-PR by 4.47% compared with S-Backbone, confirming the effectiveness of the GR and LT modules in the single-view setting.

  2. Comparing D-LT with D-LCT revealed that our proposed CAttnT module improved AUC-ROC by 1.23% and AUC-PR by 0.97% on average when extracting local representation information, demonstrating the effectiveness of the CAttnT module.

  3. When comparing S-Backbone, D-GR, and D-LCT, both D-GR and D-LCT improved on S-Backbone: their respective average AUC-ROC values were 77.74%, 83.05%, and 82.62%, and their average AUC-PR values were 73.25%, 81.75%, and 80.05%. This indicates that both components are effective, although the improvement from a single component is not substantial enough to fully account for the information in ovarian tumors.

  4. The combination of the GR module and LCT module positively impacted double-view model performance as evidenced by an average increase in AUC-ROC to 88.15% and AUC-PR to 85.17% for D_GR_LCT.

TABLE 8. Test results of ablation experiments.

Therefore, the ablation experiment confirms the effectiveness of all components of D_GR_LCT.

2) Comparison Experiments

Apart from the ablation experiments, we compared the proposed D_GR_LCT with four commonly used classification models: ① ConvNext_tiny [56], ② MobileNetv3_small [57], ③ EfficientNet_b2 [58], and ④ EfficientNet_b3. As Table 9 shows, the D_GR_LCT model proposed in this paper outperforms all four.

TABLE 9. Test results of comparison experiments.

3) Summary Based on DL Model

In the context of DL, this paper focuses on the dual-view (Axial and Coronal) as the research object, utilizing Backbone to extract basic deep features and designing GR and LCT modules. The LCT module includes the CAttnT module to achieve cross-attention, ultimately outputting classification results through MLP. We named this model D_GR_LCT. Additionally, to compare the differences between single-view (Axial) models and dual-view models, a single-view DL model was also designed. Furthermore, we conducted ablation experiments and comparative experiments for the D_GR_LCT model, demonstrating the superiority of our proposed model in terms of effectiveness and performance.

All experimental results are based on the average values after five-fold cross-validation, with specific data detailed in Table 10.

TABLE 10. Average test results based on DL.

C. Results Based on Radiomics & DL

In the current study, with 3D data input into A-ML, radiomics achieved an average test AUC-ROC of 88.35% and AUC-PR of 88.73%. The best DL model identified in this study is our proposed D_GR_LCT, which achieved corresponding average test results of 88.15% AUC-ROC and 85.17% AUC-PR.

By combining the decision-level outputs of the best results from both radiomics and the DL model, Table 11 displays the fused results, with Figure 16 providing a visualization of the fusion outcomes. It can be observed from these representations that our proposed decision-level fusion method is effective for analyzing ovarian CT datasets in this experiment, yielding an impressive AUC-ROC as high as 91.35% and AUC-PR as high as 90.20%. The fused results demonstrate an average improvement of 3% for AUC-ROC and 1.47% for AUC-PR over radiomics alone; similarly, they show an average improvement of 3.2% for AUC-ROC and 5.03% for AUC-PR over the DL model.

TABLE 11. Test results of fusion model.

FIGURE 16. The visualization of the fusion outcomes.

In conclusion, the classification model proposed in this paper for distinguishing between benign and malignant ovarian tumors has produced significant outcomes in the fields of Radiomics, DL, and Radiomics & DL.

  1. A total of 1288-dimensional radiomics features were extracted when 3D data was used as input, including shape, first-order, second-order, and high-order texture features. To enrich the image information, LoG and Wavelet preprocessing were added. The radiomics feature set was then input into A-ML to automatically determine the optimal result, and experiments were designed to validate the effectiveness of these preprocessing operations.

  2. In terms of DL, this paper introduces the D_GR_LCT model — a dual-view (i.e., axial and coronal) global-local parallel analysis method based on ovarian CT images. The classification task is accomplished by extracting global representation and local information from different views. To capture dependencies between dual views and enhance information exchange among different view features, we have developed a CAttnT module for extracting local information. Comparative and ablation experiments have been conducted to demonstrate the effectiveness of our proposed model in this paper.

  3. To integrate the decision-level information of radiomics and the DL model (D_GR_LCT), this study further developed and implemented a comprehensive classification model for distinguishing between benign and malignant ovarian tumors using the Stacking method, as shown in Figure 13. The final experimental results demonstrate that the classification outcomes of the fused decision level are more accurate than those obtained from a single radiomics or a single DL model for benign and malignant ovarian tumors, thus directly validating the effectiveness of the proposed fusion method.

SECTION V.

Conclusion

OC is the most lethal gynecologic cancer, whereas benign ovarian tumors have a good prognosis; early detection of ovarian tumors is therefore vital. Currently, classifying ovarian tumors as benign or malignant relies heavily on pathological biopsy and imaging examinations, whose results can be subjective and poorly reproducible. To mitigate these factors and improve patient outcomes, this paper proposes a model for classifying benign and malignant ovarian tumors based on radiomics and DL.

In radiomics, different experiments were designed to obtain the best models. In DL, the maximum-ROI information was fully exploited to train on dual views in an end-to-end manner. A global-local parallel analysis method was adopted: after parallel analysis of the global representation and local features of the dual-view ovarian CT images, the benign-malignant classification task was completed using dual views as input for the first time. Moreover, this paper employs the stacking method for decision-level fusion, which produces more precise classification results than the individual radiomics or DL models.

However, some limitations of the current work require further improvement. To bring the proposed model into clinical practice, we will follow related research on dual-view classification of benign and malignant ovarian tumors. Additionally, given potential variations in acquisition equipment and parameters, the robustness of the model must be verified on additional data, such as multi-center datasets, before clinical use.

Overall, the proposed model is promising but requires further refinement before it can be widely implemented in clinical settings. Therefore, in our future work, we will continue to optimize the network structure and are eager to apply it in a wider range of medical scenarios.

Conflicts of Interest

None declared.
