CT-γ-Net: A Hybrid Model Based on Convolutional Encoder-Decoder and Transformer Encoder for Brain Tumor Localization

Authors

  • Punam Bedi, Department of Computer Science, University of Delhi, India
  • Ningyao Ningshen, Department of Computer Science, University of Delhi, India
  • Surbhi Rani, Department of Computer Science, University of Delhi, India
  • Pushkar Gole, Department of Computer Science, University of Delhi, India
  • Veenu Bhasin, P.G.D.A.V. College, University of Delhi, India

DOI:

https://doi.org/10.47852/bonviewJDSIS42022514

Keywords:

brain tumor segmentation, transformer, convolutional encoder–decoder, deep learning, disease diagnosis using artificial intelligence

Abstract

A brain tumor is a life-threatening disease, and early diagnosis can save lives. Computer-aided brain tumor segmentation and localization in magnetic resonance imaging (MRI) scans have emerged as pivotal approaches for expediting diagnosis. In recent years, various researchers have combined the strengths of convolutional networks and transformers to perform brain tumor segmentation. However, these models require a large number of trainable parameters, and there is still scope for improving their performance. To bridge these research gaps, this paper proposes a novel hybrid model named "CT-γ-Net" for effective and efficient brain tumor localization. The proposed CT-γ-Net model follows an encoder-decoder structure in which a convolutional encoder (CE) and a transformer encoder (TE) are used for encoding, whereas a convolutional decoder (CD) decodes the combined output of the CE and TE to generate the segmentation masks. In the CE and CD components of the CT-γ-Net model, conventional convolutional layers are replaced by depth-wise separable convolutional layers, as these layers significantly reduce the number of trainable parameters. The proposed model achieves 95.5% MeanIoU, a 94.82% Dice score, and 99.24% pixel accuracy on a publicly available dataset from The Cancer Imaging Archive. These experimental results demonstrate that the CT-γ-Net model outperforms other state-of-the-art models while using roughly 28% fewer trainable parameters. Hence, the proposed model’s lightweight nature and high performance make it a suitable candidate for deployment on mobile devices, facilitating precise localization of brain tumor regions in MRI scans.
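To illustrate why depth-wise separable convolutions reduce the parameter count, the following minimal sketch compares the number of trainable weights in a standard convolution with that of a depth-wise convolution followed by a 1×1 point-wise convolution. The function names and the layer sizes (64 input channels, 128 output channels, 3×3 kernel) are assumptions chosen for illustration and are not taken from the paper. The per-layer saving shown here is also not the model-level figure of roughly 28% reported above, which depends on the full architecture.

```python
def conv_params(c_in, c_out, k, bias=True):
    """Trainable parameters in a standard k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def depthwise_separable_params(c_in, c_out, k, bias=True):
    """Trainable parameters in a depth-wise separable convolution.

    Depth-wise step: one k x k filter per input channel.
    Point-wise step: a 1 x 1 convolution mixing c_in channels into c_out.
    """
    depthwise = k * k * c_in + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer sizes, chosen only for illustration.
    c_in, c_out, k = 64, 128, 3
    standard = conv_params(c_in, c_out, k)
    separable = depthwise_separable_params(c_in, c_out, k)
    print(f"standard conv:  {standard} parameters")
    print(f"separable conv: {separable} parameters")
    print(f"ratio:          {separable / standard:.3f}")
```

For these assumed sizes, the separable layer needs 8,960 parameters against 73,856 for the standard layer. Ignoring biases, the ratio is 1/c_out + 1/k², so the saving grows with the number of output channels and the kernel size.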

 

Received: 23 January 2024 | Revised: 18 March 2024 | Accepted: 29 March 2024 

 

Conflicts of Interest

The authors declare that they have no conflicts of interest related to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/mateuszbuda/lgg-mri-segmentation.

 

Author Contribution Statement

Punam Bedi: Conceptualization, Validation, Writing - review & editing, Supervision, Project administration. Ningyao Ningshen: Conceptualization, Methodology, Software, Formal analysis, Writing - original draft, Visualization. Surbhi Rani: Conceptualization, Methodology, Software, Formal analysis, Writing - original draft, Visualization. Pushkar Gole: Conceptualization, Validation, Writing - review & editing. Veenu Bhasin: Conceptualization, Validation, Writing - review & editing, Supervision.


Published

2024-04-01

Section

Research Articles

How to Cite

Bedi, P., Ningshen, N., Rani, S., Gole, P., & Bhasin, V. (2024). CT-γ-Net: A Hybrid Model Based on Convolutional Encoder-Decoder and Transformer Encoder for Brain Tumor Localization. Journal of Data Science and Intelligent Systems. https://doi.org/10.47852/bonviewJDSIS42022514