Adapting a Swin Transformer for License Plate Number and Text Detection in Drone Images

Authors

  • Srikanta Pal Maynooth International Engineering College, Maynooth University, Ireland
  • Ayush Roy Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India https://orcid.org/0000-0002-9330-6839
  • Palaiahnakote Shivakumara Faculty of Computer Science and Information Technology, University of Malaya, Malaysia
  • Umapada Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India

DOI:

https://doi.org/10.47852/bonviewAIA3202549

Keywords:

MSER, deep learning, Swin transformer, text detection, license plate number detection

Abstract

The use of drones and unmanned aerial vehicles has significantly increased in various real-world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic jams, and chasing vehicles. However, accurate detection of license plate numbers in drone images becomes complex and challenging due to variations in height distances and oblique angles during image capturing, unlike most existing methods that focus on normal images for text/license plate number detection. To address this issue, this work proposes a new model for license plate number detection in drone images using Swin transformer. The Swin transformer is chosen due to its special properties such as higher accuracy, efficiency, and fewer computations, making it suitable for license plate number/text detection in drone images. To further improve the performance of the proposed model under adverse conditions such as degradations, poor quality, and occlusion, the proposed work incorporates a maximally stable extremal region-based regional proposal network to represent text data in the images. Experimental results on both normal license plates and drone images demonstrate the superior performance of the proposed model over state-of-the-art methods.

 

Received: 23 November 2022 | Revised: 28 March 2023 | Accepted: 4 April 2023

 

Conflicts of Interest

Palaiahnakote Shivakumara is an editor-in-chief and Umapada Pal is an advisory board member for Artificial Intelligence and Applications, and were not involved in the editorial review or the decision to publish this article. The authors declare that they have no conflicts of interest to this work.

Metrics

Metrics Loading ...

Downloads

Published

2023-04-06

How to Cite

Pal, S. ., Roy, A. ., Shivakumara, P. ., & Pal, . U. . (2023). Adapting a Swin Transformer for License Plate Number and Text Detection in Drone Images. Artificial Intelligence and Applications, 1(3), 145–154. https://doi.org/10.47852/bonviewAIA3202549

Issue

Section

Research Article