Adapting a Swin Transformer for License Plate Number and Text Detection in Drone Images

Authors

  • Srikanta Pal Maynooth International Engineering College, Maynooth University, Ireland
  • Ayush Roy Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India https://orcid.org/0000-0002-9330-6839
  • Shivakumara Palaiahnakote Faculty of Computer Science and Information Technology, University of Malaya, Malaysia
  • Umapada Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India

DOI:

https://doi.org/10.47852/bonviewAIA3202549

Keywords:

MSER, deep learning, Swin transformer, text detection, license plate number detection

Abstract

The use of drones and unmanned aerial vehicles has significantly increased in various real-world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic jams, and chasing vehicles. However, accurate detection of license plate numbers in drone images becomes complex and challenging due to variations in height distances and oblique angles during image capturing, unlike most existing methods that focus on normal images for text/license plate number detection. To address this issue, this work proposes a new model for license plate number detection in drone images using Swin transformer. The Swin transformer is chosen due to its special properties such as higher accuracy, efficiency, and fewer computations, making it suitable for license plate number/text detection in drone images. To further improve the performance of the proposed model under adverse conditions such as degradations, poor quality, and occlusion, the proposed work incorporates a maximally stable extremal region-based regional proposal network to represent text data in the images. Experimental results on both normal license plates and drone images demonstrate the superior performance of the proposed model over state-of-the-art methods.

 

Received: 23 November 2022 | Revised: 28 March 2023 | Accepted: 4 April 2023

 

Conflicts of Interest

Palaiahnakote Shivakumara is an Editor-in-Chief and Umapada Pal is an Advisory Board Member for Artificial Intelligence and Applications, and were not involved in the editorial review or the decision to publish this article. The authors declare that they have no conflicts of interest to this work.


Metrics

File downloads
835
Apr 07 '23Apr 10 '23Apr 13 '23Apr 16 '23Apr 19 '23Apr 22 '23Apr 25 '23Apr 28 '23May 01 '23May 04 '235.0
|

Downloads

Published

2023-04-06

Issue

Section

Research Article

How to Cite

Pal, S. ., Roy, A. ., Palaiahnakote , S., & Pal, . U. . (2023). Adapting a Swin Transformer for License Plate Number and Text Detection in Drone Images. Artificial Intelligence and Applications, 1(3), 129-138. https://doi.org/10.47852/bonviewAIA3202549