DCT-DWT-FFT based method for text detection in underwater images
Text detection in underwater images is an open challenge because of the distortions caused by refraction, absorption of light, particles, and variations depending on depth, color, and nature of water. Unlike existing methods aimed at text detection in natural scene images, in this paper, we have pro...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Conference or Workshop Item |
Published: |
SPRINGER INTERNATIONAL PUBLISHING AG
2022
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/41004/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.eprints.41004 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.410042025-02-25T04:47:06Z http://eprints.um.edu.my/41004/ DCT-DWT-FFT based method for text detection in underwater images Banerjee, Ayan Shivakumara, Palaiahnakote Pal, Soumyajit Pal, Umapada Liu, Cheng-Lin QA75 Electronic computers. Computer science Text detection in underwater images is an open challenge because of the distortions caused by refraction, absorption of light, particles, and variations depending on depth, color, and nature of water. Unlike existing methods aimed at text detection in natural scene images, in this paper, we have proposed a novel method for text detection in underwater images through a new enhancement model. Based on observations that fine details of text in image share with high energy, spatial resolution, and brightness, we consider Discrete Cosine Transform (DCT), Discrete Wavelet Transform (DWT), and Fast Fourier Transform (FFT) for image enhancement to highlight the text features. The enhanced image is fed to a modified Character Region Awareness for Text Detection (CRAFT) model to detect text in underwater images. To explore enhancement methods, we evaluate six combinations of image enhancement techniques, namely, DCT-DWT-FFT, DCT-FFT-DWT, DWT-DCT-FFT, DWT-FFT-DCT, FFT-DCT-DWT, FFT-DWTDCT. Experimental results on our dataset of underwater images and benchmark datasets of natural scene text detection, namely, MSRA-TD500, ICDAR 2019 MLT, ICDAR 2019 ArT, Total-Text, CTW1500, and COCO Text show that the proposed method performs well for both underwater and natural scene images and outperforms the existing methods on all the datasets. SPRINGER INTERNATIONAL PUBLISHING AG 2022 Conference or Workshop Item PeerReviewed Banerjee, Ayan and Shivakumara, Palaiahnakote and Pal, Soumyajit and Pal, Umapada and Liu, Cheng-Lin (2022) DCT-DWT-FFT based method for text detection in underwater images. In: 6th Asian Conference on Pattern Recognition, ACPR 2021, 9-12 November 2021, Virtual, Online. |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Banerjee, Ayan Shivakumara, Palaiahnakote Pal, Soumyajit Pal, Umapada Liu, Cheng-Lin DCT-DWT-FFT based method for text detection in underwater images |
description |
Text detection in underwater images is an open challenge because of the distortions caused by refraction, absorption of light, particles, and variations depending on depth, color, and nature of water. Unlike existing methods aimed at text detection in natural scene images, in this paper, we have proposed a novel method for text detection in underwater images through a new enhancement model. Based on observations that fine details of text in image share with high energy, spatial resolution, and brightness, we consider Discrete Cosine Transform (DCT), Discrete Wavelet Transform (DWT), and Fast Fourier Transform (FFT) for image enhancement to highlight the text features. The enhanced image is fed to a modified Character Region Awareness for Text Detection (CRAFT) model to detect text in underwater images. To explore enhancement methods, we evaluate six combinations of image enhancement techniques, namely, DCT-DWT-FFT, DCT-FFT-DWT, DWT-DCT-FFT, DWT-FFT-DCT, FFT-DCT-DWT, FFT-DWTDCT. Experimental results on our dataset of underwater images and benchmark datasets of natural scene text detection, namely, MSRA-TD500, ICDAR 2019 MLT, ICDAR 2019 ArT, Total-Text, CTW1500, and COCO Text show that the proposed method performs well for both underwater and natural scene images and outperforms the existing methods on all the datasets. |
format |
Conference or Workshop Item |
author |
Banerjee, Ayan Shivakumara, Palaiahnakote Pal, Soumyajit Pal, Umapada Liu, Cheng-Lin |
author_facet |
Banerjee, Ayan Shivakumara, Palaiahnakote Pal, Soumyajit Pal, Umapada Liu, Cheng-Lin |
author_sort |
Banerjee, Ayan |
title |
DCT-DWT-FFT based method for text detection in underwater images |
title_short |
DCT-DWT-FFT based method for text detection in underwater images |
title_full |
DCT-DWT-FFT based method for text detection in underwater images |
title_fullStr |
DCT-DWT-FFT based method for text detection in underwater images |
title_full_unstemmed |
DCT-DWT-FFT based method for text detection in underwater images |
title_sort |
dct-dwt-fft based method for text detection in underwater images |
publisher |
SPRINGER INTERNATIONAL PUBLISHING AG |
publishDate |
2022 |
url |
http://eprints.um.edu.my/41004/ |
_version_ |
1825160582171459584 |
score |
13.244109 |