Wavelet Video Compression

The thesis presents current approaches in wavelet technology, which provides an efficient framework for multi-resolution space-frequency representation with promising applications in video processing. The discrete wavelet transform (DWT) is becoming increasingly important in visual applications...

Bibliographic Details
Main Author: Fakeh, Rohmad
Format: Thesis
Language: English
Published: 2006
Online Access: http://psasir.upm.edu.my/id/eprint/438/1/1600480.pdf
http://psasir.upm.edu.my/id/eprint/438/
id my.upm.eprints.438
record_format eprints
spelling my.upm.eprints.438 2013-05-27T06:48:21Z http://psasir.upm.edu.my/id/eprint/438/ Wavelet Video Compression Fakeh, Rohmad 2006-12 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/438/1/1600480.pdf Fakeh, Rohmad (2006) Wavelet Video Compression. PhD thesis, Universiti Putra Malaysia. English
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description The thesis presents current approaches in wavelet technology, which provides an efficient framework for multi-resolution space-frequency representation with promising applications in video processing. The discrete wavelet transform (DWT) is becoming increasingly important in visual applications because of its flexibility in representing nonstationary signals such as images and video sequences. The main objective of this thesis is to develop a wavelet video compression system. However, many parameters within wavelet analysis and synthesis govern the quality of a decoded video sequence, such as boundary policies, quantization threshold, decomposition strategies and the choice of wavelet filter banks. An evaluation of the visual quality of images and video sequences at different parameter settings leads to recommendations on the wavelet filter parameters to be used in video compression. In this thesis, video compression schemes based on 2D frame-by-frame and 3D spatio-temporal wavelet transformation are proposed. The standard spatio-temporal scheme generates a fixed number of sub-bands after the temporal decomposition and applies adaptive quantization to that fixed number of sub-bands. The proposed spatio-temporal scheme generates a flexible number of sub-bands, depending on the penetration depth of the wavelet transformation. Global and level-dependent threshold quantization, with statistical adaptive estimation of wavelet shrinkage for the transformed coefficients, has been adopted; it provides high compression performance even without entropy coding, comparable to other coding schemes that use other quantization methods. Compared with the global threshold, the level-dependent threshold is found to be a useful tool for providing a fixed rate, and it is used throughout the simulations for the empirical evaluation of the tested parameters.
Extensive experimental investigations on a wide variety of monochrome and color images, and on video sequences in QCIF, CIF and SIF resolutions, are reported in this thesis. Mean Squared Error (MSE), Compression Ratio (CR) and Peak Signal-to-Noise Ratio (PSNR), the international benchmarks for visual quality evaluation, are used as the objective measures of performance. Experimental results show that bi-orthogonal 9/7 (Bior-4.4) wavelet filters perform comparably for images and for video sequences with less temporal activity; however, filter banks from the Symlet family (Sym-5) perform best and outperform the others when applied to video sequences with higher background activity, such as the Car-phone and Akiyo sequences. Coding performance is best with dyadic DWT decomposition, periodic extension and level-dependent threshold quantization.
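The pipeline the abstract describes (dyadic DWT, level-dependent threshold quantization without entropy coding, then MSE, CR and PSNR) can be illustrated with a minimal sketch. It is not the thesis implementation and rests on several assumptions: a Haar filter stands in for the Bior-4.4 and Sym-5 filter banks, the input is a 1D row of samples rather than video frames, the per-level threshold is a universal threshold with a median-based noise estimate (the abstract does not state the rule used), and CR is taken as total coefficients divided by non-zero coefficients. All function names are hypothetical.

```python
import math
from statistics import median

SQRT2 = math.sqrt(2.0)

def haar_dwt(signal, levels):
    """Dyadic Haar decomposition (assumed stand-in filter).
    Returns (approximation, [details of level 1, level 2, ...]).
    The signal length must be divisible by 2**levels."""
    approx, details = list(signal), []
    for _ in range(levels):
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / SQRT2 for a, b in pairs])
        approx = [(a + b) / SQRT2 for a, b in pairs]
    return approx, details

def haar_idwt(approx, details):
    """Inverse of haar_dwt: rebuilds from the coarsest level outward."""
    out = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for a, d in zip(out, detail):
            rebuilt.extend([(a + d) / SQRT2, (a - d) / SQRT2])
        out = rebuilt
    return out

def level_threshold(detail):
    """Assumed rule: universal threshold sigma * sqrt(2 ln n), with sigma
    estimated per level from the median absolute coefficient."""
    n = len(detail)
    if n < 2:
        return 0.0  # too few coefficients to estimate noise; keep them
    sigma = median([abs(c) for c in detail]) / 0.6745
    return sigma * math.sqrt(2.0 * math.log(n))

def compress(signal, levels):
    """Transform, then hard-threshold each detail level with its own threshold."""
    approx, details = haar_dwt(signal, levels)
    kept = []
    for detail in details:
        t = level_threshold(detail)
        kept.append([c if abs(c) > t else 0.0 for c in detail])
    return approx, kept

def mse(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)

def psnr(x, y, peak=255.0):
    err = mse(x, y)
    return math.inf if err == 0 else 10.0 * math.log10(peak ** 2 / err)

def compression_ratio(approx, details):
    """Assumed definition: total coefficients / non-zero coefficients."""
    coeffs = list(approx) + [c for d in details for c in d]
    nonzero = sum(1 for c in coeffs if c != 0.0)
    return len(coeffs) / max(nonzero, 1)

if __name__ == "__main__":
    row = [10, 10.5, 10, 9.5, 200, 200.5, 200, 199.5]  # made-up sample row
    approx, details = compress(row, levels=3)
    decoded = haar_idwt(approx, details)
    print("CR:", compression_ratio(approx, details))
    print("MSE:", mse(row, decoded))
    print("PSNR (dB):", psnr(row, decoded))
```

Swapping the global threshold in for comparison would mean computing one threshold over all detail coefficients and applying it at every level; the per-level variant above lets coarse levels, which carry the large structural coefficients, be treated differently from the fine, noise-dominated levels.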
format Thesis
author Fakeh, Rohmad
title Wavelet Video Compression
publishDate 2006
url http://psasir.upm.edu.my/id/eprint/438/1/1600480.pdf
http://psasir.upm.edu.my/id/eprint/438/