Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction
The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combined multiple features and stacked many CNN layers (very deep learning) to reach the desired accuracy. Instead of usi...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English English |
Published: |
Springer Nature
2021
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/90215/7/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks_SCOPUS.pdf http://irep.iium.edu.my/90215/8/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks.pdf http://irep.iium.edu.my/90215/ https://link.springer.com/article/10.1007/s00521-021-06091-7 https://doi.org/10.1007/s00521-021-06091-7 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.iium.irep.90215 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.902152021-06-15T06:30:27Z http://irep.iium.edu.my/90215/ Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction Al-Hattab, Yousef Abd Mohd Zaki, Hasan Firdaus Shafie, Amir Akramin TK7885 Computer engineering The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combined multiple features and stacked many CNN layers (very deep learning) to reach the desired accuracy. Instead of using many features and going deeper by stacking layers that are resource extensive, this paper proposes a novel technique that uses only a single feature, namely the Mel-Frequency Cepstral Coefficient (MFCC) and just three layers of CNN. We demonstrate that such a simple network can considerably outperform several conventional and deep learning-based algorithms. Through a carefully and empirically parameters fine-tuning of the data input, we reported a model that is significantly less complex in the architecture yet has recorded a similar accuracy of 95.59% compared to state-of-the-art deep models on UrbanSound8k dataset. We conjecture that our accurate lightweight model is an excellent environmental sound recognizer for the application on resource-constraint embedded platform. Springer Nature 2021-05-26 Article PeerReviewed application/pdf en http://irep.iium.edu.my/90215/7/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks_SCOPUS.pdf application/pdf en http://irep.iium.edu.my/90215/8/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks.pdf Al-Hattab, Yousef Abd and Mohd Zaki, Hasan Firdaus and Shafie, Amir Akramin (2021) Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction. Neural Computing and Applications. pp. 1-19. ISSN 0941-0643 E-ISSN 1433-3058 https://link.springer.com/article/10.1007/s00521-021-06091-7 https://doi.org/10.1007/s00521-021-06091-7 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
TK7885 Computer engineering |
spellingShingle |
TK7885 Computer engineering Al-Hattab, Yousef Abd Mohd Zaki, Hasan Firdaus Shafie, Amir Akramin Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
description |
The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combined multiple features and stacked many CNN layers (very deep learning) to reach the desired accuracy. Instead of using many features and going deeper by stacking layers that are resource extensive, this paper proposes a novel technique that uses only a single feature, namely the Mel-Frequency Cepstral Coefficient (MFCC) and just three layers of CNN. We demonstrate that such a simple network can considerably outperform several conventional and deep learning-based
algorithms. Through a carefully and empirically parameters fine-tuning of the data input, we reported a model that is significantly less complex in the architecture yet has recorded a similar accuracy of 95.59% compared to state-of-the-art deep models on UrbanSound8k dataset. We conjecture that our accurate lightweight model is an excellent environmental sound recognizer for the application on resource-constraint embedded platform. |
format |
Article |
author |
Al-Hattab, Yousef Abd Mohd Zaki, Hasan Firdaus Shafie, Amir Akramin |
author_facet |
Al-Hattab, Yousef Abd Mohd Zaki, Hasan Firdaus Shafie, Amir Akramin |
author_sort |
Al-Hattab, Yousef Abd |
title |
Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
title_short |
Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
title_full |
Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
title_fullStr |
Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
title_full_unstemmed |
Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
title_sort |
rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction |
publisher |
Springer Nature |
publishDate |
2021 |
url |
http://irep.iium.edu.my/90215/7/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks_SCOPUS.pdf http://irep.iium.edu.my/90215/8/90215_Rethinking%20environmental%20sound%20classification%20using%20convolutional%20neural%20networks.pdf http://irep.iium.edu.my/90215/ https://link.springer.com/article/10.1007/s00521-021-06091-7 https://doi.org/10.1007/s00521-021-06091-7 |
_version_ |
1703960212662124544 |
score |
13.211869 |