Steganalysis of adaptive multi-rate speech with unknown embedding rates using multi-scale transformer and multi-task learning mechanism
As adaptive multi-rate (AMR) speech applications become increasingly widespread, AMR-based steganography presents growing security risks. Conventional steganalysis methods often assume known embedding rates, limiting their practicality in real-world scenarios where embedding rates are unknown. To ov...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
Multidisciplinary Digital Publishing Institute (MDPI)
2025
|
| Subjects: | |
| Online Access: | http://psasir.upm.edu.my/id/eprint/123114/1/123114.pdf http://psasir.upm.edu.my/id/eprint/123114/ https://www.mdpi.com/2624-800X/5/2/29 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | As adaptive multi-rate (AMR) speech applications become increasingly widespread, AMR-based steganography presents growing security risks. Conventional steganalysis methods often assume known embedding rates, limiting their practicality in real-world scenarios where embedding rates are unknown. To overcome this limitation, we introduce a novel framework that integrates a multi-scale transformer architecture with multi-task learning for joint classification and regression. The classification task effectively distinguishes between cover and stego samples, while the regression task enhances feature representation by predicting continuous embedding values, providing deeper insights into embedding behaviors. This joint optimization strategy improves model adaptability to diverse embedding conditions and captures the underlying relationships between discrete embedding classes and their continuous distributions. The experimental results demonstrate that our approach achieves higher accuracy and robustness than existing steganalysis methods across varying embedding rates. |
|---|
