Steganalysis of adaptive multi-rate speech with unknown embedding rates using multi-scale transformer and multi-task learning mechanism

As adaptive multi-rate (AMR) speech applications become increasingly widespread, AMR-based steganography presents growing security risks. Conventional steganalysis methods often assume known embedding rates, limiting their practicality in real-world scenarios where embedding rates are unknown. To ov...

Full description

Saved in:
Bibliographic Details
Main Authors: Sun, Congcong, Abdullah, Azizol, Samian, Normalia, Roslan, Nuur Alifah
Format: Article
Language:en
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2025
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/123114/1/123114.pdf
http://psasir.upm.edu.my/id/eprint/123114/
https://www.mdpi.com/2624-800X/5/2/29
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:As adaptive multi-rate (AMR) speech applications become increasingly widespread, AMR-based steganography presents growing security risks. Conventional steganalysis methods often assume known embedding rates, limiting their practicality in real-world scenarios where embedding rates are unknown. To overcome this limitation, we introduce a novel framework that integrates a multi-scale transformer architecture with multi-task learning for joint classification and regression. The classification task effectively distinguishes between cover and stego samples, while the regression task enhances feature representation by predicting continuous embedding values, providing deeper insights into embedding behaviors. This joint optimization strategy improves model adaptability to diverse embedding conditions and captures the underlying relationships between discrete embedding classes and their continuous distributions. The experimental results demonstrate that our approach achieves higher accuracy and robustness than existing steganalysis methods across varying embedding rates.