DATA EMBEDDING IN SPEECH SIGNALS USING PERCEPTUAL MASKING (ThuPmPO1)
Author(s) :
Ariel Sagi (Technion, Israel)
David Malah (Technion, Israel)
Abstract : In this paper, a data embedding technique for speech signals, exploiting the masking property of the human auditory system, is presented. The signal in the frequency domain is partitioned into subbands. The data embedding parameters of each subband are computed from the auditory masking threshold function and a channel noise estimate. Data embedding is performed by modifying the Discrete Hartley Transform (DHT) coefficients according to the principles of the Scalar Costa Scheme (SCS). A maximum likelihood detector is employed in the decoder for embedded-data presence detection and data-embedding quantization-step estimation. We demonstrate the proposed data embedding technique by simulation of data embedding in a speech signal transmitted over a telephone line. The demonstrated system achieves transparent data-embedding at the rate of 300 information bits/second with a bit-error-rate of approximately 10^-4. The proposed technique outperforms spread spectrum (SS) based data-embedding techniques for speech signals.
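
The embedding step named in the abstract (modifying DHT coefficients along the lines of the Scalar Costa Scheme) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fixed quantization step `step`, the scale factor `alpha`, the dither `key`, and the choice of bins are all assumptions made for illustration. In the paper these parameters are derived per subband from the masking threshold and a channel noise estimate, and the decoder estimates the step with a maximum likelihood detector; none of that is reproduced here.

```python
import math
import random


def dht(x):
    """Unnormalised Discrete Hartley Transform, O(N^2) for clarity."""
    n = len(x)
    out = []
    for k in range(n):
        acc = 0.0
        for i, v in enumerate(x):
            ang = 2.0 * math.pi * i * k / n
            acc += v * (math.cos(ang) + math.sin(ang))  # cas kernel
        out.append(acc)
    return out


def idht(h):
    """The DHT is its own inverse up to a factor 1/N."""
    n = len(h)
    return [v / n for v in dht(h)]


def _quantize(v, step):
    """Uniform scalar quantizer with the given step size."""
    return step * round(v / step)


def scs_embed(coeff, bit, step, alpha, key=0.0):
    """Embed one bit: move coeff a fraction alpha toward a shifted lattice."""
    offset = step * (bit / 2.0 + key)
    q_err = _quantize(coeff - offset, step) - (coeff - offset)
    return coeff + alpha * q_err


def scs_extract(coeff, step, key=0.0):
    """Hard decision: near the bit-0 lattice gives 0, otherwise 1."""
    shifted = coeff - key * step
    y = _quantize(shifted, step) - shifted
    return 0 if abs(y) < step / 4.0 else 1


def embed_frame(frame, bits, step, alpha, first_bin=1):
    """Hypothetical helper: one bit per DHT bin, starting at first_bin."""
    h = dht(frame)
    for i, b in enumerate(bits):
        h[first_bin + i] = scs_embed(h[first_bin + i], b, step, alpha)
    return idht(h)


def extract_frame(frame, n_bits, step, first_bin=1):
    """Hypothetical helper: read the bits back from the same DHT bins."""
    h = dht(frame)
    return [scs_extract(h[first_bin + i], step) for i in range(n_bits)]


if __name__ == "__main__":
    rng = random.Random(0)
    frame = [rng.uniform(-1.0, 1.0) for _ in range(32)]  # stand-in for speech
    bits = [rng.randint(0, 1) for _ in range(8)]
    marked = embed_frame(frame, bits, step=0.5, alpha=0.7)
    print(extract_frame(marked, len(bits), step=0.5) == bits)
```

In this sketch, a noiseless channel recovers the bits only when `alpha` is above 0.5, because the residual offset from the lattice, at most `(1 - alpha) * step / 2`, must stay below the decision threshold `step / 4`. A smaller `alpha` reduces the embedding distortion at the cost of a smaller decision margin.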
