Application of exponential power estimator for speech coding

Mathew, George

View/Open

T02763.pdf (63.28Mb)

Author

Mathew, George

Metadata

Show full item record

Abstract

In speech coding, it is known that the Differential Pulse Code Modulation (DPCM) schemes lead to the reduction of bandwidth requirement (by removing the redundancy) as compared to pulse Code Modulation (PCM) schemes, for a specified speech quality. To achieve adequate dynamic range and good subjective quality, however, it is necessary to adapt the parameters of the quantizer and predictor to the input signal characteristics. Several algorithms have been developed for adapting the quantizer and predictor, most of which suffer from high computational complexity and are not easily implementable. The study reported in this thesis is concerned with the development of adaptive PCM and DPCM (APCM and ADPCM) systems which are based on computationally simple adaptive quantization and prediction algorithms. Two points are noteworthy: firstly, variance estimation of the quantizer input forms the basis of adaptive quantization and secondly, in predictor adaptation using the Least Mean Square gradient (LMS) algorithm it is the estimation of the reconstructed signal power which is important. Hence, it is possible to reduce the computational complexity of the APCM and ADPCM systems by resorting to computationally simple power estimation schemes. This is achieved here by implementing the adaptation algorithms for the quantizer and predictor using the Exponential power Estimation Technique (EPE), which is a modified form of an existing power estimation method. The estimation using EPE avoids multiplications and divisions and it expresses the power estimate in the form 2^, where i is an integer. Hence the use of EPE leads to a significant reduction in computational complexity. Comparative studies of EPE with two conventional power SYNOPSIS ii estimation schemes (Block and Recursive power estimators: BPE and RPE) have been presented in the context of step response, spectral parameter identification and adaptive prediction of speech. The results indicate that EPE is much superior to RPE and identical to BPE in performance. However,' the performance of EPE is limited by the finite number of distinct power estimate values which the EPE estimate is allowed to take. The adaptive quantizer used in the APCM and ADPCM systems is of the hybrid type. EPE is used to estimate the instantaneous and syllabic standard deviations of the quantizer input signal, for the purpose of adaptation. In order to make the quantizer robust to transmission errors, a leakage mechanism has been introduced in the EPE algorithm, performance evaluation study of these systems have been done under noise free and noisy channel conditions using uniform and nonuniform quantizing characteristics at different bit rates. The input signals used are sinusoidal, correlated Gaussian sequence and digitized speech. segmented signal to noise ratio is the performance measure used. The APCM systems simulated at bit rates 24 and 32 kbps using the EPE-quantizer (EPEQ) is found to provide almost identical performance as that of jayant's adaptive quantizer for speech input while the performance degrades for sinusoidal input (due to the speech specific design of EPEQ). A detailed performance evaluation of the ADPCM system, whose quantizer and predictor are adapted based on EPE, led to the conclusion that the EPE based ADPCM is best suited for low bit rate applications. At 16 kbps, this system outperformed the two conventional ADPCM systems considered for comparison purpose, with the computational advantage of having (3N + B + 1) multiplications and one division compared to the first system and (B + 1) multiplications compared to the second system, where N is the predictor order and B is the ill number of bits/sample. However, EPE requires one extra exponentiation operation. Further, modifications are done to improve the quantizer performance. These include introduction of additional step sizes in the quantizer and modifications aimed at improving the dynamic range and noisy channel performances. The thesis has been organized in the following manner: The significance of a computationally simple power estimation scheme in the context of adaptive quantization and prediction is brought out in chapter 1. This chapter also presents a review of those speech digitization methods which are relevant to the topic of this thesis. A detailed introduction and study of the Exponential power Estimation technique is given in chapter 2. The application of EPE in adaptive quantization is dealt with in chapter 3. In Chapter 4, the development of a computationally simple ADPCM system (16 kbps) which makes use of EPE for adapting its quantizer and predictor is considered. The results are summarized in chapter 5. The thesis concludes with some suggestions for further work in this- area.

URI

https://etd.iisc.ac.in/handle/2005/7296

Collections

Electrical Communication Engineering (ECE) [474]