Towards a synergistic multistage speech coder

Manohar Murthi, Bhaskar D. Rao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

In this paper, we propose some new modeling techniques that provide a more synergistic approach to multistage time-domain speech compression. In particular, we propose a new error criterion for determining all-pole filters, and a unique method for jointly coding the pulse information in excitation vectors. The new error criterion for determining all-pole filters is based upon minimizing the sum of the residual signal's absolute values raised to a power less than one. It is shown to be a desirable cost function for yielding residual signals that are more sparse, and consequently better suited for multistage compression than Linear Prediction residuals. Statistical reasons supporting the new criterion are also provided. Furthermore, exploiting the properties of, and the relationship between, the Linear Prediction and Minimum Variance spectra, we propose a novel parameter set for jointly coding the excitation vector's pulse position, sign, and gain information.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
PublisherIEEE
Pages369-372
Number of pages4
Volume1
StatePublished - 1998
Externally publishedYes
EventProceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. Part 1 (of 6) - Seattler, WA, USA
Duration: May 12 1998May 15 1998

Other

OtherProceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. Part 1 (of 6)
CitySeattler, WA, USA
Period5/12/985/15/98

Fingerprint

coders
Poles
linear prediction
coding
poles
Cost functions
filters
pulses
excitation
costs

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics
  • Software

Cite this

Murthi, M., & Rao, B. D. (1998). Towards a synergistic multistage speech coder. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 1, pp. 369-372). IEEE.

Towards a synergistic multistage speech coder. / Murthi, Manohar; Rao, Bhaskar D.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1 IEEE, 1998. p. 369-372.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Murthi, M & Rao, BD 1998, Towards a synergistic multistage speech coder. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 1, IEEE, pp. 369-372, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. Part 1 (of 6), Seattler, WA, USA, 5/12/98.
Murthi M, Rao BD. Towards a synergistic multistage speech coder. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1. IEEE. 1998. p. 369-372
Murthi, Manohar ; Rao, Bhaskar D. / Towards a synergistic multistage speech coder. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1 IEEE, 1998. pp. 369-372
@inproceedings{84668d6a5a1a40fda350166f7d3377a6,
title = "Towards a synergistic multistage speech coder",
abstract = "In this paper, we propose some new modeling techniques that provide a more synergistic approach to multistage time-domain speech compression. In particular, we propose a new error criterion for determining all-pole filters, and a unique method for jointly coding the pulse information in excitation vectors. The new error criterion for determining all-pole filters is based upon minimizing the sum of the residual signal's absolute values raised to a power less than one. It is shown to be a desirable cost function for yielding residual signals that are more sparse, and consequently better suited for multistage compression than Linear Prediction residuals. Statistical reasons supporting the new criterion are also provided. Furthermore, exploiting the properties of, and the relationship between, the Linear Prediction and Minimum Variance spectra, we propose a novel parameter set for jointly coding the excitation vector's pulse position, sign, and gain information.",
author = "Manohar Murthi and Rao, {Bhaskar D.}",
year = "1998",
language = "English (US)",
volume = "1",
pages = "369--372",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "IEEE",

}

TY - GEN

T1 - Towards a synergistic multistage speech coder

AU - Murthi, Manohar

AU - Rao, Bhaskar D.

PY - 1998

Y1 - 1998

N2 - In this paper, we propose some new modeling techniques that provide a more synergistic approach to multistage time-domain speech compression. In particular, we propose a new error criterion for determining all-pole filters, and a unique method for jointly coding the pulse information in excitation vectors. The new error criterion for determining all-pole filters is based upon minimizing the sum of the residual signal's absolute values raised to a power less than one. It is shown to be a desirable cost function for yielding residual signals that are more sparse, and consequently better suited for multistage compression than Linear Prediction residuals. Statistical reasons supporting the new criterion are also provided. Furthermore, exploiting the properties of, and the relationship between, the Linear Prediction and Minimum Variance spectra, we propose a novel parameter set for jointly coding the excitation vector's pulse position, sign, and gain information.

AB - In this paper, we propose some new modeling techniques that provide a more synergistic approach to multistage time-domain speech compression. In particular, we propose a new error criterion for determining all-pole filters, and a unique method for jointly coding the pulse information in excitation vectors. The new error criterion for determining all-pole filters is based upon minimizing the sum of the residual signal's absolute values raised to a power less than one. It is shown to be a desirable cost function for yielding residual signals that are more sparse, and consequently better suited for multistage compression than Linear Prediction residuals. Statistical reasons supporting the new criterion are also provided. Furthermore, exploiting the properties of, and the relationship between, the Linear Prediction and Minimum Variance spectra, we propose a novel parameter set for jointly coding the excitation vector's pulse position, sign, and gain information.

UR - http://www.scopus.com/inward/record.url?scp=0031645439&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031645439&partnerID=8YFLogxK

M3 - Conference contribution

VL - 1

SP - 369

EP - 372

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

PB - IEEE

ER -