On variable rate frame independent predictive speech coding: Re-engineering ILBC

Christopher M. Garrido, Manohar N. Murthi, Søren Vang Andersen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Scopus citations

Abstract

The Internet Low Bit-rate Coder (iLBC) is now widely used for Voice over Internet Protocol (VoIP) applications. Unlike speech coders such as those based on Code Excited Linear Prediction (CELP), the iLBC achieves superior robustness to packet loss by avoiding inter-frame coding dependencies. While robustness to packet loss is essential, a VoIP codec should also possess the flexibility to change its source coding rate in order to counter network congestion and facilitate joint source channel coding for wireless channels. Previously, we presented a new variation of the iLBC encoding procedure which yielded a more efficient, rate-flexible result. In an effort to improve performance at lower source rates, we present various improvements to the original framework. Specifically, we reallocate bits from the Adaptive Codebook) procedure; reduce the length of the start state vector; utilize an adaptive pulse gain quantization scheme; and extend the use of entropy coding. Overall, the various combined improvements result in the modified iLBC (with entropy coding) achieving a rate reduction of 2.0 to 2.9 kbps when compared to the original fixed-rate iLBC without any loss in quality. In comparisons with Adaptive Muiti-Rate (AMR), the modified iLBC coder remarkably exhibits equivalent Perceptual Evaluation of Speech Quality (PESQ) scores as the AMR coder at 10.2 and 12.2 kbps, and out-performs AMR for all packet loss rates. This is a significant result as the modified iLBC performs equivalent to AMR without exploiting inter-frame redundancies.

Original languageEnglish (US)
Title of host publication2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings
PagesI717-I720
StatePublished - Dec 1 2006
Event2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 - Toulouse, France
Duration: May 14 2006May 19 2006

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
ISSN (Print)1520-6149

Other

Other2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006
CountryFrance
CityToulouse
Period5/14/065/19/06

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'On variable rate frame independent predictive speech coding: Re-engineering ILBC'. Together they form a unique fingerprint.

Cite this