An enhanced psychoacoustic model based on the discrete wavelet packet transform

Xing He, Michael S. Scordilis

Research output: Contribution to journalArticle

17 Scopus citations

Abstract

The perception of acoustic information by humans is based on the detailed temporal and spectral analysis provided by the auditory processing of the received signal. The incorporation of this process in psychoacoustical computational models has contributed significantly both in the development of highly efficient audio compression schemes as well as in effective audio watermarking methods. In this paper, we present an approach based on the discrete wavelet packet transform, which closely mimics the multi-resolution properties of the human ear and also includes simultaneous and temporal auditory masking. Experimental results show that the proposed technique offers better masking capabilities and it reduces the signal-to-masking ratio when compared to related approaches, without introducing audible distortion. Those results have implications that are important both for audio compression by permitting further bit rate reduction, and for watermarking by providing greater signal space for information hiding.

Original languageEnglish (US)
Pages (from-to)738-755
Number of pages18
JournalJournal of the Franklin Institute
Volume343
Issue number7
DOIs
StatePublished - Nov 1 2006

Keywords

  • Auditory masking
  • Multi-resolution analysis
  • Psychoacoustic model
  • Wavelet packet transform

ASJC Scopus subject areas

  • Modeling and Simulation
  • Control and Optimization
  • Applied Mathematics
  • Electrical and Electronic Engineering
  • Control and Systems Engineering
  • Information Systems and Management
  • Signal Processing

Fingerprint Dive into the research topics of 'An enhanced psychoacoustic model based on the discrete wavelet packet transform'. Together they form a unique fingerprint.

  • Cite this