Disclosed are scalable quantizers for audio and other signals characterized
by a non-uniform, perception-based distortion metric, that operate in a common
companded domain which includes both the base-layer and one or more enhancement-layers.
The common companded domain is designed to permit use of the same unweighted MSE
metric for optimal quantization parameter selection in multiple layers, exploiting
the statistical dependence of the enhancement-layer signal on the quantization
parameters used in the preceding layer. One embodiment features an asymptotically
optimal entropy coded uniform scalar quantizer. Another embodiment is an improved
bit rate scalable multi-layer Advanced Audio Coder (AAC) which extends the scalability
of the asymptotically optimal entropy coded uniform scalar quantizer to systems
with non-uniform base-layer quantization, selecting the enhancement-layer quantization
methodology to be used in a particular band based on the preceding layer quantization
coefficients. In the important case that the source is well modeled as Laplacian,
the optimal conditional quantizer is implementable by only two distinct switchable
quantizers depending on whether or not the previous quantizer identified the band
in question as a so-called "zero dead-zone:" Hence, major savings in bit rate are
recouped at virtually no additional computational cost. For example, the proposed
four layer scalable coder consisting of 16 kbps layers achieves performance close
to a 60 kbps non-scalable coder on the standard test database of 44.1 kHz audio.