This invention increases the available instruction level parallelism (IPC) of
CABAC encoding by decoupling the re-normalization loop and the bit-insertion task
required to create the encoded bit-stream. This makes all software implementations
of CABAC based encoding significantly faster on digital signal processors that
can exploit instruction level parallelism such as very long instruction word (VLIW)
digital signal processors. In a joint hardware/software implementation, this invention
employs existing Huffman variable length encoding hardware with minimum modifications.
The de-coupling of these two tasks of this invention exposes previously hidden
underlying instruction level parallelism and task level parallelism.