The present invention provides a high-speed, memory efficient parallel
coding technique for embedded block coding with optimized truncation
(EBCOT) used in still image compression. Attributing to parallel
processing method and structure, it processes a discrete wavelet
transform (DWT) coefficient at a clock cycle without any state variable
stored. Therefore, the need of state variable memory can be avoid and the
external memory bandwidth can be reduced. With the same cost of chip-area
and lower power consumption, the processing rate of this invention is
several times higher than conventional schemes. Furthermore, the present
invention processes 50 M coefficients per second at 100 MHz and can
encode lossless HDTV 720 p resolution pictures at 30 fps in real time.