A SIMD processor efficiently utilizes its hardware resources to achieve
higher data processing throughput. The effective width of a SIMD
processor is extended by clocking the instruction processing side of the
SIMD processor at a fraction of the rate of the data processing side and
by providing multiple execution pipelines, each with multiple data paths.
As a result, higher data processing throughput is achieved while an
instruction is fetched and issued once per clock. This configuration also
allows a large group of threads to be clustered and executed together
through the SIMD processor so that greater memory efficiency can be
achieved for certain types of operations like texture memory accesses
performed in connection with graphics processing.