A system and method process data elements with instruction-level parallelism.
An
instruction buffer holds a first instruction and a second instruction, the first
instruction being associated with a first thread, and the second instruction being
associated with a second thread. A dependency counter counts satisfaction of dependencies
of instructions of the second thread on instructions of the first thread. An instruction
control unit is coupled to the instruction buffer and the dependency counter, the
instruction control unit increments and decrements the dependency counter according
to dependency information included in instructions. An execution switch is coupled
to the instruction control unit and the instruction buffer, and the execution switch
routes instructions to instruction execution units.