A degree of local addressing is provided for a processing element array by partitioning
a register file memory into blocks (e.g., of data columns or data rows) and adding a select
column or row associated with each block. The select column or row allows each processing
element to read data from or write data to a different register file address.
Global addressing may also be implemented by reading data from or writing data
to the same register file address for each processing element. The invention provides
the advantage of faster overall execution time. In addition, the additional area
overhead is minimal, given the need to pitch match the processing element array to
a main memory.
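
The difference between the two addressing modes can be illustrated with a small software model. This is a minimal sketch under stated assumptions, not the circuit described here: it assumes that a broadcast offset picks a register within a block and that a per-element select value picks the block. The names `ProcessingElement`, `select`, `global_read`, and `local_read` are hypothetical and chosen only for illustration.

```python
class ProcessingElement:
    """Toy model of one processing element with a partitioned register file."""

    def __init__(self, num_blocks, block_size):
        self.num_blocks = num_blocks
        self.block_size = block_size
        # Flat register file, viewed as num_blocks blocks of block_size entries.
        self.regs = [0] * (num_blocks * block_size)
        # Assumption: one select value per element chooses the active block.
        self.select = 0

    def local_address(self, offset):
        """Combine the broadcast offset with this element's own block select."""
        if not 0 <= offset < self.block_size:
            raise IndexError("offset outside block")
        return self.select * self.block_size + offset


def global_read(pes, address):
    """Global addressing: every element reads the same register file address."""
    return [pe.regs[address] for pe in pes]


def local_read(pes, offset):
    """Local addressing: each element reads from the block its select chooses."""
    return [pe.regs[pe.local_address(offset)] for pe in pes]


def local_write(pes, offset, values):
    """Local addressing for writes, one value per element."""
    for pe, value in zip(pes, values):
        pe.regs[pe.local_address(offset)] = value


# Four elements, each with 2 blocks of 4 registers.
pes = [ProcessingElement(num_blocks=2, block_size=4) for _ in range(4)]
for i, pe in enumerate(pes):
    # Fill each register with a value that encodes element index and address.
    pe.regs = [10 * i + a for a in range(len(pe.regs))]
    # Odd-numbered elements select block 1, even-numbered elements block 0.
    pe.select = i % 2

same_address = global_read(pes, 2)   # every element reads address 2
per_element = local_read(pes, 2)     # elements read address 2 or 6
```

In this model a single broadcast offset reaches a different address in each element, which is the degree of local addressing the select column or row provides; the number of distinct addresses reachable in one access is bounded by the number of blocks.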