In one embodiment, a node comprises, integrated onto a single integrated
circuit chip (in some embodiments), a plurality of processor cores and a
node controller coupled to the plurality of processor cores. The node
controller is coupled to receive an external request transmitted to the
node, and is configured to transmit a corresponding request to at least a
subset of the plurality of processor cores responsive to the external
request. The node controller is configured to receive respective
responses from each processor core of the subset. Each processor core
transmits the respective response independently in response to servicing
the corresponding request and is capable of transmitting the response on
different clock cycles than other processor cores. The node controller is
configured to transmit an external response to the external request
responsive to receiving each of the respective responses.