High throughput in data computations and processing is maintained while
minimizing latency. A binary tree architecture is provided in which two
trees are used simultaneously, and initiation of the trees is staggered
to allow for optimal use of bandwidth. These techniques are desirable for
erasure codes and other computations where the addition operator is
commutative. Additionally, a ternary tree architecture may be used, in
which three trees co-exist on the same set of nodes to maintain high
throughput while further improving latency.