Parallelization of word alignment for a text-to-text operation. The
training data is divided into multiple groups, and training is carried
out on each group on a separate processor. Several techniques can be
applied to increase the speed of the processing. The hookups can be
done only once and reused across all of the multiple iterations. Moreover,
the parallel operations can be applied only to the counts, since
collecting the counts may be the most time-consuming part.
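
The division of labor described above can be illustrated with a minimal sketch. It assumes an IBM Model 1 style expectation-maximization trainer, which the text does not name; the function names `collect_counts` and `train`, the `n_groups` parameter, and the toy corpus are all hypothetical. Only the count collection is handed to the workers, the merge and renormalization stay serial, and the grouping is set up once before the iteration loop. Threads stand in for separate processors here; any `concurrent.futures` executor, such as a `ProcessPoolExecutor`, could be passed instead.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def collect_counts(args):
    """Collect fractional alignment counts for one group of sentence pairs."""
    group, t, uniform = args
    counts = defaultdict(float)
    for src, tgt in group:
        for f in tgt:
            # Normalizer over every source word that could have produced f.
            z = sum(t.get((f, e), uniform) for e in src)
            for e in src:
                counts[(f, e)] += t.get((f, e), uniform) / z
    return dict(counts)


def train(corpus, iterations, n_groups, executor):
    """Assumed Model 1 style training with per-group count collection."""
    # The data is split once; the same groups are reused in every iteration.
    groups = [corpus[i::n_groups] for i in range(n_groups)]
    tgt_vocab = {f for _, tgt in corpus for f in tgt}
    uniform = 1.0 / len(tgt_vocab)  # starting probability for unseen pairs
    t = {}
    for _ in range(iterations):
        # Parallel part: each group's counts are computed by a worker.
        jobs = [(g, t, uniform) for g in groups]
        partials = executor.map(collect_counts, jobs)
        # Serial part: merge the partial counts and renormalize.
        counts, totals = defaultdict(float), defaultdict(float)
        for part in partials:
            for (f, e), c in part.items():
                counts[(f, e)] += c
                totals[e] += c
        t = {(f, e): c / totals[e] for (f, e), c in counts.items()}
    return t


if __name__ == "__main__":
    # Toy corpus of (source words, target words) pairs, invented for the demo.
    demo = [
        (["the", "house"], ["das", "haus"]),
        (["the", "book"], ["das", "buch"]),
        (["a", "book"], ["ein", "buch"]),
    ]
    with ThreadPoolExecutor(max_workers=3) as pool:
        table = train(demo, iterations=10, n_groups=3, executor=pool)
    for (f, e), p in sorted(table.items()):
        print(f"t({f}|{e}) = {p:.3f}")
```

Because the partial counts are simply summed, the result should match a single-group run up to floating-point rounding, so the number of groups can be chosen to fit the available processors.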