A method and apparatus is disclosed for optimizing the mapping of processes
within a parallelized software program among individual processors in a
high performance, scalable computing system. The resulting mapping
enhances the usage of the available inter-processor communications
bandwidth and balances the computational processing load among the
processors of the system, improving upon mappings generated by skilled
programmers.