An approach for loading software on a plurality of processors is presented. A processing
unit (PU) retrieves a file from system memory and loads it into its
internal memory. The PU extracts a processor type from the file's header;
the processor type identifies whether the file should execute on the PU or
on a synergistic processing unit (SPU). If an SPU should execute the file,
the PU transfers the file to the SPU by direct memory access (DMA) for
execution. In one embodiment, the file
is a combined file that includes both PU and SPU code. In this
embodiment, the PU identifies one or more section headers in the file
that indicate SPU code embedded within the combined file, extracts that
SPU code from the combined file, and transfers the extracted code by DMA
to an SPU for execution.
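
The dispatch described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the byte layout (a "DEMO" magic, a one-byte processor type, and length-prefixed sections), the constants, and the callbacks dma_to_spu and run_on_pu are all hypothetical stand-ins and do not come from the description, which does not specify a file format. The DMA transfer itself is modeled as a plain function call.

```python
import struct

# Hypothetical toy layout (an assumption, not the real file format):
#   header : 4-byte magic, 1-byte processor type, 1-byte section count
#   section: 1-byte section type, 4-byte big-endian length, then payload
MAGIC = b"DEMO"
TYPE_PU, TYPE_SPU, TYPE_COMBINED = 0, 1, 2
SECTION_PU_CODE, SECTION_SPU_CODE = 0, 1


def build_file(proc_type, sections):
    """Serialize (section_type, payload) pairs into the toy layout."""
    out = MAGIC + struct.pack(">BB", proc_type, len(sections))
    for sec_type, payload in sections:
        out += struct.pack(">BI", sec_type, len(payload)) + payload
    return out


def parse_file(image):
    """Return (processor_type, [(section_type, payload), ...])."""
    if image[:4] != MAGIC:
        raise ValueError("bad magic")
    proc_type, count = struct.unpack_from(">BB", image, 4)
    offset = 6
    sections = []
    for _ in range(count):
        sec_type, length = struct.unpack_from(">BI", image, offset)
        offset += 5  # size of the section header
        sections.append((sec_type, image[offset:offset + length]))
        offset += length
    return proc_type, sections


def load(image, dma_to_spu, run_on_pu):
    """Route a loaded file according to the processor type in its header."""
    proc_type, sections = parse_file(image)
    if proc_type == TYPE_PU:
        run_on_pu(image)
    elif proc_type == TYPE_SPU:
        dma_to_spu(image)  # the whole file goes to an SPU
    elif proc_type == TYPE_COMBINED:
        # Section headers mark which parts of a combined file are SPU code.
        for sec_type, payload in sections:
            if sec_type == SECTION_SPU_CODE:
                dma_to_spu(payload)
            else:
                run_on_pu(payload)
    else:
        raise ValueError("unknown processor type")


if __name__ == "__main__":
    combined = build_file(
        TYPE_COMBINED,
        [(SECTION_PU_CODE, b"pu-code"), (SECTION_SPU_CODE, b"spu-code")],
    )
    load(
        combined,
        dma_to_spu=lambda data: print("DMA to SPU:", data),
        run_on_pu=lambda data: print("run on PU:", data),
    )
```

Passing the transfer and execution steps in as callbacks keeps the routing decision separate from the hardware-specific mechanism, which is the part the description leaves to the embodiment.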