A scalable engine has multiple datapaths, each of which is a unique
multi-function floating-point pipeline capable of performing a
four-component dot product on data in a single pass through the
datapath. This allows matrix transformations to be computed efficiently
and with high data throughput, without substantially increasing the cost
or amount of hardware required to implement the pipeline.
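
To illustrate why a single-pass four-component dot product helps with
matrix transformations, the following is a minimal software sketch, not
a description of the hardware itself. It assumes a 4x4 matrix stored as
four rows and a four-component (homogeneous) vector; the names dot4,
transform, translate, and point are hypothetical and chosen only for
this example. Transforming the vector then reduces to four independent
dot products, one per matrix row.

```python
def dot4(a, b):
    """Four-component dot product of two length-4 sequences.

    Stands in for the work one datapath is described as doing in a
    single pass (assumption: plain multiply-and-sum, no rounding model).
    """
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] + a[3] * b[3]


def transform(matrix, vec):
    """Apply a 4x4 matrix (a list of four rows) to a 4-component vector.

    Each output component is one dot4 of a matrix row with the vector,
    so the four components do not depend on one another.
    """
    return [dot4(row, vec) for row in matrix]


# Hypothetical example data: a homogeneous translation by (2, 3, 4).
translate = [
    [1.0, 0.0, 0.0, 2.0],
    [0.0, 1.0, 0.0, 3.0],
    [0.0, 0.0, 1.0, 4.0],
    [0.0, 0.0, 0.0, 1.0],
]
point = [1.0, 1.0, 1.0, 1.0]

result = transform(translate, point)
print(result)  # [3.0, 4.0, 5.0, 1.0]
```

Because the four row products are independent, an engine with several
such datapaths could in principle evaluate them in parallel, which is
one plausible reading of how the design scales.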