An apparatus and method for efficient filtering and convolution of content
data are described. The method includes organizing, in response to
executing a data shuffle instruction, a selected portion of data within a
destination data storage device. The portion of data is organized
according to an arrangement of coefficients within a coefficient data
storage device. Once the data is organized, a plurality of summed-product
pairs are generated in response to executing a multiply-accumulate
instruction. The plurality of summed-product pairs are formed by multiplying
data within the destination data storage device by coefficients within the
coefficient data storage device. Once generated, adjacent summed-product pairs are
added in response to executing an adjacent-add instruction. The adjacent
summed-product pairs are added within the destination data storage device
to form one or more data processing operation results. Once the one or
more data processing operation results are formed, the results are stored
within a memory device.
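
The shuffle, multiply-accumulate, adjacent-add, and store sequence described above can be sketched in portable scalar C. This is only an illustrative emulation under stated assumptions, not the claimed implementation: the lane count (eight 16-bit lanes), the 4-tap filter, the shuffle index pattern, and every function name (`shuffle16`, `multiply_accumulate`, `adjacent_add`, `fir4_two_outputs`) are hypothetical and do not come from the text. The filter is applied without reversing the taps.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum { LANES = 8 }; /* assumption: eight 16-bit lanes per "register" */

/* Data shuffle: dst[i] = src[idx[i]], arranging data to line up with the
 * coefficient layout. */
static void shuffle16(int16_t dst[LANES], const int16_t *src, size_t n,
                      const uint8_t idx[LANES])
{
    for (size_t i = 0; i < LANES; i++) {
        assert(idx[i] < n); /* every index must address a valid sample */
        dst[i] = src[idx[i]];
    }
}

/* Multiply-accumulate: multiply lane by lane, then sum each neighbouring
 * pair of products into one 32-bit "summed-product pair".
 * Assumes each sum fits in 32 bits. */
static void multiply_accumulate(int32_t out[LANES / 2],
                                const int16_t data[LANES],
                                const int16_t coeff[LANES])
{
    for (size_t i = 0; i < LANES / 2; i++) {
        out[i] = (int32_t)data[2 * i] * coeff[2 * i]
               + (int32_t)data[2 * i + 1] * coeff[2 * i + 1];
    }
}

/* Adjacent-add: add neighbouring summed-product pairs.
 * Assumes each sum fits in 32 bits. */
static void adjacent_add(int32_t out[LANES / 4], const int32_t in[LANES / 2])
{
    for (size_t i = 0; i < LANES / 4; i++) {
        out[i] = in[2 * i] + in[2 * i + 1];
    }
}

/* Two outputs of a 4-tap filter: y[k] = sum over j of taps[j] * x[k + j],
 * for k = 0 and k = 1. Requires n >= 5 samples in x. The results are
 * written to y, which stands in for the final store to memory. */
static void fir4_two_outputs(int32_t y[2], const int16_t *x, size_t n,
                             const int16_t taps[4])
{
    /* Hypothetical shuffle pattern: window 0 in lanes 0-3, window 1 in 4-7. */
    static const uint8_t idx[LANES] = {0, 1, 2, 3, 1, 2, 3, 4};
    int16_t coeff[LANES];
    int16_t data[LANES];
    int32_t pairs[LANES / 2];

    for (size_t i = 0; i < LANES; i++) {
        coeff[i] = taps[i % 4]; /* taps repeated once per output window */
    }
    shuffle16(data, x, n, idx);
    multiply_accumulate(pairs, data, coeff);
    adjacent_add(y, pairs);
}
```

In this sketch the data, not the coefficients, is rearranged per output, so the same coefficient layout can be reused for every group of outputs. Calling `fir4_two_outputs` with samples `{1, 2, 3, 4, 5}` and taps `{1, 2, 3, 4}` computes the two sliding-window sums 1·1 + 2·2 + 3·3 + 4·4 and 1·2 + 2·3 + 3·4 + 4·5.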