A microprocessor and a related compiler support a local cache block flush
instruction for which an execution unit of the processor determines an
effective address. The processor forces all pending references to a cache
block corresponding to the determined effective address to commit to the
cache subsystem. If the referenced cache block is modified in the local
cache (the cache subsystem corresponding to the processor executing the
instruction), the block is written back to main memory. If the referenced
block is valid in the local cache it is invalidated, but only in the
local cache. If the referenced block is not valid in the local cache,
no invalidation occurs. A remote processor that receives a local cache
block flush instruction from another processor via the system ignores the
instruction.
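
The behavior described above can be illustrated with a small software model. This is only a sketch under stated assumptions, not the claimed hardware: the names `Processor`, `Line`, `local_flush`, and `block_of` are hypothetical, the 128-byte block size is an assumption (no size is given above), each block holds a single value for simplicity, and only stores are modeled as pending references. Remote processors are modeled by simply never being touched by the flush.

```python
from dataclasses import dataclass, field

BLOCK_SIZE = 128  # assumed block size in bytes; not specified in the text


def block_of(effective_address: int) -> int:
    """Return the base address of the cache block containing the address."""
    return effective_address - (effective_address % BLOCK_SIZE)


@dataclass
class Line:
    data: int               # simplification: one value per cache block
    modified: bool = False  # True if the block is dirty in this cache


@dataclass
class Processor:
    name: str
    cache: dict = field(default_factory=dict)    # block base -> Line (valid blocks only)
    pending: list = field(default_factory=list)  # uncommitted stores: (address, value)

    def store(self, address: int, value: int) -> None:
        """Queue a store that has not yet committed to the cache subsystem."""
        self.pending.append((address, value))

    def commit_pending(self, block: int) -> None:
        """Commit every pending store that targets the given block."""
        remaining = []
        for address, value in self.pending:
            if block_of(address) == block:
                self.cache[block] = Line(data=value, modified=True)
            else:
                remaining.append((address, value))
        self.pending = remaining


def local_flush(cpu: Processor, effective_address: int, memory: dict) -> None:
    """Model of the local cache block flush for the executing processor only."""
    block = block_of(effective_address)
    cpu.commit_pending(block)        # force pending references to commit
    line = cpu.cache.get(block)
    if line is None:
        return                       # not valid locally: no invalidation
    if line.modified:
        memory[block] = line.data    # modified locally: write back to main memory
    del cpu.cache[block]             # invalidate, but only in the local cache


# Demonstration with two processors sharing one main memory.
memory = {0: 1, 128: 5}
p0 = Processor("p0")
p1 = Processor("p1")

# Block 128 is held clean by both processors.
p0.cache[128] = Line(data=5)
p1.cache[128] = Line(data=5)

# p0 has an uncommitted store to block 0.
p0.store(0x10, 42)

local_flush(p0, 0x20, memory)   # same block as 0x10: commit, write back, invalidate
local_flush(p0, 132, memory)    # clean shared block: invalidated in p0 only
local_flush(p0, 0x300, memory)  # block not valid in p0: nothing happens
```

In this model, the remote copy of block 128 in `p1` survives the flush issued by `p0`, which corresponds to remote processors ignoring the instruction.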