A system and method are disclosed for providing efficient data storage. A
data stream comprising a plurality of data segments is received. The
system determines whether one of the plurality of data segments has been
stored previously using a summary in a low latency memory; in the event
that the data segment is determined not to have been stored previously,
assigning an identifier to the data segment.