The invention provides methods and systems for summarizing multiple
continuous update streams such that an approximate answer to a query over
one or more of the continuous update streams (such as a Query requiring a
join operation followed by a duplicate elimination step) may be rapidly
provided. The systems and methods use multiple (parallel) Join Distinct
(JD) Sketch data structures corresponding to hash buckets of at least one
initial attribute.