A system, method, and computer program product for automatically eliminating
the distribution information that would otherwise be available for
reconstruction from a disguised dataset. The invention flattens input numerical values into a
substantially uniformly distributed dataset, then maps the uniformly
distributed dataset into equivalent data in a target distribution. The
invention allows the incremental encryption of new values in an encrypted
database while leaving existing encrypted values unchanged. The
flattening comprises (1) partitioning, (2) mapping, and (3) saving
auxiliary information about the data processing; this auxiliary
information is encrypted and is not updated. The partitioning is based on
the minimum description length (MDL) principle and includes a growth phase
that divides a space into fine partitions and a prune phase that merges
some partitions together.
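
The flatten-then-map pipeline described above can be illustrated with a minimal Python sketch. It is not the claimed method: equal-width buckets stand in for the MDL-based growth and prune phases, each bucket count starts at one so the mapping stays strictly increasing, and the target distribution (density proportional to x on [0, scale]) is chosen only because its inverse CDF is simple. The names `build_partitions`, `flatten`, `to_target`, and `encrypt` are hypothetical, and new values are assumed to fall within the original value range.

```python
import bisect
import math


def build_partitions(values, num_buckets):
    """Return bucket edges and cumulative counts for the input values.

    Assumption: equal-width buckets replace the MDL-based partitioning.
    """
    lo, hi = min(values), max(values)
    if lo == hi:
        raise ValueError("need at least two distinct values")
    width = (hi - lo) / num_buckets
    edges = [lo + i * width for i in range(num_buckets)] + [hi]
    # Start every count at 1 so no bucket is empty (keeps the map strictly increasing).
    counts = [1] * num_buckets
    for v in values:
        idx = min(bisect.bisect_right(edges, v) - 1, num_buckets - 1)
        counts[idx] += 1
    cum = [0]
    for c in counts:
        cum.append(cum[-1] + c)
    return edges, cum


def flatten(v, edges, cum):
    """Map v to [0, 1] with a piecewise-linear approximation of the CDF."""
    idx = bisect.bisect_right(edges, v) - 1
    idx = max(0, min(idx, len(edges) - 2))
    left, right = edges[idx], edges[idx + 1]
    frac = (v - left) / (right - left)
    return (cum[idx] + frac * (cum[idx + 1] - cum[idx])) / cum[-1]


def to_target(u, scale=1000.0):
    """Inverse CDF of the illustrative target: density 2x/scale^2 on [0, scale]."""
    return scale * math.sqrt(u)


def encrypt(v, edges, cum, scale=1000.0):
    """Flatten v to a near-uniform value, then map it into the target distribution."""
    return to_target(flatten(v, edges, cum), scale)


# The (edges, cum) pair plays the role of the saved auxiliary information.
values = [1, 2, 2, 3, 10, 50, 51, 52, 100]
edges, cum = build_partitions(values, num_buckets=4)
encrypted = {v: encrypt(v, edges, cum) for v in values}
```

Because `edges` and `cum` are fixed once they are built, a value that arrives later can be passed through `encrypt` without changing any previously produced output, which mirrors the incremental property stated in the abstract. In this sketch the auxiliary information is kept in the clear; the abstract states that it is encrypted.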