A system and method for organizing raw data from one or more sources. The
content of the raw data is converted into an appropriate number system
and stored in a format that facilitates the use of efficient mathematical
operations. The number system is selected to handle each of the various
elements, characters, or other representative indicia found in the raw
data. Furthermore, the number system is selected so that the numerical
data retains semantic significance with respect to the raw data. Once
converted into the numeric format, the data is processed using various
techniques to extract the best information from the raw data into a
distilled database.