A method of generating a frequency distribution of scores comprising: a)
generating mass data for a biological molecule; b) generating mass data
for a series of random hypothetical biological molecules; c) calculating a
frequency distribution of high similarity scores between mass data of each
molecule generated in steps a and b.