Techniques for linking non-coding and gene coding regions of a genome are
provided. In one aspect, a method of determining associations between
non-coding sequences and gene coding sequences in a genome of an organism
comprises the following steps. At least one conserved region is
identified from one or more non-coding sequences. Additional instances of
the conserved region are located in the untranslated or amino acid coding
regions of one or more genes in the organism under consideration, and the
conserved region is associated with the one or more biological processes
in which these one or more genes participate.