The invention provides compositions and methods for preparing DNA
sequencing libraries. In particular, the method relates to preparing DNA
sequencing libraries from kilobase scale nucleic acids. The invention
also provides methods for assembling short read sequencing data into
longer contiguous sequences. The method is useful for various
applications in genomics, including genome assembly, full length cDNA
sequencing, metagenomics, and the analysis of repetitive sequences of
assembled genomes.