摘要:
The present invention provides methods and systems for assembling a genome from a shot-gun set of end sequenced DNA fragments. Specifically, the present invention provides a method of determining the genomic sequence (base sequence and orientation) of a complex genome using DNA sequence information generated from a collection of DNA fragments obtained from the genome. The present method is particularly useful in assembling genomes of at least 10 MB (up to 5 GB) and which are made up of at least 5% repetitive DNA sequences (up to 25% repetitive), but can be used also for smaller genomes with a lower percentage of repetitive DNA.