Abstract:
The invention relates to an automated method for high-throughput DNA sequencing from high density DNA arrays by (a) initiating a first sequencing reaction on a first high density DNA array; and imaging said first high density DNA array using a detector, and (b) initiating a first sequencing reaction on a second high density DNA array; and imaging said second high density DNA array using the detector, wherein the first sequencing reaction in (a) is initiated before the first sequencing reaction in (b) is initiated such that the sequencing reactions in (a) and (b) are staggered. By using asynchronous sequencing reactions and imaging two separate arrays using one detector, imaging can be carried out on one array while sequencing reactions are carried out on one the other, substrate, the other substrate is imaged, reducing the idle time of the imaging system.
Abstract:
This application discloses methods of producing a DNA strand for sequencing, as well as genetic constructs, libraries, and arrays using DNA strands produced according to these methods. The application also discloses methods of sequencing using the DNA strands, genetic constructs, libraries, and arrays produced. In certain aspects, DNA being sequenced includes a target sequence and at least one adaptor sequence.
Abstract:
The present invention provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.
Abstract:
Provided herein are compositions, methods, and kits for enriching for one or more nucleic acid sequences of interest in a sample. The methods include providing a circular ligase, one or more 5′ hook probes and/or one or more 3′ hook probes and contacting the sample comprising the nucleic acids with the circular ligase and one or more 5′ hook probes and/or one or more 3′ hook probes under conditions to allow the hook probes to selectively bind to the one or more nucleic acid sequences of interest, and under conditions to form one or more hook products, each hook product comprising the hook probes and the one or more nucleic acid sequences of interest.
Abstract:
A high density DNA array comprising a patterned surface, said surface comprising a pattern of small DNA binding regions separated by a non-DNA binding surface, wherein the DNA binding regions comprise DNA capture chemistry and the non-DNA binding surface does not have the DNA capture chemistry wherein more than 50% of the DNA binding regions in the array have single informative DNA species.
Abstract:
The present invention is directed to methods and compositions for acquiring nucleotide sequence information of target sequences. In particular, the present invention provides methods and compositions for improving the efficiency of sequencing reactions by using fewer labels to distinguish between nucleotides and by detecting nucleotides at multiple detection positions in a target sequence.
Abstract:
Random arrays of single molecules are provided for carrying out large scale analyzes, particularly of biomolecules, such as genomic DNA, cDNAs, proteins, and the like. In one aspect, arrays of the invention comprise concatemers of DNA fragments that are randomly disposed on a regular array of discrete spaced apart regions, such that substantially all such regions contain no more than a single concatemer. Preferably, such regions have areas substantially less than 1 μm2 and have nearest neighbor distances that permit optical resolution of on the order of 109 single molecules per cm2. Many analytical chemistries can be applied to random arrays of the invention, including sequencing by hybridization chemistries, sequencing by synthesis chemistries, SNP detection chemistries, and the like, to greatly expand the scale and potential applications of such techniques.
Abstract:
Various short reads can be grouped and identified as coming from a same long DNA fragment (e.g., by using wells with a relatively low-concentration of DNA). A histogram of the genomic coverage of a group of short reads can provide the edges of the corresponding long fragment (pulse). The knowledge of these pulses can provide an ability to determine the haploid genome and to identify structural variations.
Abstract:
The present invention is directed to compositions and methods for nucleic acid identification and detection. Compositions and methods of the present invention include extracting and fragmenting target nucleic acids from a sample, using the fragmented target nucleic acids to produce target nucleic acid templates and subjecting those target nucleic acid templates to amplification methods to form nucleic acid nanoballs. The invention also includes methods of detecting and identifying sequences using various sequencing applications, including sequencing by ligation methods.
Abstract:
Techniques perform de novo assembly. The assembly can use labels that indicate origins of the nucleic acid molecules. For example, a representative set of labels identified from initial reads that overlap with a seed can be used. Mate pair information can be used. A sequence read that aligns to an end of a contig can lead to using the other sequence read of a mate pair, and the other sequence read can be used to determine which branch to use to extend, e.g., in an external cloud or helper contig. A kmer index can include labels indicating an origin of each of the nucleic acid molecules that include each kmer, memory addresses of the reads that correspond to each kmer in the index, and a position in each of the mate pairs that includes the kmer. Haploid seeds can also be determined using polymorphic loci identified in a population.