Abstract:
Random arrays of single molecules are provided for carrying out large scale analyzes, particularly of biomolecules, such as genomic DNA, cDNAs, proteins, and the like. In one aspect, arrays of the invention comprise concatemers of DNA fragments that are randomly disposed on a regular array of discrete spaced apart regions, such that substantially all such regions contain no more than a single concatemer. Preferably, such regions have areas substantially less than 1 μm2 and have nearest neighbor distances that permit optical resolution of on the order of 109 single molecules per cm2. Many analytical chemistries can be applied to random arrays of the invention, including sequencing by hybridization chemistries, sequencing by synthesis chemistries, SNP detection chemistries, and the like, to greatly expand the scale and potential applications of such techniques.
Abstract:
Various short reads can be grouped and identified as coming from a same long DNA fragment (e.g., by using wells with a relatively low-concentration of DNA). A histogram of the genomic coverage of a group of short reads can provide the edges of the corresponding long fragment (pulse). The knowledge of these pulses can provide an ability to determine the haploid genome and to identify structural variations.
Abstract:
The present invention is directed to compositions and methods for nucleic acid identification and detection. Compositions and methods of the present invention include extracting and fragmenting target nucleic acids from a sample, using the fragmented target nucleic acids to produce target nucleic acid templates and subjecting those target nucleic acid templates to amplification methods to form nucleic acid nanoballs. The invention also includes methods of detecting and identifying sequences using various sequencing applications, including sequencing by ligation methods.
Abstract:
A scalable reaction and detection system for automated high throughput sequencing of nucleic acids involving a combination of chemical processes and observation processes independent of the chemistry processes. Discrete functional units may be configured in a manner that allows the system to interchangeably utilize different sequencing reaction components in conjunction with discrete apparatus components for optical image collection and/or analysis.
Abstract:
Techniques perform de novo assembly. The assembly can use labels that indicate origins of the nucleic acid molecules. For example, a representative set of labels identified from initial reads that overlap with a seed can be used. Mate pair information can be used. A sequence read that aligns to an end of a contig can lead to using the other sequence read of a mate pair, and the other sequence read can be used to determine which branch to use to extend, e.g., in an external cloud or helper contig. A kmer index can include labels indicating an origin of each of the nucleic acid molecules that include each kmer, memory addresses of the reads that correspond to each kmer in the index, and a position in each of the mate pairs that includes the kmer. Haploid seeds can also be determined using polymorphic loci identified in a population.
Abstract:
An imaging system is provided wherein a positioning stage is translated with respect to an objective lens component and a scan mirror is repositioned while a two-dimensional image is made of a biochemical site on a substrate. In an example embodiment, an imaging system comprises a camera, an objective lens component, a positioning stage, and a scan mirror controllable by a servo system that synchronizes movement of the positioning stage and the tilting of the scan mirror so that the substrate image is maintained stable during imaging of the continuously moving positioning stage.
Abstract:
A method and associated system for imaging high density biochemical arrays comprises one or more imaging channels that share a common objective lens and a corresponding one or more time delay integration-type imaging cameras with optical alignment mechanisms that permit independent inter-channel and intra-channel adjustment of each of four degrees: X, Y, rotation and scale. The imaging channels are configured to independently examine different spectra of the image of the biochemical arrays.
Abstract:
The present invention is directed to treatment of nucleic acid molecules that are attached or associated with solid supports for biochemical analysis, including nucleic acid sequencing. After loading on the solid support, the nucleic acid molecules are treated with a composition comprising a condensing agent, a volume excluding agent, or both, then treated with a composition comprising a protein.
Abstract:
The present invention provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.
Abstract:
This disclosure provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.