Abstract:
This disclosure provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.
Abstract:
The present invention is directed to methods and compositions for acquiring nucleotide sequence information of target sequences. In particular, the present invention provides methods and compositions for improving the efficiency of sequencing reactions by using fewer labels to distinguish between nucleotides and by detecting nucleotides at multiple detection positions in a target sequence.
Abstract:
Methods, systems, and apparatuses are provided for creating and using a machine-leaning model to call a base at a position of a nucleic acid based on intensity values measured during a production sequencing run. The model can be trained using training data from training sequencing runs performed earlier. The model is trained using intensity values and assumed sequences that are determined as the correct output. The training data can be filtered to improve accuracy. The training data can be selected in a specific manner to be representative of the type of organism to be sequenced. The model can be trained to use intensity signals from multiple cycles and from neighboring nucleic acids to improve accuracy in the base calls.
Abstract:
A method and associated system for imaging high density biochemical arrays comprises one or more imaging channels that share a common objective lens and a corresponding one or more time delay integration-type imaging cameras with optical alignment mechanisms that permit independent inter-channel and intra-channel adjustment of each of four degrees: X, Y, rotation and scale. The imaging channels are configured to independently examine different spectra of the image of the biochemical arrays.
Abstract:
This disclosure provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.
Abstract:
Methods for determining the copy number of a genomic region at a detection position of a target sequence in a sample are disclosed. Genomic regions of a target sequence in a sample are sequenced and measurement data for sequence coverage is obtained. Sequence coverage bias is corrected and may be normalized against a baseline sample. Hidden Markov Model (HMM) segmentation, scoring, and output are performed, and in some embodiments population-based no-calling and identification of low-confidence regions may also be performed. A total copy number value and region-specific copy number value for a plurality of regions are then estimated.
Abstract:
The present invention is directed to compositions and methods for nucleic acid identification and detection. Compositions and methods of the present invention include extracting and fragmenting target nucleic acids from a sample, using the fragmented target nucleic acids to produce target nucleic acid templates and subjecting those target nucleic acid templates to amplification methods to form nucleic acid nanoballs. The invention also includes methods of detecting and identifying sequences using various sequencing applications, including sequencing by ligation methods.
Abstract:
An array chip design is provided where the chip includes a field region arranged with sites according to a first pitch and at least one track region having a one-dimensional site pattern arranged according to a second pitch that is less dense and is an integer multiple of the first pitch so that observation through pixel-based sensors using one-dimensional quad-cell averaging can be applied in the track region, thereby to attain alignment of the chip to pixel-based optical instrumentation with a higher density of sites.
Abstract:
This disclosure provides technology for ordering sequence information derived from one or more target polynucleotides. In one aspect, one or more tiers or levels of fragmentation and aliquoting are generated, after which sequence information is obtained from fragments in a final level or tier. Each fragment in such final tier is from a particular aliquot, which, in turn, is from a particular aliquot of a prior tier, and so on. For every fragment of an aliquot in the final tier, the aliquots from which it was derived at every prior tier is known, or can be discerned. Thus, identical sequences from overlapping fragments from different aliquots can be distinguished and grouped as being derived from the same or different fragments from prior tiers. When the fragments in the final tier are sequenced, overlapping sequence regions of fragments in different aliquots are used to register the fragments so that non-overlapping regions are ordered. In one aspect, this process is carried out in a hierarchical fashion until the one or more target polynucleotides are characterized, e.g. by their nucleic acid sequences, or by an ordering of sequence segments, or by an ordering of single nucleotide polymorphisms (SNPs), or the like.
Abstract:
The present invention is directed to methods and compositions for acquiring nucleotide sequence information of target sequences. In particular, the present invention provides methods and compositions for improving the efficiency of sequencing reactions by using fewer labels to distinguish between nucleotides and by detecting nucleotides at multiple detection positions in a target sequence.