Abstract:
A system for nucleic acid sequencing includes a machine-readable memory and a processor configured to execute machine-readable instructions. The instructions, when executed by the processor, cause the system to expose template polynucleotide strands in a plurality of defined spaces of a sensor array to a series of flows of nucleotide species, the series comprising a sequence of random flows; and obtain, for each of the series of flows of nucleotide species, a signal indicative of how many nucleotide incorporations occurred for that particular flow to determine a predicted sequence of nucleotides corresponding to the template polynucleotide strands.
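The relationship between a flow order and the per-flow incorporation counts can be illustrated with a minimal sketch. It assumes an idealized, noise-free signal and treats the input as the strand being synthesized; the function names (`random_flow_order`, `incorporations_per_flow`, `bases_from_counts`) are hypothetical and not taken from the abstract.

```python
import random


def random_flow_order(n_flows, seed=0):
    """Draw the nucleotide species for each flow uniformly at random."""
    rng = random.Random(seed)
    return "".join(rng.choice("ACGT") for _ in range(n_flows))


def incorporations_per_flow(bases, flows):
    """Idealized signal: how many bases are incorporated in each flow.

    `bases` is the strand being synthesized (assumption: no noise,
    no phasing errors, complete extension in every flow).
    """
    pos = 0
    counts = []
    for nuc in flows:
        n = 0
        # A flow extends the strand through a whole homopolymer run.
        while pos < len(bases) and bases[pos] == nuc:
            n += 1
            pos += 1
        counts.append(n)
    return counts


def bases_from_counts(counts, flows):
    """Rebuild the predicted sequence from per-flow counts."""
    return "".join(nuc * n for nuc, n in zip(flows, counts))


counts = incorporations_per_flow("TTACG", "TACGT")
print(counts)                              # [2, 1, 1, 1, 0]
print(bases_from_counts(counts, "TACGT"))  # TTACG
```

With a random flow order, the reconstructed bases are always a prefix of the true strand; how much of the strand is covered depends on the number of flows.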
Abstract:
A kit for use with a nucleic acid sequencing instrument can include a plurality of combinatorial barcode sequences meeting the following criteria: each of the combinatorial barcode sequences comprises a plurality of iterations of a sequence motif, where the sequence motif comprises a first nucleotide base from a first group of nucleotide bases followed by a second nucleotide base from a second group of nucleotide bases, the first group and the second group differing from each other; and the plurality of combinatorial barcode sequences includes at least 1,000,000 different barcode sequences.
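The size of such a barcode set follows from simple counting, sketched below. The groups `"AC"` and `"GT"` are illustrative assumptions only (the abstract does not name the groups), and the helper names are hypothetical. With two bases per group, each motif has 2 × 2 = 4 variants, so 10 iterations give 4^10 = 1,048,576 barcodes, just above the 1,000,000 threshold.

```python
from itertools import islice, product


def motif_variants(group1, group2):
    """All two-base motifs: one base from group1, then one from group2."""
    return [a + b for a in group1 for b in group2]


def barcodes(group1, group2, iterations):
    """Lazily enumerate every barcode built from `iterations` motifs."""
    variants = motif_variants(group1, group2)
    for combo in product(variants, repeat=iterations):
        yield "".join(combo)


def min_iterations(group1, group2, target):
    """Smallest number of motif iterations yielding at least `target` barcodes."""
    per_motif = len(group1) * len(group2)
    if per_motif < 2:
        raise ValueError("need at least two motif variants")
    iterations, total = 0, 1
    while total < target:
        total *= per_motif
        iterations += 1
    return iterations


# Assumed example groups; not specified in the source.
print(min_iterations("AC", "GT", 1_000_000))   # 10
print(list(islice(barcodes("AC", "GT", 2), 3)))  # ['AGAG', 'AGAT', 'AGCG']
```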
Abstract:
A compression method includes measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises at least one region associated with expected measured values and at least one region associated with unpredictable measured values; applying a first compression process to the waveform, the first compression process including an averaging of one or more frames in one or more portions of the waveform; and applying a second compression process to the waveform, the second compression process including a truncating of data corresponding to a portion of the waveform that is not related to a nucleotide incorporation component of the waveform.
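The two compression steps can be sketched as follows. This is a simplified illustration, assuming the waveform is a flat list of frame values, a fixed averaging group size, and a known frame window for the incorporation-related portion; the function names and parameters are hypothetical.

```python
def average_frames(frames, group_size):
    """First process: replace each group of consecutive frames by its mean."""
    averaged = []
    for start in range(0, len(frames), group_size):
        group = frames[start:start + group_size]
        averaged.append(sum(group) / len(group))
    return averaged


def compress(waveform, keep_start, keep_stop, group_size):
    """Average frames, then truncate points outside the window of interest.

    `keep_start` and `keep_stop` are frame indices (assumed known) that
    bound the part of the waveform related to nucleotide incorporation.
    """
    averaged = average_frames(waveform, group_size)
    lo = keep_start // group_size
    hi = -(-keep_stop // group_size)  # ceiling division
    return averaged[lo:hi]


waveform = [0, 0, 1, 3, 5, 7, 2, 2]
print(compress(waveform, keep_start=2, keep_stop=6, group_size=2))  # [2.0, 6.0]
```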
Abstract:
A method for sequencing a nucleic acid template includes: (a) performing a first sequencing process including flowing nucleotides and/or reagents to the nucleic acid template according to a first predetermined ordering of nucleotides and/or reagents to obtain a first sequencing result; (b) after the first sequencing process, performing a second sequencing process including flowing nucleotides and/or reagents to the nucleic acid template according to a second predetermined ordering of nucleotides and/or reagents to obtain a second sequencing result, the second predetermined ordering of nucleotides and/or reagents being different from the first predetermined ordering of nucleotides and/or reagents and at least one of the first and second predetermined orderings of nucleotides and/or reagents being designed for repeat sequencing; and (c) determining a sequence of bases corresponding to at least a portion of the nucleic acid template using both the first sequencing result and the second sequencing result.
Abstract:
A method for nucleic acid sequencing may include disposing a plurality of template nucleic acid molecules in a plurality of defined spaces disposed on a sensor array, at least some of the plurality of template nucleic acid molecules having a sequencing primer and a polymerase operably bound therewith; advancing one or more nucleotide species over the plurality of template nucleic acid molecules with the sequencing primer and the polymerase operably bound therewith; measuring a signal generated by nucleotide incorporations resulting from advancing the one or more nucleotide species; and exposing the plurality of template nucleic acid molecules to a cleaving reagent subsequent to the advancing and measuring. The cleaving reagent can remove labeling reagents attached to the one or more nucleotide species. The advancing and measuring steps can be performed for different orders of the one or more nucleotide species prior to a subsequent exposing of the plurality of template nucleic acid molecules to the cleaving reagent.
Abstract:
A method for sequencing a polynucleotide sample having a barcode sequence includes: introducing a series of nucleotides to the polynucleotide sample according to a predetermined flow ordering; obtaining a series of signals resulting from the introducing of nucleotides to the polynucleotide sample; and resolving the series of signals over the barcode sequence to render a flowspace string, wherein the flowspace string is a codeword of an error-tolerant code capable of distinguishing the barcode sequence from other barcode sequences in the presence of one or more errors.
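A minimal sketch of the idea, under stated assumptions: the flowspace string is taken to be the vector of per-flow incorporation counts, errors are counted as Hamming distance between such vectors, and decoding picks the nearest codeword. The abstract does not specify the code or the distance metric; the barcodes, flow order, and function names here are illustrative only.

```python
def to_flowspace(bases, flow_order):
    """Per-flow incorporation counts for `bases` under `flow_order`."""
    pos = 0
    counts = []
    for nuc in flow_order:
        n = 0
        while pos < len(bases) and bases[pos] == nuc:
            n += 1
            pos += 1
        counts.append(n)
    return tuple(counts)


def hamming(a, b):
    """Number of flows at which two flowspace strings differ."""
    return sum(x != y for x, y in zip(a, b))


def decode(observed, codebook, max_errors):
    """Return the name of the nearest codeword, or None if too far away."""
    best = min(codebook, key=lambda name: hamming(observed, codebook[name]))
    return best if hamming(observed, codebook[best]) <= max_errors else None


flow_order = "TACGTACG"
codebook = {
    "bc1": to_flowspace("TTCG", flow_order),  # (2, 0, 1, 1, 0, 0, 0, 0)
    "bc2": to_flowspace("AGTA", flow_order),  # (0, 1, 0, 1, 1, 1, 0, 0)
}
# One miscalled flow (index 3 reads 2 instead of 1) still decodes to bc1.
print(decode((2, 0, 1, 2, 0, 0, 0, 0), codebook, max_errors=1))  # bc1
```

The two example codewords differ at five flows, so a single flow error cannot turn one into something closer to the other.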
Abstract:
A method for nucleic acid sequencing includes: receiving a signal comprising measurements of a parameter measured in response to a plurality of nucleotide flows flowed in a space comprising a sample nucleic acid; normalizing the signal to obtain a normalized signal; adaptively normalizing the normalized signal to obtain an adaptively normalized signal; and predicting a sequence of base calls corresponding to the sample nucleic acid using the adaptively normalized signal.
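One possible reading of the two normalization stages is sketched below. The abstract does not say how either stage is computed, so everything here is an assumption: the first stage scales by the mean signal of flows known to be single incorporations (for example, from a key sequence), and the adaptive stage re-estimates a local gain in fixed windows so that signal drift along the read is corrected. Function names and parameters are hypothetical.

```python
def key_normalize(signal, one_mer_flows):
    """Scale so that known single-incorporation flows average to 1.0."""
    scale = sum(signal[i] for i in one_mer_flows) / len(one_mer_flows)
    return [s / scale for s in signal]


def adaptive_normalize(normalized, window):
    """Re-scale each window by its own estimated gain.

    The local gain is the mean of value / nearest-integer-call over
    flows whose nearest call is at least 1 (assumed heuristic).
    """
    out = []
    for start in range(0, len(normalized), window):
        chunk = normalized[start:start + window]
        ratios = [v / int(v + 0.5) for v in chunk if int(v + 0.5) >= 1]
        gain = sum(ratios) / len(ratios) if ratios else 1.0
        out.extend(v / gain for v in chunk)
    return out


raw = [100, 0, 100, 200, 90, 180, 0, 90]  # signal droops in later flows
normalized = key_normalize(raw, one_mer_flows=[0, 2])
adapted = adaptive_normalize(normalized, window=4)
base_calls = [int(v + 0.5) for v in adapted]
print(base_calls)  # [1, 0, 1, 2, 1, 2, 0, 1]
```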
Abstract:
A method for evaluating variant likelihood includes: providing a plurality of template polynucleotide strands, sequencing primers, and polymerase in a plurality of defined spaces disposed on a sensor array; exposing the plurality of template polynucleotide strands, sequencing primers, and polymerase to a series of flows of nucleotide species according to a predetermined order; obtaining measured values corresponding to an ensemble of sequencing reads for at least some of the template polynucleotide strands in at least one of the defined spaces; and evaluating a likelihood that a variant sequence is present given the measured values corresponding to the ensemble of sequencing reads, the evaluating comprising: determining a measurement confidence value for each read in the ensemble of sequencing reads and modifying at least some model-predicted values using a first bias for forward strands and a second bias for reverse strands.
Abstract:
A compression method includes measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises at least one region associated with expected measured values and at least one region associated with unpredictable measured values; applying a first compression process to the waveform, the first compression process including an averaging of one or more frames in one or more portions of the waveform; applying a keyframe delta compression to the frame-averaged data; forming a compressed data structure including a keyframe and calculated differences subsequent to the keyframe; and storing one or more compressed data structures in memory.
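Keyframe delta compression can be illustrated with a short sketch. It assumes integer frame-averaged values and stores differences between consecutive frames; the abstract does not say whether differences are taken against the previous frame or against the keyframe itself, so that choice, along with the dictionary layout and function names, is an assumption.

```python
def delta_encode(frames):
    """Store the first frame as the keyframe plus consecutive differences."""
    if not frames:
        raise ValueError("need at least one frame")
    deltas = [later - earlier for earlier, later in zip(frames, frames[1:])]
    return {"keyframe": frames[0], "deltas": deltas}


def delta_decode(block):
    """Rebuild the frames by accumulating differences onto the keyframe."""
    frames = [block["keyframe"]]
    for delta in block["deltas"]:
        frames.append(frames[-1] + delta)
    return frames


frames = [1000, 1002, 1005, 1004, 1004]
block = delta_encode(frames)
print(block)  # {'keyframe': 1000, 'deltas': [2, 3, -1, 0]}
assert delta_decode(block) == frames
```

The differences are typically much smaller than the raw values, so they can be stored in fewer bits than the original frames.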
Abstract:
A system for nucleic acid sequencing is provided. The system comprises a sequencing device configured to expose a tagged polynucleotide comprising a combinatorial barcode sequence and a sample polynucleotide to sequential nucleotide flows, each flow comprising one species of nucleotide and the sequential flows being in a predetermined order based on the species of nucleotide, such that exposing the tagged polynucleotide to the sequential nucleotide flows causes incorporations of nucleotides from the sequential nucleotide flows into the tagged polynucleotide over the barcode sequence. The sequencing device is configured to detect a series of signals over the barcode sequence resulting from the incorporations, wherein the predetermined order of nucleotide flows comprises a repetition of a flow order motif that is based on a sequence motif. The system comprises a computing device configured to resolve the detected series of signals to determine the combinatorial barcode sequence.
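One way a flow order motif could be derived from a sequence motif is sketched below. It assumes the barcode motif is one base from a first group followed by one base from a second group, and that the flow order motif simply flows every base of the first group and then every base of the second group. Under that assumption each repetition of the flow motif resolves exactly one barcode motif. The groups `"AC"` and `"GT"`, the signal values, and the function names are illustrative and not taken from the abstract.

```python
def flow_order(group1, group2, iterations):
    """Repeat a flow motif made of all group1 bases, then all group2 bases."""
    return (group1 + group2) * iterations


def decode_barcode(signals, group1, group2):
    """Resolve one barcode motif per repetition of the flow motif.

    Within each repetition, the strongest signal among the group1 flows
    gives the first base and the strongest among the group2 flows gives
    the second base.
    """
    motif_len = len(group1) + len(group2)
    if len(signals) % motif_len != 0:
        raise ValueError("signal count must be a multiple of the motif length")
    barcode = []
    for start in range(0, len(signals), motif_len):
        cycle = signals[start:start + motif_len]
        first = cycle[:len(group1)]
        second = cycle[len(group1):]
        barcode.append(group1[first.index(max(first))])
        barcode.append(group2[second.index(max(second))])
    return "".join(barcode)


print(flow_order("AC", "GT", 2))  # ACGTACGT
signals = [0.9, 0.1, 1.1, 0.0, 0.05, 1.0, 0.1, 0.95]
print(decode_barcode(signals, "AC", "GT"))  # AGCT
```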