Abstract:
Method includes receiving sequencing data including a plurality of sample reads that have corresponding sequences of nucleotides and assigning the sample reads to designated loci. The method also includes analyzing the assigned reads for each designated locus to identify corresponding regions-of-interest (ROIs) within the assigned reads. Each of the ROIs has one or more series of repeat motifs. The method also includes sorting the assigned reads based on the sequences of the ROIs such that the ROIs with different sequences are assigned as different potential alleles. The method also includes analyzing, for designated loci having multiple potential alleles, the sequences of the potential alleles to determine whether a first allele of the potential alleles is suspected stutter product of a second allele of the potential alleles.
Abstract:
A computer-implemented method, computer system and computer-readable medium for identifying off-target matches from a set of candidate primer sequences on a genome reference sequence can include: receiving onto a data storage unit a plurality of candidate primer sequences; for each candidate primer sequence, calculating, using a processor, a plurality of candidate matches on the genome reference sequence for the candidate primer sequences; calculating, using the processor, verified matches on the genome reference sequence based on the candidate matching locations satisfying a plurality of matching verification rules; performing matching calculations of the verified matches to determine whether the verified matches form a match condition on the genome reference sequence; and generating a location profile on the genome reference sequence based on the match condition from the verified matches that meet a predetermined threshold.
Abstract:
A computer-implemented method, computer system and computer-readable medium for identifying off-target matches from a set of candidate primer sequences on a genome reference sequence can include: receiving onto a data storage unit a plurality of candidate primer sequences; for each candidate primer sequence, calculating, using a processor, a plurality of candidate matches on the genome reference sequence for the candidate primer sequences; calculating, using the processor, verified matches on the genome reference sequence based on the candidate matching locations satisfying a plurality of matching verification rules; performing matching calculations of the verified matches to determine whether the verified matches form a match condition on the genome reference sequence; and generating a location profile on the genome reference sequence based on the match condition from the verified matches that meet a predetermined threshold.
Abstract:
Methods and systems are provided for quantifying and deconvolving nucleic acid mixture samples including nucleic acid of one or more contributors having known or unknown genomes. The methods and systems provided herein implement processes that use Bayesian probabilistic modeling techniques to determine the abundances and confidence intervals of genetically distinct contributors in a chimerism sample, thereby improving specificity, accuracy and sensitivity, and greatly expanded the application scope over conventional methods.
Abstract:
Disclosed herein are systems and methods for multiplex primer design and selection. In one example, a system includes non-transitory memory configured to store executable instructions; and a hardware processor programmed by the executable instructions to receive a plurality of target gene sequences and determine a set of primers for each target gene sequence based on a penalty score associated with the set of primers, wherein the penalty score is based on a non-linear combination of a primer-level penalty score and a set-level penalty score.
Abstract:
Sample analyzer having a system controller that includes a plurality of modules. The plurality of modules include a first filter module that assigns sample reads to designated loci based on a sequence of nucleotides and an aligner module that analyzes the assigned reads to identify corresponding regions-of-interest (ROIs) within the assigned reads. A second filter module sorts the assigned reads based on the sequences of the ROIs such that the ROIs with different sequences are assigned as different potential alleles. Each potential allele has a sequence that is different from the sequences of other potential alleles within the designated locus. A stutter module analyzes the sequences of the potential alleles to determine whether a first allele of the potential alleles is suspected stutter product of a second allele of the potential alleles.