Abstract:
Systems and method for annotating variants within a genome can call variants from reads or receive called variants directly and associate the called variants with functional annotations and interpretive annotations. A summary report of the called variants, the associated functional annotations, and the associated interpretive annotations can be generated.
Abstract:
Systems and method for identifying variants associated with a genetic disease can include obtaining sequencing reads for a plurality of individuals for a list of variant positions. The reads can be compared to identify variants that are found in affected individuals and absent in non-affected individuals. Such variants can include loss of heterozygosity, trans-phased compound heterozygotes, increased frequency mitochondrial variants, homozygous recessive variants, de novo variants, sex-linked variants, and combinations thereof.
Abstract:
Systems and method for identifying variants associated with a genetic disease can include obtaining calls for a plurality of individuals for a list of variant positions. The calls can be compared to identify variants that are found in affected individuals and absent in non-affected individuals. Such variants can include loss of heterozygosity, trans-phased compound heterozygotes, increased frequency mitochondrial variants, homozygous recessive variants, de novo variants, sex-linked variants, and combinations thereof.
Abstract:
Systems and method for identifying variants associated with a genetic disease can include obtaining calls for a plurality of individuals for a list of variant positions. The calls can be compared to identify variants that are found in affected individuals and absent in non-affected individuals. Such variants can include loss of heterozygosity, trans-phased compound heterozygotes, increased frequency mitochondrial variants, homozygous recessive variants, de novo variants, sex-linked variants, and combinations thereof.
Abstract:
Systems and method for identifying somatic mutations can receive first and second sequence information, determine if a variant present in the first sequencing information is also present in the second sequence information, and identify variants present in the first sequence information are somatic mutations when the variant is either not present in the second sequence information or the presence of the variant in the second sequence information is likely due to a sequencing error.
Abstract:
Systems and method for identifying variants associated with a genetic disease can include obtaining calls for a plurality of individuals for a list of variant positions. The calls can be compared to identify variants that are found in affected individuals and absent in non-affected individuals. Such variants can include loss of heterozygosity, trans-phased compound heterozygotes, increased frequency mitochondrial variants, homozygous recessive variants, de novo variants, sex-linked variants, and combinations thereof.
Abstract:
Identification of exon junctions includes obtaining a first read sequence based on a detected plurality of signals of a first sequence. A list of exon prefix and suffix sequences are generated by identifying exons of the human genome with a prefix sequence mapping to a suffix sequence of the first read sequence and by identifying exons with a suffix sequence mapping to a prefix sequence of the first read sequence. A pair of exon sequences is selected, with a first exon sequence being one of the exon suffix sequences and a second exon sequence being one of the exon prefix sequences. Summing a number of sequence elements of the first exon sequence that overlap the prefix of the first read sequence, a number of sequence elements of the second exon sequence that overlap the suffix of the first read sequence, and a constant is used to identify a fusion junction.
Abstract:
Systems and method for identifying somatic mutations can receive first and second sequence information, determine if a variant present in the first sequencing information is also present in the second sequence information, and identify variants present in the first sequence information are somatic mutations when the variant is either not present in the second sequence information or the presence of the variant in the second sequence information is likely due to a sequencing error.
Abstract:
Disclosed are systems and methods for resequencing using color calls. A DNA sample is encoded and sequenced according to a multi-base code producing a string of read color calls for a fragment of the sample. A reference sequence is obtained. The string of read color calls is mapped to the reference sequence. A base sequence is extracted from the reference sequence. The base sequence is encoded as a string of reference color codes according to the multi-base code. The string of read color calls is aligned with the string of reference color codes and mismatches in the alignment are detected. One or more mismatches of the string of read color calls are annotated as inconsistent. The one or more inconsistent mismatches of the string of read color calls are corrected. The string of corrected read color calls is decoded to bases producing a read sequence.
Abstract:
Systems and methods are used to identify an exon junction from a single read of a transcript. A transcript sample is interrogated and a read sequence is produced using a nucleic acid sequencer. A first exon sequence and a second exon sequence are obtained using the processor. The first exon sequence is mapped to a prefix of the read sequence using the processor. The second exon sequence is mapped to a suffix of the read sequence using the processor. A sum of a number of sequence elements of the first exon sequence that overlap the prefix of the read sequence, of a number of sequence elements of the second exon sequence that overlap the suffix of the read sequence, and of a constant is calculated using the processor. If the sum equals a length of the read sequence, a junction is identified in the read using the processor.