摘要:
A method of associating a phenotype with one or more candidate chromosomal regions in a genome of an organism includes the step of deriving a phenotypic data structure that represents differences in phenotypes between different strains of the organism. Further, a genotypic data structure is established. The genotypic data structure corresponds to a locus selected from a plurality of loci in the genome of the organism. The genotypic data structure represents variations of at least one component of the locus between different strains of the organism. The phenotypic data structure is compared to the genotypic data structure to form a correlation value. The process of establishing a genotypic data structure and comparing it to the phenotypic data structure is repeated for each locus in the plurality of loci, thereby identifying one or more genotypic data structures that form a high correlation value relative to all other compared genotypic data structures. The loci that correspond to the one or more genotypic data structures having a high correlation value represent the one or more candidate chromosomal regions.
摘要:
Computer systems, computer program products and methods for processing an input nucleic acid chromatogram having a plurality of substantially simultaneous traces. Each trace in the plurality of traces has a plurality of datapoints. Each respective datapoint in each of the plurality of datapoints represents a signal amplitude at a position in the trace corresponding to the respective datapoint. A first peak is identified in a first trace that is substantially overlapping a second peak in a second trace in the plurality of traces. The first peak is outputted to a first output homozygous representation and the second peak is outputted to a second output homozygous representation. The first output homozygous representation is a first homozygous sequence representation or a first homozygous nucleic acid chromatogram corresponding to the input nucleic acid chromatogram. The second output homozygous representation is a second homozygous sequence representation or second homozygous nucleic acid chromatogram of the input nucleic acid chromatogram.