摘要:
The present invention relates to the field of genomic mutation detection, and in particular, to the detection of the copy number variation (CNV) in cellular chromosomal DNA fragments. The present invention also relates to the detection of diseases related to the copy number variation in the cellular chromosomal DNA fragments.
摘要:
Provided are a method, system, and computer-readable medium for determining whether a copy number variation exists in a sample genome. The method includes sequencing a sample genome to obtain a sequencing result formed by multiple reads; comparing the sequencing result with a reference genome sequence to determine the distribution of the reads on the reference genome sequence; determining, based on the distribution of the reads on the reference genome sequence, multiple breakpoints on the reference genome sequence, wherein the number of the reads on either side of each breakpoint are significantly different; determining, based on the plurality of breakpoints, a detection window on the reference genome; determining, based on the reads falling in the detection window, a parameter; and determining, based on the difference between the first parameter and a preset threshold, whether a copy number variation exists in the sample genome against the detection window.
摘要:
Disclosed are a method and a system for determining genome copy number variation, which relates to the technical field of bioinformatics. The method comprises obtaining reads; determining sequence labels according to the reads; counting the number of sequence labels falling into each window; performing GC correction on the sequence label number of each window and a correction according to an expected sequence label number adjusted by a control set to obtain a corrected sequence label number; selecting a demarcation point with a small significance value as a candidate CNV breaking point; rejecting the least significant candidate CNV breaking point at every turn, updating difference significance values of two candidate CNV breaking points on the left and right of the rejected candidate CNV breaking point and performing cyclic iteration until difference significance values of all candidate CNV breaking points are smaller than a termination threshold value, thereby determining a CNV breaking point. The method and the system the present invention have clinical feasibility, and can precisely detect a micro-deletion/micro-duplication area of 0.5 M under the situation of using data of about 50 M.
摘要:
The present invention relates to a method for detecting genetic variation, comprising the following steps: acquiring reads from a test sample; aligning said reads with a reference genome sequence; dividing said reference genome sequence into windows, calculating the number of said reads which are aligned to each window, and acquiring the statistic for each window on the basis of the number of said reads; and for a fragment of the reference genome sequence, acquiring the genetic variation sites on the basis of the change in the statistics of all the windows thereon in the fragment of the reference genome sequence.
摘要:
Provided are a method, system and computer readable medium for determining the base information in a predetermined area of a fetus genome, the method comprising following steps: constructing a sequence library for the DNA samples of the fetus genome; sequencing the sequence library to obtain the sequencing result of the fetus, the sequencing result of the fetus comprised of a plurality of sequencing data; and based on the sequencing result of the fetus, determining the base information in the predetermined area according to the hidden Markov model in conjunction with the genetic information of an individual related hereditarily to the fetus.
摘要:
Disclosed is a method for determining the chromosome aneuploidy of a single cell and a system for determining the chromosome aneuploidy of a single cell. Among them, the method for determining the chromosome aneuploidy of a single cell according to the embodiments of the present invention comprises: the whole genome of the single cell is sequenced to obtain a first sequencing result; the total number of sequencing data from the first sequencing result is counted, obtaining a value L; the number of sequencing data of a first chromosome from the first sequencing result is counted, obtaining a value M; a first parameter is determined based on the value L and the value M; and it is determined whether or not the single cell has aneuploidy in respect of the first chromosome based on the difference between the first parameter and a predetermined control parameter.