摘要:
Provided are a method and a system for identifying whether the twins are dizygotic twins, the method comprising: typing at least one polymorphic loci of the twins fetuses to obtain the fetal polymorphism types, comparing the fetal polymorphism types with the corresponding polymorphism types of their parents, determining whether the twins are dizygotic twins on the basis of the comparison result.
摘要:
Provided are a method, system and computer readable medium for determining the base information in a predetermined area of a fetus genome, the method comprising following steps: constructing a sequence library for the DNA samples of the fetus genome; sequencing the sequence library to obtain the sequencing result of the fetus, the sequencing result of the fetus comprised of a plurality of sequencing data; and based on the sequencing result of the fetus, determining the base information in the predetermined area according to the hidden Markov model in conjunction with the genetic information of an individual related hereditarily to the fetus.
摘要:
Disclosed are a method of detecting a pre-determined event in a nucleic acid sample and a system thereof. The method of detecting the pre-determined event in the nucleic acid sample comprises the following steps: constructing a sequencing-library for the nucleic acid sample; sequencing the sequencing-library to obtain a sequencing result consisting of a plurality of sequencing data; determining the sequencing data from a pre-determined region; and determining an occurrence of the pre-determined event in the nucleic acid sample based on a composition of the sequencing data from the pre-determined region.
摘要:
Disclosed are a method and a system for determining genome copy number variation, which relates to the technical field of bioinformatics. The method comprises obtaining reads; determining sequence labels according to the reads; counting the number of sequence labels falling into each window; performing GC correction on the sequence label number of each window and a correction according to an expected sequence label number adjusted by a control set to obtain a corrected sequence label number; selecting a demarcation point with a small significance value as a candidate CNV breaking point; rejecting the least significant candidate CNV breaking point at every turn, updating difference significance values of two candidate CNV breaking points on the left and right of the rejected candidate CNV breaking point and performing cyclic iteration until difference significance values of all candidate CNV breaking points are smaller than a termination threshold value, thereby determining a CNV breaking point. The method and the system the present invention have clinical feasibility, and can precisely detect a micro-deletion/micro-duplication area of 0.5 M under the situation of using data of about 50 M.
摘要:
The present invention provides a method and system for determining whether a genomic abnormality exists. The method for determining whether a genomic abnormality exists includes the steps of: separating fetal nucleated red blood cells from a sample from a pregnant woman; sequencing at least a part of the genome of the nucleated red blood cells, so as to obtain a sequencing result; and on the basis of the sequencing result, determining whether a genomic abnormality exists in the nucleated red blood cells.
摘要:
The present invention relates to a method for detecting genetic variation, comprising the following steps: acquiring reads from a test sample; aligning said reads with a reference genome sequence; dividing said reference genome sequence into windows, calculating the number of said reads which are aligned to each window, and acquiring the statistic for each window on the basis of the number of said reads; and for a fragment of the reference genome sequence, acquiring the genetic variation sites on the basis of the change in the statistics of all the windows thereon in the fragment of the reference genome sequence.
摘要:
The present invention relates to a method for quantifying the relative content of a protein in a sample. The present invention also relates to a method for comprising the relative content of a protein in at least two samples.
摘要:
The present invention provides an error correcting method of test sequence, which involves receiving test sequences, configuring high frequency short string list based on a preset high frequency threshold value, traversing each received test sequence, searching an area with the largest number of continuous high frequency short strings on each test sequence in combination with high frequency short string list, configuring whole left sequence and/or right sequence of high frequency short strings at left side and/or right side of searched area according to corresponding received test sequence and high frequency short string list, and constituting corresponding test sequence according to configured left and/or right sequence and searched area. The present invention also provides corresponding error correcting system of test sequence and gene assembly equipment.
摘要:
A method and system to determine biomarkers related to abnormal condition in a subject are provided, comprising:sequencing nucleic acid samples from a first and a second subject in order to obtain multiple sequences respectively consisting of the first and the second sequencing results, wherein the first subject is in the abnormal condition; and the second subject is not in the abnormal condition; and the nucleic acid samples from the first and the second subject are both isolated from the samples of the same type; and the first and the second subject belong to the same species; and determining the biomarkers related to the abnormal condition in the subject based on the difference between the first and the second sequencing results.
摘要:
Provided is a method of detecting method of detecting fusion transcripts in a sample to be analyzed. The method may comprises: subjecting the sample to be analyzed containing a RNA transcriptome to paired-end sequencing, to obtain paired-end RNA-Seq data of the sample to be analyzed; aligning the paired-end RNA-Seq data to a human reference genome sequence, to obtain first paired-end mapped reads, first single-end mapped reads, and first unmapped reads; evaluating an insertsize between two ends of the paired-end mapped reads by means of the first paired-end mapped reads, to obtain a proportion of paired-end mapped reads with overlapped 3′-ends; aligning the first unmapped reads to annotated transcripts, to obtain second single-end mapped reads and second unmapped reads; aligning the second unmapped reads to the annotated transcripts, to filter out unmapped reads caused by indel and obtain third unmapped reads; merging all single-end mapped reads, to obtain a set of single-end mapped reads; obtaining a gene pair linked by a cross-read as a primary set of candidate gene pairs based on the set of single-end mapped reads and combining with a relationship of the mapped paired-end reads; subjecting the primary set of candidate gene pairs to a filtration, to obtain a candidate set of fused gene pairs; bisecting the third unmapped read, to obtain a half-unmapped read; aligning the half-unmapped read to a gene-junction sequence in the candidate set of fused gene pairs, to obtain a potent region of a fused junction site in the gene in which the half-unmap read locates; outputting original reads of mapped half-unmapped reads, to obtain useful unmapped reads; subjecting the candidate set of fused gene pairs to a fusion simulation; aligning the useful unmapped reads to a junction library, to obtain a fused gene supported by the useful unmapped reads; calculating and gathering the fused sequence supported by the useful unmapped reads, to obtain information of the fused gene. And a system for detecting fusion transcripts is also provided.