TRANSCRIPTOME ASSEMBLY METHOD AND SYSTEM
    Type: Invention application
    Status: Under examination (published)

    Publication No.: US20150120204A1

    Publication date: 2015-04-30

    Application No.: US14394135

    Filing date: 2012-04-13

    IPC classification: G06F19/22 G06F19/20

    CPC classification: G16B30/00 G16B25/00

    Abstract: Provided is a transcriptome assembly method comprising the following steps: constructing a de Bruijn graph from the transcriptome reads of a sequenced sample; filtering and linearizing the de Bruijn graph to form contiguous sequences (contigs); obtaining the associations among the contigs and filtering the association data; linearizing unbranched contiguous sequences; outputting the contig sequences; aligning the reads and paired-end reads against the output contig sequences to obtain the mapping information between reads and contigs; establishing connections among the contigs to construct a graph with the contigs as nodes and the connections as edges; pre-processing and partitioning that graph to obtain independent sub-graphs; and outputting transcripts according to the sub-graphs. Further provided is a transcriptome assembly system based on the method.
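
The first two steps of the abstract (building a de Bruijn graph from reads, then linearizing unbranched paths into contigs) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `build_de_bruijn` and `compact_contigs`, the toy reads, and the choice of k = 4 are assumptions made for illustration, and the filtering, paired-end alignment, and sub-graph partitioning steps are omitted.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer node to the set of (k-1)-mer nodes that follow it."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])  # edge: prefix -> suffix
    return graph


def compact_contigs(graph):
    """Merge unbranched paths into contig strings (hypothetical helper).

    Simplification: cycles made only of linear nodes are not reported.
    """
    indeg = defaultdict(int)
    for succs in graph.values():
        for s in succs:
            indeg[s] += 1
    nodes = set(graph) | set(indeg)

    def is_linear(n):
        # Exactly one way in and one way out: safe to merge through.
        return indeg[n] == 1 and len(graph.get(n, ())) == 1

    contigs = []
    for node in sorted(nodes):
        if is_linear(node):
            continue  # contigs start only at branch points or path ends
        for succ in sorted(graph.get(node, ())):
            contig = node + succ[-1]
            cur = succ
            while is_linear(cur):
                cur = next(iter(graph[cur]))
                contig += cur[-1]
            contigs.append(contig)
    return contigs


# Two overlapping toy reads collapse into a single contig.
reads = ["ATGGCGT", "GGCGTAC"]
print(compact_contigs(build_de_bruijn(reads, k=4)))
```

When two reads diverge after a shared prefix, the graph branches and the sketch emits one contig per unbranched segment. Those segments are the pieces that the later steps of the abstract connect into a contig graph.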
