SYSTEMS AND METHODS FOR DETERMINING STRUCTURAL VARIATION AND PHASING USING VARIANT CALL DATA
    1.
    发明申请
    SYSTEMS AND METHODS FOR DETERMINING STRUCTURAL VARIATION AND PHASING USING VARIANT CALL DATA 审中-公开
    使用变量呼叫数据确定结构变化和相位的系统和方法

    公开(公告)号:WO2016130578A1

    公开(公告)日:2016-08-18

    申请号:PCT/US2016/017196

    申请日:2016-02-09

    IPC分类号: C12Q1/68

    摘要: Systems and methods for determining structural variation and phasing using variant call data obtained from nucleic acid of a biological sample are provided. Sequence reads are obtained, each comprising a portion corresponding to a subset of the test nucleic acid and a portion encoding a barcode independent of the sequencing data. Bin information is obtained. Each bin represents a different portion of the sample nucleic acid. Each bin corresponds to a set of sequence reads in a plurality of sets of sequence reads formed from the sequence reads such that each sequence read in a respective set of sequence reads corresponds to a subset of the nucleic acid represented by the bin corresponding to the respective set. Binomial tests identify bin pairs having more sequence reads with the same barcode in common than expected by chance. Probabilistic models determine structural variation likelihood from the sequence reads of these bin pairs.

    摘要翻译: 提供了使用从生物样品的核酸获得的变体调用数据来确定结构变化和定相的系统和方法。 获得序列读数,每个包含对应于测试核酸子集的部分和编码独立于测序数据的条形码的部分。 获取Bin信息。 每个箱表示样品核酸的不同部分。 每个仓对应于从序列读取形成的多组序列读取中的一组序列读取,使得在相应的序列读取集合中读取的每个序列对应于由对应于相应的块的bin所代表的核酸的子集 组。 二项式测试识别具有更多序列读取的bin对,其具有与预期相同的条件相同的条形码。 概率模型根据这些箱对的序列读数确定结构变异似然。

    METHODS FOR TARGETED NUCLEIC ACID SEQUENCE COVERAGE
    4.
    发明申请
    METHODS FOR TARGETED NUCLEIC ACID SEQUENCE COVERAGE 审中-公开
    靶向核酸序列覆盖的方法

    公开(公告)号:WO2016138148A1

    公开(公告)日:2016-09-01

    申请号:PCT/US2016/019382

    申请日:2016-02-24

    IPC分类号: C12Q1/68

    摘要: The present invention is directed to methods, compositions and systems for analyzing sequence information from targeted regions of a genome. Such targeted regions may include regions of the genome that are poorly characterized, highly polymorphic, or divergent from reference genome sequences.

    摘要翻译: 本发明涉及从基因组的目标区域分析序列信息的方法,组合物和系统。 这样的靶向区域可以包括基因组区域,其特征不清楚,高度多态性或与参考基因组序列不同。