DATA ANALYSIS OF DNA SEQUENCES
    7.
    发明申请
    DATA ANALYSIS OF DNA SEQUENCES 审中-公开
    DNA序列的数据分析

    公开(公告)号:US20120173153A1

    公开(公告)日:2012-07-05

    申请号:US13332242

    申请日:2011-12-20

    IPC分类号: G06F19/22

    CPC分类号: G16B30/00

    摘要: Systems and methods for data analysis are provided. In one embodiment, a method may be provided for analysis comprising electronically receiving sequence data related to a plurality of sequences and a reference sequence, associating the sequence data with one of at least two groups, identifying a plurality of high quality read sequences from among the plurality of sequences, extracting a plurality of unique read sequences from the plurality of high quality read sequences, and aligning the plurality of unique read sequences against the reference sequence data corresponding to a reference sample. The method may further identify mutations in a targeted location, display the targeted mutations, and prioritize the technologies that caused the mutations according to their efficiency. In one example, the systems and methods are used to characterize the activity of several ZFN candidates.

    摘要翻译: 提供了数据分析的系统和方法。 在一个实施例中,可以提供一种用于分析的方法,包括电子地接收与多个序列相关的序列数据和参考序列,将序列数据与至少两个组中的一个相关联,从多个序列中识别多个高质量读取序列 多个序列,从所述多个高质量读取序列提取多个唯一读取序列,以及将所述多个唯一读取序列与对应于参考样本的参考序列数据对准。 该方法可以进一步鉴定目标位置中的突变,显示目标突变,并根据其效率对引起突变的技术进行优先化。 在一个示例中,系统和方法用于表征若干ZFN候选者的活动。