METHODS AND SYSTEMS FOR DATA ANALYSIS
    2.
    发明申请
    METHODS AND SYSTEMS FOR DATA ANALYSIS 有权
    数据分析的方法和系统

    公开(公告)号:US20120330567A1

    公开(公告)日:2012-12-27

    申请号:US13459968

    申请日:2012-04-30

    IPC分类号: G06F19/20

    CPC分类号: G06F19/22

    摘要: The present disclosure provides computer implemented methods and systems for analyzing datasets, such as large data sets output from nucleic acid sequenceing technologies. In particular, the present disclosure provides for data analysis comprising computing the BWT of a collection of strings in an incremental, character by character, manner. The present disclosure also provides compression boosting strategies resulting in a BWT of a reordered collection of data that is more compressible by second stage compression methods compared to non-reordered computational analysis.

    摘要翻译: 本公开提供了用于分析数据集的计算机实现的方法和系统,例如从核酸序列技术输出的大数据集。 特别地,本公开提供了数据分析,包括以字符逐个递增的方式计算字符串集合的BWT。 本公开还提供压缩增强策略,导致与非重新排序的计算分析相比,通过第二阶段压缩方法可压缩数据的重新排序的数据的BWT。

    Methods and systems for data analysis using the Burrows Wheeler transform
    3.
    发明授权
    Methods and systems for data analysis using the Burrows Wheeler transform 有权
    使用Burrows Wheeler变换进行数据分析的方法和系统

    公开(公告)号:US08798936B2

    公开(公告)日:2014-08-05

    申请号:US13459968

    申请日:2012-04-30

    IPC分类号: G01N33/48 G06F5/00

    CPC分类号: G06F19/22

    摘要: The present disclosure provides computer implemented methods and systems for analyzing datasets, such as large data sets output from nucleic acid sequencing technologies. In particular, the present disclosure provides for data analysis comprising computing the BWT of a collection of strings in an incremental, character by character, manner. The present disclosure also provides compression boosting strategies resulting in a BWT of a reordered collection of data that is more compressible by second stage compression methods compared to non-reordered computational analysis.

    摘要翻译: 本公开提供了用于分析数据集的计算机实现的方法和系统,例如从核酸测序技术输出的大数据集。 特别地,本公开提供了数据分析,包括以字符逐个递增的方式计算字符串集合的BWT。 本公开还提供压缩增强策略,导致与非重新排序的计算分析相比,通过第二阶段压缩方法可压缩数据的重新排序的数据的BWT。