IDENTIFICATION OF MARKER FEATURES IN MULTI-DIMENSIONAL DATA

    公开(公告)号:EP3582147A1

    公开(公告)日:2019-12-18

    申请号:EP19190291.5

    申请日:2013-02-27

    IPC分类号: G06K9/62 G06K9/00 G06T11/20

    摘要: Method(s) and system(s) for identifying marker features of various subsets of a multidimensional data are provided. Each subset includes various data points associated with various features. Each of the data points are defined by feature values corresponding to the associated features. The method includes identifying feature pairs based on a matrix of the data points and the features, and computing correlation distances between features in each of the feature pairs. The method includes generating a non-linear pattern of the plurality of features in a two-dimensional plane. Additionally, the method includes calculating a threshold feature value for the associated features of the data points of a particular subset and representing the threshold feature value as a threshold non-linear pattern in the two-dimensional plane. The method includes determining the marker features based on a relative position of the features with respect to the threshold feature value in the two-dimensional plane.

    Identification of ribosomal DNA sequences
    5.
    发明公开
    Identification of ribosomal DNA sequences 有权
    鉴定核糖体DNA DNA序列

    公开(公告)号:EP2390811A1

    公开(公告)日:2011-11-30

    申请号:EP11167497.4

    申请日:2011-05-25

    IPC分类号: G06F19/24

    CPC分类号: G06F19/24 G06F19/22

    摘要: Method(s) for identifying rDNA sequences from a sample containing plurality of unknown DNA sequences are described herein. The method includes selecting one or more target clusters, from a plurality of reference clusters (165), corresponding to the query sequence. The target clusters are selected based on a composition based analysis. A proportion of probable rDNA clusters from the target clusters is identified. Based on the proportion of the probable rDNA clusters, the query sequence is identified as an rDNA.

    摘要翻译: 本文描述了用于从含有多个未知DNA序列的样品鉴定rDNA序列的方法。 该方法包括从与查询序列相对应的多个参考簇(165)中选择一个或多个目标簇。 基于基于组合的分析来选择目标簇。 确定来自目标群体的可能rDNA簇的一部分。 基于可能的rDNA簇的比例,查询序列被鉴定为rDNA。

    Identification of marker features in multi-dimensional data
    6.
    发明公开
    Identification of marker features in multi-dimensional data 审中-公开
    在多维数据标记属性的识别

    公开(公告)号:EP2674895A3

    公开(公告)日:2016-12-07

    申请号:EP13157001.2

    申请日:2013-02-27

    摘要: Method(s) and system(s) for identifying marker features of various subsets of a multidimensional data are provided. Each subset includes various data points associated with various features. Each of the data points are defined by feature values corresponding to the associated features. The method includes identifying feature pairs based on a matrix of the data points and the features, and computing correlation distances between features in each of the feature pairs. The method includes generating a non-linear pattern of the plurality of features in a two-dimensional plane. Additionally, the method includes calculating a threshold feature value for the associated features of the data points of a particular subset and representing the threshold feature value as a threshold non-linear pattern in the two-dimensional plane. The method includes determining the marker features based on a relative position of the features with respect to the threshold feature value in the two-dimensional plane.

    Identification of marker features in multi-dimensional data
    7.
    发明公开
    Identification of marker features in multi-dimensional data 审中-公开
    识别on on on en。。。。。。。。。。

    公开(公告)号:EP2674895A2

    公开(公告)日:2013-12-18

    申请号:EP13157001.2

    申请日:2013-02-27

    IPC分类号: G06K9/62

    摘要: Method(s) and system(s) for identifying marker features of various subsets of a multidimensional data are provided. Each subset includes various data points associated with various features. Each of the data points are defined by feature values corresponding to the associated features. The method includes identifying feature pairs based on a matrix of the data points and the features, and computing correlation distances between features in each of the feature pairs. The method includes generating a non-linear pattern of the plurality of features in a two-dimensional plane. Additionally, the method includes calculating a threshold feature value for the associated features of the data points of a particular subset and representing the threshold feature value as a threshold non-linear pattern in the two-dimensional plane. The method includes determining the marker features based on a relative position of the features with respect to the threshold feature value in the two-dimensional plane.

    摘要翻译: 提供了用于识别多维数据的各种子集的标记特征的方法和系统。 每个子集包括与各种特征相关联的各种数据点。 每个数据点由对应于相关联特征的特征值定义。 该方法包括基于数据点和特征的矩阵来识别特征对,以及计算每个特征对中的特征之间的相关距离。 该方法包括在二维平面中生成多个特征的非线性图案。 另外,该方法包括计算特定子集的数据点的相关联特征的阈值特征值,并将阈值特征值表示为二维平面中的阈值非线性模式。 该方法包括基于特征相对于二维平面中的阈值特征值的相对位置来确定标记特征。

    Taxonomic classification of metagenomic sequences
    8.
    发明公开
    Taxonomic classification of metagenomic sequences 有权
    分子生物学分类

    公开(公告)号:EP2390810A2

    公开(公告)日:2011-11-30

    申请号:EP11167492.5

    申请日:2011-05-25

    IPC分类号: G06F19/24

    摘要: Method(s) for identifying a taxon corresponding to a query sequence are described herein. The method includes selecting a target cluster, from amongst a plurality of reference clusters (165), corresponding to the query sequence. The target cluster may be selected based on a composition based analysis. A similarity based analysis of the query sequence is performed with respect to the target cluster. From the target cluster, the taxon corresponding to the query sequence is identified based on the similarity based analysis.

    摘要翻译: 本文描述了用于识别与查询序列相对应的分类群的方法。 该方法包括从多个参考集群(165)中选择对应于该查询序列的目标集群。 可以基于基于组合的分析来选择目标簇。 针对目标簇执行查询序列的基于相似度的分析。 从目标群集中,基于相似性分析来识别与查询序列相对应的分类群。