POPULATION CLASSIFICATION OF GENETIC DATA SET USING TREE BASED SPATIAL DATA STRUCTURE
    1.
    发明申请
    POPULATION CLASSIFICATION OF GENETIC DATA SET USING TREE BASED SPATIAL DATA STRUCTURE 审中-公开
    使用基于树的空间数据结构对遗传数据进行群体分类

    公开(公告)号:US20150186596A1

    公开(公告)日:2015-07-02

    申请号:US14416647

    申请日:2013-08-07

    CPC classification number: G16B20/00 G16B40/00

    Abstract: Reference feature vectors are constructed representing refer-ence genetic data sets of a reference population. The reference feature vec-tors are transformed using a linear transformation to generate reduced di-mensionality vector representations of the reference genetic data sets of the reference population. A tree-based spatial data structure is constructed to index the reference genetic data sets as data points defined by at least some dimensions of the reduced dimensionality vector representations of the ref-erence genetic data sets of the reference population. The linear transform may be generated by performing feature reduction on the reference feature vectors. A feature vector representing a proband genetic data set is trans-formed using the linear transformation to generate a reduced-dimensional-ity vector representation that is located in the tree-based spatial data struc-ture to perform population assignment for the proband genetic data set.

    Abstract translation: 参考特征向量被构建为参考群体的参考遗传数据集。 参考特征矢量使用线性变换来变换,以生成参考群的参考遗传数据集的减少的二维向量表示。 构建基于树的空间数据结构以将参考遗传数据集作为由参考群体的参考遗传数据集的缩减维度向量表示的至少一些维度定义的数据点进行索引。 可以通过对参考特征向量执行特征缩减来生成线性变换。 表示概率遗传数据集的特征向量使用线性变换来转换,以产生位于基于树的空间数据结构中的简化维度向量表示,以对先验遗传数据集执行群体分配 。

Patent Agency Ranking