Patent application
- Patent title: Identification of Co-Regulation Patterns By Unsupervised Cluster Analysis of Gene Expression Data
- Patent title (Chinese): 通过基因表达数据的无监督聚类分析鉴定协调模式
- Application number: US13019585
- Filing date: 2011-02-02
- Publication number: US20110125683A1
- Publication date: 2011-05-26
- Inventors: Asa Ben Hur, André Elisseeff, Isabelle Guyon
- Applicants: Asa Ben Hur, André Elisseeff, Isabelle Guyon
- Applicant address: US GA Savannah
- Assignee: HEALTH DISCOVERY CORPORATION
- Current assignee: HEALTH DISCOVERY CORPORATION
- Current assignee address: US GA Savannah
- Main classification: G06N3/12
- IPC classification: G06N3/12
Abstract:
A method is provided for unsupervised clustering of gene expression data to identify co-regulation patterns. A clustering algorithm randomly divides the data into k different subsets and measures the similarity between pairs of datapoints within the subsets, assigning a score to the pairs based on similarity, with the greatest similarity giving the highest correlation score. A distribution of the scores is plotted for each k. The highest value of k that has a distribution that remains concentrated near the highest correlation score corresponds to the number of co-regulation patterns.
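The abstract is condensed, so the following is only a minimal sketch of one plausible reading of it, not the patented procedure itself. It assumes that, for each candidate k, pairs of random subsamples are clustered into k groups and scored by how consistently pairs of shared datapoints are grouped together. The average-linkage clustering, the Jaccard score, the 0.9 threshold, and all function names (`average_linkage`, `jaccard`, `stability_scores`, `choose_k`) are illustrative assumptions that do not appear in the record.

```python
import math
import random
from itertools import combinations


def average_linkage(points, k):
    """Naive agglomerative clustering (assumed choice); returns one label per point."""
    clusters = [[i] for i in range(len(points))]

    def linkage(pair):
        a, b = clusters[pair[0]], clusters[pair[1]]
        total = sum(math.dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > k:
        # Merge the two clusters with the smallest average pairwise distance.
        a, b = min(combinations(range(len(clusters)), 2), key=linkage)
        clusters[a] += clusters.pop(b)  # a < b, so index a stays valid
    labels = [0] * len(points)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


def jaccard(labels_a, labels_b):
    """Agreement of two labelings (dicts index -> label) on their shared points."""
    shared = sorted(set(labels_a) & set(labels_b))
    both = either = 0
    for i, j in combinations(shared, 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        both += same_a and same_b
        either += same_a or same_b
    return both / either if either else 1.0


def stability_scores(points, k_values, runs=10, frac=0.8, seed=0):
    """For each k, collect similarity scores over pairs of random subsamples."""
    rng = random.Random(seed)
    n = len(points)
    m = int(frac * n)
    scores = {k: [] for k in k_values}
    for k in k_values:
        for _ in range(runs):
            labelings = []
            for _ in range(2):
                idx = rng.sample(range(n), m)
                labels = average_linkage([points[i] for i in idx], k)
                labelings.append(dict(zip(idx, labels)))
            scores[k].append(jaccard(*labelings))
    return scores


def choose_k(scores, threshold=0.9):
    """Largest k whose mean score stays above an assumed threshold, else None."""
    stable = [k for k, vals in scores.items() if sum(vals) / len(vals) >= threshold]
    return max(stable) if stable else None


# Toy data standing in for expression profiles: two well-separated groups.
demo_rng = random.Random(1)
demo_points = [
    (cx + demo_rng.uniform(-1, 1), cy + demo_rng.uniform(-1, 1))
    for cx, cy in [(0, 0), (10, 10)]
    for _ in range(15)
]
demo_scores = stability_scores(demo_points, [2, 3, 4])
for k, vals in demo_scores.items():
    print(k, round(sum(vals) / len(vals), 3))
print("chosen k:", choose_k(demo_scores))
```

In place of the plotted distribution described in the abstract, the sketch summarizes each k by its mean score. A histogram of `demo_scores[k]` per k would be closer to the described inspection, and the threshold rule is only a stand-in for judging whether a distribution "remains concentrated" near the top score.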