Invention patent application
- Patent title: PARALLELIZATION OF SYNTHETIC EVENTS WITH GENETIC SURPRISAL DATA REPRESENTING A GENETIC SEQUENCE OF AN ORGANISM
- Patent title (Chinese): 合成事件的并行与遗传数据表示有机体的遗传序列
- Application number: US13562714
- Filing date: 2012-07-31
- Publication number: US20130254202A1
- Publication date: 2013-09-26
- Inventors: Robert R. Friedlander, James R. Kraemer
- Applicants: Robert R. Friedlander, James R. Kraemer
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
A method, system, and computer program product for parallelization of updating synthetic events with genetic surprisal data comprising dividing the synthetic event into cohort parts and assigning the cohort parts to one of a plurality of computer processing elements. Within each processing element: searching data records of patients for genetic surprisal data; generating a cluster comprising a centroid by populating the cluster based on all of the matches of the data records; calculating a new centroid for each cluster; calculating a Euclidean distance in multiple dimensions for each match of data records to the new centroid for each cluster; reassigning each match of data to the new centroid of each cluster based on the shortest calculated Euclidean distance to the new centroid for each cluster; and determining at least one cohort part from the clusters and recombining the cohort parts into updated synthetic events based on the metadata.
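The per-processing-element steps in the abstract (populate clusters, recompute centroids, measure Euclidean distance, reassign to the nearest centroid) resemble a k-means style loop, and the following is only a minimal sketch of that reading, not the claimed implementation. It assumes each matched data record has already been reduced to a tuple of numbers; the abstract does not say how genetic surprisal data is encoded, so that encoding is left out. The names `cluster_cohort_part` and `update_synthetic_event`, the label-keyed dictionary standing in for the metadata, and the use of threads as "processing elements" are all hypothetical.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def cluster_cohort_part(records, initial_centroids, max_iters=100):
    """Cluster one cohort part (hypothetical helper).

    records: list of numeric tuples, assumed to be the matched data records.
    Returns (centroids, assignment), where assignment[i] is the index of the
    cluster that records[i] belongs to.
    """
    centroids = [tuple(c) for c in initial_centroids]
    assignment = None
    for _ in range(max_iters):
        # Reassign each record to the centroid at the shortest distance.
        new_assignment = [
            min(range(len(centroids)), key=lambda k: euclidean(r, centroids[k]))
            for r in records
        ]
        if new_assignment == assignment:
            break  # no record changed cluster, so the loop has converged
        assignment = new_assignment
        # Calculate a new centroid for each non-empty cluster.
        for k in range(len(centroids)):
            members = [r for r, a in zip(records, assignment) if a == k]
            if members:
                centroids[k] = mean_point(members)
    return centroids, assignment


def update_synthetic_event(parts, initial_centroids, workers=2):
    """Cluster every cohort part concurrently and recombine by label.

    parts: dict mapping a metadata label (assumed) to that part's records.
    Threads stand in for the processing elements in this sketch.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {
            label: pool.submit(cluster_cohort_part, recs, initial_centroids)
            for label, recs in parts.items()
        }
        return {label: f.result() for label, f in futures.items()}
```

Keying the results by label is one simple way to model the recombination "based on the metadata"; a real system would presumably carry richer metadata and use separate processes or machines rather than threads.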