-
公开(公告)号:US11348693B2
公开(公告)日:2022-05-31
申请号:US16378265
申请日:2019-04-08
发明人: Thomas Joseph , Aditya Rao , Naveen Sivadasan , Saipradeep Govindakrishnan Vangala , Sujatha Kotte , Rajgopal Srinivasan
IPC分类号: G16H70/60 , G16H10/20 , G06F16/28 , G06F16/90 , G16B45/00 , G16B35/10 , G16B50/10 , G06N5/04 , G06F16/901
摘要: This disclosure relates generally to method and system for graph convolution based gene prioritization on heterogeneous networks. The method includes obtaining a set of entities for human rare diseases from one or more sources containing rare diseases, genes, phenotypes for rare diseases and biological pathways and constructing an initial heterogeneous network using each of an entity from the set of entities. the initial heterogeneous network applying Graph Convolution-based Association Scoring (GCAS) to the initial heterogeneous network to derive inferred associations and creating a Heterogeneous Association Network for Rare Diseases (HANRD) by adding the inferred associations to the initial heterogeneous network and generating a prioritized set of genes for an input query being received in the HANRD.
-
公开(公告)号:US20210125690A1
公开(公告)日:2021-04-29
申请号:US17026353
申请日:2020-09-21
发明人: Thomas Joseph , Aditya Ramkrishna Rao , Saipradeep Vangala Govindakrishnan , Naveen Sivadasan , Uma Sunderam , Sujatha Kotte , Rajgopal Srinivasan
摘要: Diagnosis of rare human diseases using DNA sequencing is a fast growing area of research. Conventional methods carries a risk of incorrect phenotype interpretation. However, obtaining a correct genotype and phenotype matching is challenging. A system for matching phenotype descriptions and pathogenic variants provides a one to one mapping of the phenotype and genotypes of a plurality of subjects under test. Initially, a plurality of phenotypes and a plurality of genome sequences are segmented based on metadata. A phenotype driven gene prioritization and a variant prioritization is applied on the segmented data method. A similarity score is calculated between the phenotype driven gene prioritization output and the variant prioritization output. The similarity score is further utilized to obtain a one to one matching of the plurality of phenotypes and the plurality of genotype sequences of the plurality of subjects under test.
-