FILTERING GENETIC NETWORKS TO DISCOVER POPULATIONS OF INTEREST

    公开(公告)号:US20220076789A1

    公开(公告)日:2022-03-10

    申请号:US17531426

    申请日:2021-11-19

    摘要: A computing server generates a graph such as an identity-by-descent (IBD) network. The graph includes a plurality of nodes. Each node represents one of the individuals. Two or more nodes are connected through edges. Each edge connecting two nodes and associated with a weight that is derived from affinity between the genetic data of the two individuals represented by the two nodes. The computing system filters the graph based on features that are associated with the edges or the nodes. The filtered graph includes a subset of nodes. The computing system divides the filtered graph into a plurality of clusters to identify genetic communities that may not be discoverable without filtering. The computing server may also perform a multi-path hierarchical community detection process to assign an individual represented by a node to more than one community.

    Family networks
    2.
    发明授权
    Family networks 有权
    家庭网络

    公开(公告)号:US09390225B2

    公开(公告)日:2016-07-12

    申请号:US14214856

    申请日:2014-03-15

    IPC分类号: G06F19/00 G06F19/14

    摘要: Described embodiments enable identification of family networks using combinations of DNA analysis and genealogical information. Genealogical data is provided by users of a genealogical research service or collected from other sources and used to create family trees for each user. DNA samples are also received from the users. By analyzing the DNA samples, potential genetic relationships can be identified between some users. Once these DNA-suggested relationships have been identified, common ancestors can be sought in the respective trees of the potentially related users. Where these common ancestors exist, an inference is drawn that the DNA-suggested relationship accurately represents a familial overlap between the individuals in question. People descended from the same common ancestor are each members of a family network. Members of a family network not in a user's tree may be identified for the user, enabling the user to discover additional ancestors that might otherwise have remained unknown.

    摘要翻译: 描述的实施方案使得能够使用DNA分析和家谱信息的组合来鉴定家族网络。 家谱数据由家谱研究服务的用户提供或从其他来源收集,并用于为每个用户创建家庭树。 也从用户那里收到DNA样本。 通过分析DNA样本,可以在一些用户之间识别潜在的遗传关系。 一旦确定了这些DNA建议的关系,可以在潜在相关用户的相应树中寻找共同的祖先。 在这些共同祖先存在的地方,推断出DNA建议关系准确地表示了所讨论的个体之间的家族重叠。 来自同一共同祖先的人是家庭网络的每个成员。 可以为用户识别不在用户树中的家庭网络的成员,使得用户能够发现否则将保持未知的其他祖先。

    Linking individual datasets to a database

    公开(公告)号:US11429615B2

    公开(公告)日:2022-08-30

    申请号:US17128009

    申请日:2020-12-19

    摘要: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.

    Filtering genetic networks to discover populations of interest

    公开(公告)号:US11211149B2

    公开(公告)日:2021-12-28

    申请号:US17252652

    申请日:2019-06-14

    摘要: A computing server generates a graph such as an identity-by-descent (IBD) network. The graph includes a plurality of nodes. Each node represents one of the individuals. Two or more nodes are connected through edges. Each edge connecting two nodes and associated with a weight that is derived from affinity between the genetic data of the two individuals represented by the two nodes. The computing system filters the graph based on features that are associated with the edges or the nodes. The filtered graph includes a subset of nodes. The computing system divides the filtered graph into a plurality of clusters to identify genetic communities that may not be discoverable without filtering. The computing server may also perform a multi-path hierarchical community detection process to assign an individual represented by a node to more than one communities.

    Discovering Population Structure From Patterns of Identity-By-Descent
    6.
    发明申请
    Discovering Population Structure From Patterns of Identity-By-Descent 审中-公开
    发现身份逐渐下降的模式中的人口结构

    公开(公告)号:US20160350479A1

    公开(公告)日:2016-12-01

    申请号:US15168011

    申请日:2016-05-28

    IPC分类号: G06F19/24 C40B30/02

    摘要: Described are techniques for determining population structure from identity-by-descent (IBD) of individuals. The techniques may be used to predict that an individual belongs to zero, one or more of a number of communities identified within an IBD network. Additional data may be used to annotate the communities with birth location, surname, and ethnicity information. In turn, these data may be used to provide to an individual a prediction of membership to zero, one or more communities, accompanied by a summary of the information annotated to those communities.

    摘要翻译: 描述的是确定个体身份(IBD)的人口结构的技术。 这些技术可以用于预测个体属于IBD网络内识别的多个社区中的零个,一个或多个。 附加数据可用于注释具有出生地点,姓氏和种族信息的社区。 反过来,这些数据可以用于向个人提供对零,一个或多个社区的成员资格的预测,并附有对这些社区注释的信息的摘要。

    LINKING INDIVIDUAL DATASETS TO A DATABASE

    公开(公告)号:US20220365934A1

    公开(公告)日:2022-11-17

    申请号:US17868775

    申请日:2022-07-20

    摘要: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.

    Family Networks
    8.
    发明申请
    Family Networks 审中-公开

    公开(公告)号:US20190267109A1

    公开(公告)日:2019-08-29

    申请号:US16406847

    申请日:2019-05-08

    摘要: Described embodiments enable identification of family networks using combinations of DNA analysis and genealogical information. Genealogical data is provided by users of a genealogical research service or collected from other sources and used to create family trees for each user. DNA samples are also received from the users. By analyzing the DNA samples, potential genetic relationships can be identified between some users. Once these DNA-suggested relationships have been identified, common ancestors can be sought in the respective trees of the potentially related users. Where these common ancestors exist, an inference is drawn that the DNA-suggested relationship accurately represents a familial overlap between the individuals in question. People descended from the same common ancestor are each members of a family network. Members of a family network not in a user's tree may be identified for the user, enabling the user to discover additional ancestors that might otherwise have remained unknown.

    Family Networks
    10.
    发明申请
    Family Networks 有权
    家庭网络

    公开(公告)号:US20140278138A1

    公开(公告)日:2014-09-18

    申请号:US14214856

    申请日:2014-03-15

    IPC分类号: G06F19/14

    摘要: Described embodiments enable identification of family networks using combinations of DNA analysis and genealogical information. Genealogical data is provided by users of a genealogical research service or collected from other sources and used to create family trees for each user. DNA samples are also received from the users. By analyzing the DNA samples, potential genetic relationships can be identified between some users. Once these DNA-suggested relationships have been identified, common ancestors can be sought in the respective trees of the potentially related users. Where these common ancestors exist, an inference is drawn that the DNA-suggested relationship accurately represents a familial overlap between the individuals in question. People descended from the same common ancestor are each members of a family network. Members of a family network not in a user's tree may be identified for the user, enabling the user to discover additional ancestors that might otherwise have remained unknown.

    摘要翻译: 描述的实施方案使得能够使用DNA分析和家谱信息的组合来鉴定家族网络。 家谱数据由家谱研究服务的用户提供或从其他来源收集,并用于为每个用户创建家庭树。 也从用户那里收到DNA样本。 通过分析DNA样本,可以在一些用户之间识别潜在的遗传关系。 一旦确定了这些DNA建议的关系,可以在潜在相关用户的相应树中寻找共同的祖先。 在这些共同祖先存在的地方,推断出DNA建议关系准确地表示了所讨论的个体之间的家族重叠。 来自同一共同祖先的人是家庭网络的每个成员。 可以为用户识别不在用户树中的家庭网络的成员,使得用户能够发现否则将保持未知的其他祖先。