Patent search: ap:("Palantir Technologies Inc.") AND inv:"Rahul Mehta", page 1

1.

Granted invention patent
Systems and methods for automatic clustering and canonical designation of related data in various data structures (In force)

Publication number: US11704325B2

Publication date: 2023-07-18

Application number: US17812984

Filing date: 2022-07-15

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F16/00, G06F16/2457, G06F16/35, G06F16/9535, G06F16/28, G06F18/23

CPC classification: G06F16/24578, G06F16/285, G06F16/35, G06F16/9535, G06F18/23

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
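
The pairwise approach this abstract describes can be illustrated with a minimal Python sketch. It is not the patented method. The scoring function (`pair_probability`, a string-similarity stand-in for a real probability model), the 0.8 threshold, the `name` field, and the most-frequent-value rule for the canonical name are all assumptions made for illustration. Overlapping pairs are merged with a union-find structure.

```python
from collections import Counter, defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def pair_probability(a: dict, b: dict) -> float:
    """Stand-in score in [0, 1]: similarity of the (assumed) 'name' field."""
    return SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()


def cluster_records(records, threshold=0.8):
    """Group records whose pairwise score meets the threshold.

    Pairs that share a record overlap, so they end up in the same cluster.
    """
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Score every pair of records (quadratic; fine for a small illustration).
    for i, j in combinations(range(len(records)), 2):
        if pair_probability(records[i], records[j]) >= threshold:
            parent[find(i)] = find(j)  # merge the two clusters

    clusters = defaultdict(list)
    for i, record in enumerate(records):
        clusters[find(i)].append(record)
    return list(clusters.values())


def canonical_name(cluster):
    """Pick the most frequent 'name' value in the cluster (an assumed rule)."""
    return Counter(record["name"] for record in cluster).most_common(1)[0][0]
```

Calling `cluster_records` on a list of dictionaries that each carry a `name` key returns a list of clusters, and `canonical_name` can then be applied to each cluster. Scoring every pair does not scale to large record sets, so a practical system would need some way to limit which pairs are compared.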

2.

Invention application
SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES (In force)

Publication number: US20220374454A1

Publication date: 2022-11-24

Application number: US17812984

Filing date: 2022-07-15

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F16/28, G06K9/62

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

3.

Invention application
SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES (Under examination, published)

Publication number: US20190079937A1

Publication date: 2019-03-14

Application number: US16189040

Filing date: 2018-11-13

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F17/30

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

4.

Invention publication
SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES (Under examination, published)

Publication number: US20240320227A1

Publication date: 2024-09-26

Application number: US18731699

Filing date: 2024-06-03

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F16/2457, G06F16/28, G06F16/35, G06F16/9535, G06F18/23

CPC classification: G06F16/24578, G06F16/285, G06F16/35, G06F16/9535, G06F18/23

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

5.

Granted invention patent
Systems and methods for automatic clustering and canonical designation of related data in various data structures (In force)

Publication number: US12038933B2

Publication date: 2024-07-16

Application number: US18325616

Filing date: 2023-05-30

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F16/00, G06F16/2457, G06F16/28, G06F16/35, G06F16/9535, G06F18/23

CPC classification: G06F16/24578, G06F16/285, G06F16/35, G06F16/9535, G06F18/23

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

6.

Granted invention patent
Systems and methods for automatic clustering and canonical designation of related data in various data structures (In force)

Publication number: US10127289B2

Publication date: 2018-11-13

Application number: US15233149

Filing date: 2016-08-10

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F17/30

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

7.

Granted invention patent
Automatic generation of composite datasets based on hierarchical fields (In force)

Publication number: US09542446B1

Publication date: 2017-01-10

Application number: US14996179

Filing date: 2016-01-14

Applicant: Palantir Technologies, Inc.

Inventors: Ben Duffield, Patrick Woody, Rahul Mehta

IPC classification: G06F17/30

CPC classification: G06F17/30498, G06F17/30398, G06F17/30401, G06F17/3043, G06F17/30525, G06F17/30598

Abstract: Datasets are annotated with metadata including categories. Each category corresponds to one or more fields. A hierarchy mapping is generated to indicate a hierarchical relationship between different categories. A natural language query specifies a first granularity level indicating a particular category and one or more field values corresponding to the particular category. Based on the hierarchy mapping, one or more categories that are hierarchically related to the particular category are identified. Based on the metadata, two or more datasets that include at least one hierarchically related category are selected. Based on the first granularity level, one or more dataset filters are generated. The one or more dataset filters are translated to a second granularity level corresponding to the at least one hierarchically related category. The translated filters are applied to at least one of the selected datasets. The two or more datasets are joined to generate a composite dataset.
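
The filter-translation and join steps in this abstract can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the patented implementation: the two categories (`city` and `state`), the `CITY_TO_STATE` hierarchy mapping, and the row layouts are hypothetical, and the natural language query is assumed to have already been parsed into a set of state values.

```python
# Hypothetical hierarchy mapping: each "city" value rolls up to a "state" value.
CITY_TO_STATE = {"Oakland": "CA", "Fresno": "CA", "Austin": "TX"}


def translate_filter(states):
    """Translate a state-level filter into the equivalent set of cities."""
    return {city for city, state in CITY_TO_STATE.items() if state in states}


def build_composite(city_rows, state_rows, states):
    """Filter both datasets, then join them on the shared state category."""
    # The filter arrives at state granularity; the city-level dataset needs
    # it expressed at city granularity.
    cities = translate_filter(states)

    # Index the state-level dataset, keeping only the requested states.
    state_index = {row["state"]: row for row in state_rows if row["state"] in states}

    composite = []
    for row in city_rows:
        if row["city"] not in cities:
            continue
        state = CITY_TO_STATE[row["city"]]
        if state in state_index:  # inner join: skip cities with no state row
            composite.append({**row, **state_index[state]})
    return composite
```

Here `city_rows` is a list of dictionaries with a `city` key and `state_rows` is a list of dictionaries with a `state` key. The result contains one merged row per city that falls under the requested states.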

8.

Invention publication
SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES (Under examination, published)

Publication number: US20230297582A1

Publication date: 2023-09-21

Application number: US18325616

Filing date: 2023-05-30

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F16/2457, G06F16/35, G06F16/9535, G06F16/28, G06F18/23

CPC classification: G06F16/24578, G06F16/35, G06F16/9535, G06F16/285, G06F18/23

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.

9.

Granted invention patent
Automatic generation of composite datasets based on hierarchical fields (In force)

Publication number: US10678860B1

Publication date: 2020-06-09

Application number: US15282780

Filing date: 2016-09-30

Applicant: Palantir Technologies, Inc.

Inventors: Ben Duffield, Patrick Woody, Rahul Mehta

IPC classification: G06F16/9032, G06F16/248, G06F16/28, G06F16/2455, G06F16/2457

Abstract: Datasets are annotated with metadata including categories. Each category corresponds to one or more fields. A hierarchy mapping is generated to indicate a hierarchical relationship between different categories. A natural language query specifies a first granularity level indicating a particular category and one or more field values corresponding to the particular category. Based on the hierarchy mapping, one or more categories that are hierarchically related to the particular category are identified. Based on the metadata, two or more datasets that include at least one hierarchically related category are selected. Based on the first granularity level, one or more dataset filters are generated. The one or more dataset filters are translated to a second granularity level corresponding to the at least one hierarchically related category. The translated filters are applied to at least one of the selected datasets. The two or more datasets are joined to generate a composite dataset.

10.

Invention application
SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES (Under examination, published)

Publication number: US20170052958A1

Publication date: 2017-02-23

Application number: US15233149

Filing date: 2016-08-10

Applicant: Palantir Technologies Inc.

Inventors: Lawrence Manning, Rahul Mehta, Daniel Erenrich, Guillem Palou Visa, Roger Hu, Xavier Falco, Rowan Gilmore, Eli Bingham, Jason Prestinario, Yifei Huang, Daniel Fernandez, Jeremy Elser, Clayton Sader, Rahul Agarwal, Matthew Elkherj, Nicholas Latourette, Aleksandr Zamoshchin

IPC classification: G06F17/30

CPC classification: G06F17/3053, G06F17/30705, G06F17/30867

Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
