专利检索 ap:("Mihaela Ancuta Bornea" OR "Songyun Duan" OR "Achille Belly Fokoue-Nkoutche" OR "Oktie Hassanzadeh" OR "Anastasios Kementsietsidis" OR "Kavitha Srinivas" OR "Michael J. Ward") AND inv:"Kavitha Srinivas" 第 1 页

1.

发明申请
Linking Data Elements Based on Similarity Data Values and Semantic Annotations 审中-公开

公开(公告)号：US20130332467A1

公开(公告)日：2013-12-12

申请号：US13543872

申请日：2012-07-08

申请人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F16/951

摘要： Data elements from data sources and having a data value set are linked by using hash functions to determine a dimensionally reduced instance signature for each data element based on all data values associated with that data element to yield a plurality of dimensionally reduced instance signatures of equivalent fixed size such that similarities among the data values in the data value sets across all data elements is maintained among the plurality of instance signatures. Candidate pairs of data elements to link are identified using the plurality of instance signatures in locality sensitive hash functions, and a similarity index is generated for each candidate pair using a pre-determined measure of similarity. Candidate pairs of data elements having a similarity index above a given threshold are linked.

2.

发明申请
Linking Data Elements Based on Similarity Data Values and Semantic Annotations 审中-公开
标题翻译：基于相似性数据值和语义注释链接数据元素

公开(公告)号：US20130332466A1

公开(公告)日：2013-12-12

申请号：US13491724

申请日：2012-06-08

申请人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F17/30864

摘要： Data elements from data sources and having a data value set are linked by using hash functions to determine a dimensionally reduced instance signature for each data element based on all data values associated with that data element to yield a plurality of dimensionally reduced instance signatures of equivalent fixed size such that similarities among the data values in the data value sets across all data elements is maintained among the plurality of instance signatures. Candidate pairs of data elements to link are identified using the plurality of instance signatures in locality sensitive hash functions, and a similarity index is generated for each candidate pair using a pre-determined measure of similarity. Candidate pairs of data elements having a similarity index above a given threshold are linked.

摘要翻译： 来自数据源并且具有数据值集合的数据元素通过使用散列函数来链接，以基于与该数据元素相关联的所有数据值来确定每个数据元素的尺寸上减小的实例签名，以产生多个等距固定的尺寸缩小的实例签名大小，使得在多个实例签名之间保持跨所有数据元素的数据值中的数据值之间的相似性。使用位置敏感哈希函数中的多个实例签名来识别要链接的候选数据元素对，并且使用预定的相似度测量为每个候选对生成相似性索引。具有高于给定阈值的相似性指数的候选对的数据元素被链接。

3.

发明授权
Linking data elements based on similarity data values and semantic annotations 有权

公开(公告)号：US10229200B2

公开(公告)日：2019-03-12

申请号：US13491724

申请日：2012-06-08

申请人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael James Ward

发明人： Mihaela Ancuta Bornea , Songyun Duan , Achille Belly Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael James Ward

IPC分类号： G06F17/30

摘要： Data elements from data sources and having a data value set are linked by using hash functions to determine a dimensionally reduced instance signature for each data element based on all data values associated with that data element to yield a plurality of dimensionally reduced instance signatures of equivalent fixed size such that similarities among the data values in the data value sets across all data elements is maintained among the plurality of instance signatures. Candidate pairs of data elements to link are identified using the plurality of instance signatures in locality sensitive hash functions, and a similarity index is generated for each candidate pair using a pre-determined measure of similarity. Candidate pairs of data elements having a similarity index above a given threshold are linked.

4.

发明授权
Querying and integrating structured and unstructured data 有权
标题翻译：查询和整合结构化和非结构化数据

公开(公告)号：US09037615B2

公开(公告)日：2015-05-19

申请号：US13493174

申请日：2012-06-11

申请人： Mihaela Ancuta Bornea , Songyun Duan , James J. Fan , Achille Fokoue-Nkoutche , Alfio M. Gliozzo , Aditya Kalyanpur , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela Ancuta Bornea , Songyun Duan , James J. Fan , Achille Fokoue-Nkoutche , Alfio M. Gliozzo , Aditya Kalyanpur , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30946 , G06F17/30292

摘要： A computer-implemented method, system, and article of manufacture for querying and integrating structured and unstructured data. The method includes: receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity in-formation comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data.

摘要翻译： 用于查询和整合结构化和非结构化数据的计算机实现的方法，系统和制造。该方法包括：使用开放域信息提取系统接收从第一组非结构化数据提取的实体信息，其中所述实体信息包括第一组非结构化数据的第一实体与第二实体之间的关系信息; 基于所述关系信息识别模式，并基于所述模式为所述第一组非结构化数据创建模式; 并且将所创建的模式的元素与（i）第二组非结构化数据的实体相关联，或者（ii）现有结构化数据集合的模式元素，如果所创建的模式元素与第二组之间存在足够的总体相似度非结构化数据实体或现有结构化数据的架构元素。

5.

发明申请
QUERYING AND INTEGRATING STRUCTURED AND INSTRUCTURED DATA 有权
标题翻译：查询和整合结构化和结构化数据

公开(公告)号：US20130332478A1

公开(公告)日：2013-12-12

申请号：US13493174

申请日：2012-06-11

申请人： Mihaela Ancuta Bornea , Songyun Duan , James J. Fan , Achille Fokoue-Nkoutche , Alfio M. Gliozzo , Aditya Kalyanpur , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela Ancuta Bornea , Songyun Duan , James J. Fan , Achille Fokoue-Nkoutche , Alfio M. Gliozzo , Aditya Kalyanpur , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F17/30946 , G06F17/30292

摘要： A computer-implemented method, system, and article of manufacture for querying and integrating structured and unstructured data. The method includes: receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data.

摘要翻译： 用于查询和整合结构化和非结构化数据的计算机实现的方法，系统和制造。该方法包括：使用开放域信息提取系统接收从第一组非结构化数据提取的实体信息，其中实体信息包括第一组非结构化数据的第一实体与第二实体之间的关系信息; 基于所述关系信息识别模式，并基于所述模式为所述第一组非结构化数据创建模式; 并且将所创建的模式的元素与（i）第二组非结构化数据的实体相关联，或者（ii）现有结构化数据集合的模式元素，如果所创建的模式元素与第二组之间存在足够的总体相似度非结构化数据实体或现有结构化数据的架构元素。

6.

发明申请
AGGREGATING SEARCH RESULTS BASED ON ASSOCIATING DATA INSTANCES WITH KNOWLEDGE BASE ENTITIES 审中-公开
标题翻译：基于与知识基础实体相关联的数据实验的搜索结果

公开(公告)号：US20120246154A1

公开(公告)日：2012-09-27

申请号：US13070193

申请日：2011-03-23

申请人： Songyun Duan , Achille B. Fokoue-Nfoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Songyun Duan , Achille B. Fokoue-Nfoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F16/2455

摘要： Methods and systems for aggregating search query results include receiving search query results and schema information for the query results from multiple heterogeneous sources, determining types for elements of the query results based on the schema information, determining potential aggregations for the query results based on the types, which are based on accumulated information from the plurality of heterogeneous resources, and aggregating the query results according to one or more of the potential aggregations.

摘要翻译： 用于聚合搜索查询结果的方法和系统包括从多个异构源接收用于查询结果的搜索查询结果和模式信息，基于模式信息确定查询结果的元素的类型，基于类型确定查询结果的潜在聚合，其基于来自所述多个异构资源的累积信息，并且根据所述潜在聚合中的一个或多个聚合所述查询结果。

7.

发明授权
Annotating schema elements based on associating data instances with knowledge base entities 有权

公开(公告)号：US09959326B2

公开(公告)日：2018-05-01

申请号：US13070238

申请日：2011-03-23

申请人： Songyun Duan , Achille B. Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Songyun Duan , Achille B. Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F17/30545 , G06F17/30566

摘要： Methods and systems for determining schema element types are shown that include pooling potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring the pool of potential annotations according to relevancy using information using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating the element of the unlabeled schema using the most relevant potential annotations.

8.

发明申请
ANNOTATING SCHEMA ELEMENTS BASED ON ASSOCIATING DATA INSTANCES WITH KNOWLEDGE BASE ENTITIES 有权
标题翻译：基于与知识基础实体相关的数据实验的示例图表元素

公开(公告)号：US20120246175A1

公开(公告)日：2012-09-27

申请号：US13070238

申请日：2011-03-23

申请人： Songyun Duan , Achille B. Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Songyun Duan , Achille B. Fokoue-Nkoutche , Oktie Hassanzadeh , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F17/30545 , G06F17/30566

摘要： Methods and systems for determining schema element types are shown that include pooling potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring the pool of potential annotations according to relevancy using information using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating the element of the unlabeled schema using the most relevant potential annotations.

摘要翻译： 示出了用于确定模式元素类型的方法和系统，其包括对来自多个异构源的未标记模式的元素的潜在注释进行池化，根据使用来自多个异构源的实例信息的信息，根据相关性评分潜在注释池，产生相关性分数，并使用最相关的潜在注释来注释未标记模式的元素。

9.

发明申请
SPREADSHEET SCHEMA EXTRACTION 审中-公开

公开(公告)号：US20140074878A1

公开(公告)日：2014-03-13

申请号：US13617322

申请日：2012-09-14

申请人： Mihaela A. Bornea , Songyun Duan , Achille B. Fokoue-Nkoutche , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela A. Bornea , Songyun Duan , Achille B. Fokoue-Nkoutche , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/30

CPC分类号： G06F17/2745 , G06F17/246

摘要： Aspects of the present invention provide a tool for extracting schema from a spreadsheet. In an embodiment, a set of data that is stored in an uncataloged tabular format, such as a spreadsheet, is retrieved. The structure of the retrieved set of data is surveyed to determine the dataset schema thereof. Then, data elements within the dataset schema are analyzed to obtain information regarding the data elements. Based on dataset schema and the element information, an interface can be constructed that allows remote access to the set of data.

10.

发明申请
SPREADSHEET SCHEMA EXTRACTION 审中-公开

公开(公告)号：US20140075278A1

公开(公告)日：2014-03-13

申请号：US13611258

申请日：2012-09-12

申请人： Mihaela A. Bornea , Songyun Duan , Achille B. Fokoue-Nkoutche , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

发明人： Mihaela A. Bornea , Songyun Duan , Achille B. Fokoue-Nkoutche , Anastasios Kementsietsidis , Kavitha Srinivas , Michael J. Ward

IPC分类号： G06F17/00

CPC分类号： G06F17/2745 , G06F17/246

摘要： Aspects of the present invention provide a tool for extracting schema from a spreadsheet. In an embodiment, a set of data that is stored in an uncataloged tabular format, such as a spreadsheet, is retrieved. The structure of the retrieved set of data is surveyed to determine the dataset schema thereof. Then, data elements within the dataset schema are analyzed to obtain information regarding the data elements. Based on dataset schema and the element information, an interface can be constructed that allows remote access to the set of data.

摘要翻译： 本发明的各方面提供了用于从电子表格中提取模式的工具。在一个实施例中，检索以非催化表格格式存储的一组数据，例如电子表格。检索检索到的数据集的结构以确定其数据集模式。然后，分析数据集模式中的数据元素以获得有关数据元素的信息。基于数据集模式和元素信息，可以构造允许远程访问数据集的接口。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类