专利检索 ap:("Andrei Z. Broder" OR "Marcus Felipe Fontoura" OR "Michael Herscovici" OR "Ronny Lempel" OR "John Ai McPherson, Jr." OR "Andreas Neumann" OR "Runping Qi" OR "Eugene Jon Shekita") AND inv:"Marcus Felipe Fontoura" 第 1 页

1.

发明授权
Generic architecture for indexing document groups in an inverted text index 有权
标题翻译：在反向文本索引中索引文档组的通用架构

公开(公告)号：US08131726B2

公开(公告)日：2012-03-06

申请号：US10905604

申请日：2005-01-12

申请人： Andrei Z. Broder , Marcus Felipe Fontoura , Michael Herscovici , Ronny Lempel , John Ai McPherson, Jr. , Andreas Neumann , Runping Qi , Eugene Jon Shekita

发明人： Andrei Z. Broder , Marcus Felipe Fontoura , Michael Herscovici , Ronny Lempel , John Ai McPherson, Jr. , Andreas Neumann , Runping Qi , Eugene Jon Shekita

IPC分类号： G06F7/00 , G06F17/00 , G06F17/30

CPC分类号： G06F17/30622

摘要： A method for indexing a plurality of documents, that includes a plurality of duplicate documents, first identifies one or more duplicate groups of documents from among the plurality of documents. Then, one index of content for the duplicate group is created instead of indexing the content from every document within the duplicate group. However, in contrast to the content index, an index of metadata for each of the documents in the duplicate group is created. Thus the content of each duplicate group is indexed only once, while a search engine using such indexing techniques retains the capability to answer queries as if the duplicated content was indexed for each document of the group.

摘要翻译： 一种用于索引多个文档（包括多个重复文档）的方法首先从多个文档中识别一个或多个文档重复组。然后，创建重复组的一个内容索引，而不是从重复组中的每个文档索引内容。然而，与内容索引相反，创建了重复组中的每个文档的元数据索引。因此，每个重复组的内容仅被索引一次，而使用这种索引技术的搜索引擎保留回答查询的能力，就好像为组中的每个文档索引了重复的内容。

2.

发明申请
SYSTEM AND ARTICLE OF MANUFACTURE FOR SEARCHING DOCUMENTS FOR RANGES OF NUMERIC VALUES 失效
标题翻译：用于搜索数值范围的文件的制造和制造

公开(公告)号：US20080294634A1

公开(公告)日：2008-11-27

申请号：US12187344

申请日：2008-08-06

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F7/06 , G06F17/30

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a system and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents include at least one value that is a member of a set of values. A number of posting lists is generated, wherein each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values. A query on a query range of values within the set of values is received and a determination is made of a minimum number of posting lists associated with consecutive values that together include the query range of values. The determined posting lists are merged to form a merged posting list including document identifiers of documents including values within the query range. The document identifiers in the merged posting list are returned.

摘要翻译： 提供了用于搜索文件范围的数值的系统和制品。文档的文档标识符至少包含一个值，它是一组值的成员。生成多个发布列表，其中每个发布列表与该组值范围内的连续值的范围相关联，并且包括文档的文档标识符，其包括与发布列表相关联的连续值的范围内的至少一个值，并且其中每个文档标识符与由文档标识符标识的文档中包括的值集合中的一个值相关联。存储生成的发布列表，其中发布列表用于处理在该组值范围内的查询。接收关于该值集合中的值的查询范围的查询，并且确定与连续值相关联的一起包括查询范围值的连续值的最小发布列表数。确定的发布列表被合并以形成合并的发布列表，包括包括查询范围内的值的文档的文档标识符。返回合并发布列表中的文档标识符。

3.

发明授权
Searching documents for ranges of numeric values 有权
标题翻译：搜索文件范围的数值

公开(公告)号：US08271498B2

公开(公告)日：2012-09-18

申请号：US12190495

申请日：2008-08-12

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F7/00

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.

摘要翻译： 提供了用于搜索文件范围的数值的方法，系统和制品。访问文档的文档标识符，其中文档包括作为一组值的成员的至少一个值。生成多个发布列表。每个发布列表与所述值集合内的连续值的范围相关联，并且包括用于文档的文档标识符，所述文档包括与所述发布列表相关联的连续值的范围内的至少一个值，并且其中每个文档标识符与由文件标识符标识的文档中包含的值集合。存储生成的发布列表，其中发布列表用于处理在该组值范围内的查询。

4.

发明申请
METHOD, SYSTEM AND ARTICLE OF MANUFACTURE FOR SEARCHING DOCUMENTS FOR RANGES OF NUMERIC VALUES 有权
标题翻译：用于搜索数值范围的文档的制造方法，系统和文章

公开(公告)号：US20080301130A1

公开(公告)日：2008-12-04

申请号：US12190495

申请日：2008-08-12

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F17/30

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.

摘要翻译： 提供了用于搜索文件范围的数值的方法，系统和制品。访问文档的文档标识符，其中文档包括作为一组值的成员的至少一个值。生成多个发布列表。每个发布列表与该组值范围内的连续值的范围相关联，并且包括用于文档的文档标识符，其包括与发布列表相关联的连续值的范围内的至少一个值，并且其中每个文档标识符与由文件标识符标识的文档中包含的值集合。存储生成的发布列表，其中发布列表用于处理在该组值范围内的查询。

5.

发明授权
Method for searching documents for ranges of numeric values 有权
标题翻译：搜索文件数值范围的方法

公开(公告)号：US07461064B2

公开(公告)日：2008-12-02

申请号：US10949473

申请日：2004-09-24

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a method, system, and program for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents having values within the range of consecutive values associated with the posting list. Each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored.

摘要翻译： 提供了用于在数值范围内搜索文档的方法，系统和程序。访问文档的文档标识符，其中文档包括作为一组值的成员的至少一个值。生成多个发布列表。每个发布列表与该组值范围内的连续值的范围相关联，并且包括具有与发布列表相关联的连续值范围内的值的文档的文档标识符。每个文档标识符与由文档标识符标识的文档中包括的值集合中的一个值相关联。生成的发布列表被存储。

6.

发明授权
Searching documents for ranges of numeric values 失效
标题翻译：搜索文件范围的数值

公开(公告)号：US08346759B2

公开(公告)日：2013-01-01

申请号：US12187344

申请日：2008-08-06

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a system and article of manufacture for searching documents for ranges of numeric values. A number of posting lists is generated, wherein each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored. A query on a query range of values within the set of values is received and a determination is made of a minimum number of posting lists associated with consecutive values that together include the query range of values. The determined posting lists are merged.

摘要翻译： 提供了用于搜索文件范围的数值的系统和制品。生成多个发布列表，其中每个发布列表与该组值范围内的连续值的范围相关联，并且包括文档的文档标识符，其包括与发布列表相关联的连续值的范围内的至少一个值，并且其中每个文档标识符与由文档标识符标识的文档中包括的值集合中的一个值相关联。生成的发布列表被存储。接收关于该值集合中的值的查询范围的查询，并且确定与连续值相关联的一起包括查询范围值的连续值的最小发布列表数。确定的发布列表合并。

7.

发明申请
SEARCHING DOCUMENTS FOR RANGES OF NUMERIC VALUES 有权
标题翻译：搜索数值范围的文件

公开(公告)号：US20120096016A1

公开(公告)日：2012-04-19

申请号：US13335634

申请日：2011-12-22

申请人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Ronny Lempel , Runping Qi , Jason Yeong Zien

IPC分类号： G06F17/30

CPC分类号： G06F17/30864 , Y10S707/99933 , Y10S707/99937

摘要： Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.

摘要翻译： 提供了用于搜索文件范围的数值的方法，系统和制品。访问文档的文档标识符，其中文档包括作为一组值的成员的至少一个值。生成多个发布列表。每个发布列表与所述值集合内的连续值的范围相关联，并且包括用于文档的文档标识符，所述文档包括与所述发布列表相关联的连续值的范围内的至少一个值，并且其中每个文档标识符与由文件标识符标识的文档中包含的值集合。存储生成的发布列表，其中发布列表用于处理在该组值范围内的查询。

8.

发明授权
Pipelined architecture for global analysis and index building 有权
标题翻译：流水线架构，用于全球分析和索引建设

公开(公告)号：US07783626B2

公开(公告)日：2010-08-24

申请号：US11840881

申请日：2007-08-17

申请人： Marcus Felipe Fontoura , Reiner Kraft , Tony Kai-Chi Leung , John A. McPherson, Jr. , Andreas Neumann , Runping Qi , Sridhar Rajagopalan , Eugene J. Shekita , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Reiner Kraft , Tony Kai-Chi Leung , John A. McPherson, Jr. , Andreas Neumann , Runping Qi , Sridhar Rajagopalan , Eugene J. Shekita , Jason Yeong Zien

IPC分类号： G06F7/00

CPC分类号： G06F17/30864 , Y10S707/99931 , Y10S707/99943

摘要： Provided is a technique for building an index. A new indexi+1 is built and an anchor text tablei+1 and a duplicates tablei+1 are output using a storei, a delta store, and previously generated global analysis computationsi, wherein the previously generated global analysis computationsi include an anchor text tablei, a rank tablei, and a duplicates tablei. New global analysis computationsi+1 are generated using the anchor text tablei+1, the duplicates tablei+1, and the previously generated global analysis computationsi.

摘要翻译： 提供了一种构建索引的技术。构建新的索引+1，并使用存储，增量存储和先前生成的全局分析计算i输出锚文本表1和复制表1，其中先前生成的全局分析计算包括锚文本表，一个等级表，和一个重复的表。新的全局分析计算i + 1使用锚文本tablei + 1，复制表1 + 1和以前生成的全局分析计算i生成。

9.

发明授权
Architecture for an indexer 失效
标题翻译：索引器的架构

公开(公告)号：US07743060B2

公开(公告)日：2010-06-22

申请号：US11834556

申请日：2007-08-06

申请人： Marcus Felipe Fontoura , Andreas Neumann , Sridhar Rajagopalan , Eugene J. Shekita , Jason Yeong Zien

发明人： Marcus Felipe Fontoura , Andreas Neumann , Sridhar Rajagopalan , Eugene J. Shekita , Jason Yeong Zien

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30864 , G06F17/30616 , Y10S707/99932 , Y10S707/99937 , Y10S707/99942 , Y10S707/99943

摘要： Disclosed is a technique for indexing data. For each token in a set of documents, a sort key is generated that includes a document identifier that indicates whether a section of a document associated with the sort key is an anchor text section or a context section, wherein the anchor text section and the context text section have a same document identifier; it is determined whether a data field associated with the token is a fixed width; when the data field is a fixed width, the token is designated as one for which fixed width sort is to be performed; and, when the data field is a variable length, the token is designated as one for which a variable width sort is to be performed. The fixed width sort and the variable width sort are performed. For each document, the sort keys are used to bring together the anchor text section and the context section of that document.

摘要翻译： 公开了一种索引数据的技术。对于一组文档中的每个标记，生成包括指示与排序键相关联的文档的一部分是锚定文本部分还是上下文部分的文档标识符的排序关键字，其中锚文本部分和上下文文本部分具有相同的文档标识符; 确定与令牌相关联的数据字段是否是固定宽度; 当数据字段是固定宽度时，令牌被指定为要进行固定宽度排序的令牌; 并且当数据字段是可变长度时，令牌被指定为要对其执行可变宽度排序的令牌。执行固定宽度排序和可变宽度排序。对于每个文档，排序键用于将锚文本部分和文档的上下文部分组合在一起。

10.

发明授权
System and method to facilitate importation of data taxonomies within a network 有权
标题翻译：促进网络内数据分类的输入的系统和方法

公开(公告)号：US07991806B2

公开(公告)日：2011-08-02

申请号：US11781183

申请日：2007-07-20

申请人： Andrei Zary Broder , Marcus Felipe Fontoura , Vanja Josifovski

发明人： Andrei Zary Broder , Marcus Felipe Fontoura , Vanja Josifovski

IPC分类号： G06F17/30

CPC分类号： G06F17/30734 , G06Q30/0241

摘要： A system and method to facilitate importation of data taxonomies within a network are described. Advertiser entities access a data storage module within a network-based entity to retrieve content information from one or more content taxonomies stored within the data storage module. Subsequently, the advertiser entities select advertisements targeted to specific users based on the retrieved content information and further transmit the advertisements to the network-based entity. Furthermore, publisher entities and/or advertiser entities transmit data, such as, for example, associated taxonomy information, to the network-based entity. The entity receives the respective taxonomy information and parses the taxonomy information to extract node information and associated categories related to the received information. Finally, the entity integrates the node information and associated categories into one or more taxonomies stored within the data storage module. Alternatively, the entity maps the node information and associated categories into corresponding nodes within one or more taxonomies stored within the data storage module, and further stores the mapping information into a mapping database within the data storage module.

摘要翻译： 描述了一种促进网络中数据分类的输入的系统和方法。广告商实体访问基于网络的实体内的数据存储模块，以从存储在数据存储模块内的一个或多个内容分类法检索内容信息。随后，广告商实体根据检索到的内容信息选择针对特定用户的广告，并且进一步将广告发送到基于网络的实体。此外，发布者实体和/或广告商实体将数据（例如相关联的分类信息）发送到基于网络的实体。实体接收相应的分类信息并解析分类信息以提取与所接收信息相关的节点信息和相关类别。最后，实体将节点信息和相关类别整合到存储在数据存储模块内的一个或多个分类。或者，实体将节点信息和相关联的类别映射到存储在数据存储模块内的一个或多个分类法内的对应节点，并且还将映射信息存储到数据存储模块内的映射数据库中。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类