Patent search: ap:("Huican Zhu" OR "Jeffrey Dean" OR "Sanjay Ghemawat" OR "Bwolen Po-Jen Yang" OR "Anurag Acharya") AND inv:"Anurag Acharya" (page 4)

31.

Granted invention patent
Library citation integration (status: in force)

Publication No.: US07526475B1

Publication date: 2009-04-28

Application No.: US11432039

Filing date: 2006-05-10

Applicants: Alexandre A. Verstak, Anurag Acharya

Inventors: Alexandre A. Verstak, Anurag Acharya

IPC classification: G06F7/00, G06F17/00

CPC classification: G06F17/30011, Y10S707/99931, Y10S707/99933

Abstract: An online search system generates an index of documents using index information received from a library. Some documents have restricted access; some documents may not be available online. The search system provides links to documents in the library as well as other sites based on a search, and may include link resolvers received from the library. The search system provides access links to the link resolvers if an identifier, such as a user identification or IP address, matches an affiliation list from the library.
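The affiliation check described in this abstract can be illustrated with a minimal sketch. The record layout (`resolver_url`, `ip_ranges`, `user_ids`) and the function name are assumptions made for illustration; the abstract does not say how the affiliation list is stored or matched.

```python
import ipaddress


def access_links(client_ip, user_id, libraries):
    """Return link-resolver URLs for libraries the requester appears affiliated with.

    `libraries` is a hypothetical list of dicts; each holds a resolver URL and
    an affiliation list made of IP ranges and user identifiers.
    """
    ip = ipaddress.ip_address(client_ip)
    links = []
    for lib in libraries:
        # Match on either identifier mentioned in the abstract: IP address or user ID.
        in_range = any(ip in ipaddress.ip_network(net) for net in lib["ip_ranges"])
        known_user = user_id in lib["user_ids"]
        if in_range or known_user:
            links.append(lib["resolver_url"])
    return links


# Example affiliation data (documentation-only addresses and a made-up URL).
LIBRARIES = [
    {
        "resolver_url": "https://resolver.example.edu/openurl",
        "ip_ranges": ["192.0.2.0/24"],
        "user_ids": {"alice"},
    },
]
```

A request from inside `192.0.2.0/24`, or from the user `alice` at any address, would receive the resolver link; any other requester would receive none.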

32.

Granted invention patent
Query modification (status: in force)

Publication No.: US08819000B1

Publication date: 2014-08-26

Application No.: US13461315

Filing date: 2012-05-01

Applicants: Anurag Acharya, Alexandre A. Verstak

Inventors: Anurag Acharya, Alexandre A. Verstak

IPC classification: G06F17/30

CPC classification: G06F17/30, G06F17/30672, G06F17/30864

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for query modification. In one aspect, a method includes receiving an original query including a first limitation. First search results responsive to a modified query are obtained, where the first limitation has been omitted from the modified query. One or more common characteristics shared by two or more resources are identified. Each of the two or more resources corresponds to a different highly-ranked result of the first search results. A second modified query including the original query and a second limitation representing the one or more common characteristics is generated. Second search results responsive to the second modified query are obtained. The second search results are provided in a response to the original query.
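The sequence of steps in this abstract can be walked through on a toy corpus. Everything below is an assumption made for illustration: the documents, the field names, the equality-based limitations, and the choice of "most frequent value among the top results" as the common characteristic. It is a sketch of the abstract's outline, not of the claimed method.

```python
from collections import Counter

# Toy corpus; every record and field name is made up for illustration.
DOCS = [
    {"id": 1, "text": "search cache control", "author": "smith", "year": 2010},
    {"id": 2, "text": "cache snippets", "author": "smith", "year": 2011},
    {"id": 3, "text": "cache eviction", "author": "jones", "year": 2010},
    {"id": 4, "text": "news aggregation", "author": "bharat", "year": 2010},
]


def search(terms, limits):
    """Rank documents that satisfy every limitation by number of matching terms."""
    scored = []
    for doc in DOCS:
        if all(doc[field] == value for field, value in limits.items()):
            score = sum(term in doc["text"].split() for term in terms)
            if score:
                scored.append((score, doc))
    scored.sort(key=lambda pair: -pair[0])  # stable sort: ties keep corpus order
    return [doc for _, doc in scored]


def modified_search(terms, limits, first_limit, shared_field, top_k=3):
    """Drop one limitation, find a value shared by top results, then re-query."""
    # Step 1: first modified query omits the first limitation.
    relaxed = {f: v for f, v in limits.items() if f != first_limit}
    top = search(terms, relaxed)[:top_k]
    # Step 2: look for a characteristic shared by two or more top results.
    counts = Counter(doc[shared_field] for doc in top)
    if not counts:
        return search(terms, limits)
    value, n = counts.most_common(1)[0]
    if n < 2:
        return search(terms, limits)
    # Step 3: second modified query = original query plus the second limitation.
    return search(terms, {**limits, shared_field: value})
```

With terms `{"cache"}` and the limitation `year == 2010`, dropping the year limitation surfaces two top results by the same author, so the second query adds that author as a further limitation.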

33.

Granted invention patent
Identifying a primary version of a document (status: in force)

Publication No.: US08522129B1

Publication date: 2013-08-27

Application No.: US13346436

Filing date: 2012-01-09

Applicants: Alexandre A. Verstak, Anurag Acharya

Inventors: Alexandre A. Verstak, Anurag Acharya

IPC classification: G06F17/22, G06F7/00, G06F17/30

CPC classification: G06F17/2288, G06F17/2211, G06F17/30067, G06F17/3023, G06F17/30309, G06F17/30548

Abstract: A system and method identifies a primary version out of different versions of the same document. The system selects a priority of authority for each document version based on a priority rule and information associated with the document version, and selects a primary version based on the priority of authority and information associated with the document version.
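One way to picture the two-stage selection in this abstract is the sketch below. The priority rule (a ranking of source types) and the tie-breaking fields (`has_full_text`, `length`) are assumptions chosen for illustration; the abstract does not name the rule or the associated information it uses.

```python
# Hypothetical priority rule: higher number means more authoritative source.
PRIORITY_RULE = {"publisher": 3, "repository": 2, "personal_page": 1}


def priority_of_authority(version):
    """Assign a priority from the rule and the version's own information."""
    return PRIORITY_RULE.get(version["source"], 0)


def pick_primary(versions):
    """Choose the primary version by priority, then by associated information."""
    def key(version):
        # Ties on priority are broken by full-text availability, then length.
        return (
            priority_of_authority(version),
            version["has_full_text"],
            version["length"],
        )
    return max(versions, key=key)
```

Given a full-text publisher copy, an abstract-only publisher copy, and a longer personal-page copy, this sketch would pick the full-text publisher copy.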

34.

Granted invention patent
Search engine cache control (status: in force)

Publication No.: US07840557B1

Publication date: 2010-11-23

Application No.: US10845283

Filing date: 2004-05-12

Applicants: Benjamin T. Smith, Anurag Acharya

Inventors: Benjamin T. Smith, Anurag Acharya

IPC classification: G06F7/00, G06F17/30

CPC classification: G06F12/0875

Abstract: A search query containing at least one term is received at a search controller from a query server and preferably normalized and hashed into a representation of the search query. The representation of the search query is transmitted towards a cache containing multiple query result entries. Each query result entry contains a list of documents associated with the previously searched search query. The cache is then searched and query result entries for the search query are sent to the search controller from the cache. Subsequently, it is determined whether the query result entries are current versions for the search query. If the query result entries are not the current versions, then current versions of the query result entries are obtained.
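The normalize, hash, look up, and refresh flow in this abstract can be sketched as follows. The normalization (lowercasing and collapsing whitespace), the SHA-256 hash, and the use of an index version number as the currency test are all assumptions; the abstract does not specify any of them.

```python
import hashlib


def query_key(query):
    """Normalize a query and hash it into a fixed-size representation."""
    normalized = " ".join(query.lower().split())  # assumed normalization
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class QueryCache:
    """In-memory cache of query results with a currency check on each lookup."""

    def __init__(self, backend_search, index_version):
        self._search = backend_search        # callable: query -> list of documents
        self._index_version = index_version  # callable: () -> current version
        self._entries = {}

    def lookup(self, query):
        key = query_key(query)
        entry = self._entries.get(key)
        current = self._index_version()
        # A missing or stale entry is replaced with freshly obtained results.
        if entry is None or entry["version"] != current:
            entry = {"version": current, "docs": self._search(query)}
            self._entries[key] = entry
        return entry["docs"]
```

Two queries that differ only in case or spacing share one entry, and a change in the reported index version forces the next lookup to fetch current results.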

35.

Granted invention patent
Systems and methods for syndicating and hosting customized news content (status: in force)

Publication No.: US10162802B1

Publication date: 2018-12-25

Application No.: US13615846

Filing date: 2012-09-14

Applicants: Krishna Bharat, Michael Schmitt, Mike Curtiss, Marissa Mayer, Anurag Acharya, Srdjan Mitrovic, Vijay Boyapati

Inventors: Krishna Bharat, Michael Schmitt, Mike Curtiss, Marissa Mayer, Anurag Acharya, Srdjan Mitrovic, Vijay Boyapati

IPC classification: G06F17/21

Abstract: A system provides client access to customized news content. The system includes a custom news source server and a news search server. The custom news source server periodically sends one or more customized search queries to a news search server. The news search server fetches news content from multiple news source servers and aggregates the news content. The news search server also periodically receives the one or more search queries from the custom news source server, searches the aggregated news content based on the one or more search queries, and periodically provides selected news content to the custom news server based on results of the searches. The custom news source server permits access to clients, from across a network, to the selected news content provided by the news search server.

36.

Granted invention patent
Generating equivalence classes and rules for associating content with document identifiers (status: in force)

Publication No.: US09026566B2

Publication date: 2015-05-05

Application No.: US12725381

Filing date: 2010-03-16

Applicants: Anurag Acharya, Arvind Jain, Arup Mukherjee

Inventors: Anurag Acharya, Arvind Jain, Arup Mukherjee

IPC classification: G06F17/30

CPC classification: G06F17/30864

Abstract: A system of reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
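The grouping step in this abstract can be illustrated on URLs. The sketch assumes that document identifiers are URLs, that a cluster name is host plus path, that document parameters are query-string keys, and that the representative is the lexicographically smallest member. The equivalence rules are supplied by hand here; how the system generates them is not covered by the abstract or by this sketch.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlsplit


def equivalence_classes(urls, relevant_params):
    """Group URLs whose content-relevant parameters agree within a cluster.

    `relevant_params` maps a cluster name (host + path) to the set of query
    parameters assumed to affect content. Clusters without a rule treat
    every parameter as relevant.
    """
    classes = defaultdict(list)
    for url in urls:
        parts = urlsplit(url)
        cluster = parts.netloc + parts.path
        params = parse_qsl(parts.query)
        keep = relevant_params.get(cluster)
        if keep is not None:
            params = [(k, v) for k, v in params if k in keep]
        # Sorting makes the class key independent of parameter order.
        classes[(cluster, tuple(sorted(params)))].append(url)
    # Representative identifier: smallest member of each class (an assumption).
    return {min(members): sorted(members) for members in classes.values()}
```

With a rule saying that only `id` is content-relevant for `example.com/item`, two URLs that differ only in a session parameter fall into one class and only the representative needs to be crawled.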

37.

Granted invention patent
Systems and methods for personalizing aggregated news content (status: in force)

Publication No.: US08676837B2

Publication date: 2014-03-18

Application No.: US10748663

Filing date: 2003-12-31

Applicants: Krishna Bharat, Michael Schmitt, Mike Curtiss, Marissa Mayer, Kerah Pelczarski, Brian Rakowski, Anurag Acharya

Inventors: Krishna Bharat, Michael Schmitt, Mike Curtiss, Marissa Mayer, Kerah Pelczarski, Brian Rakowski, Anurag Acharya

IPC classification: G06F17/30

CPC classification: G06F17/30867

Abstract: A system customizes a news document associated with a user of a news aggregation service. The system includes multiple news source servers that store news content and a remote news aggregation server. The news aggregation server creates a customized news document based on one or more personalized search queries received from a user. The news aggregation server fetches the news content from the multiple news source servers, aggregates the news content, and searches the aggregated news content based on the one or more personalized search queries. The news aggregation server provides selected news content to the customized news document based on results of the search.

38.

Granted invention patent
Document search in affiliated libraries (status: in force)

Publication No.: US08473487B1

Publication date: 2013-06-25

Application No.: US12419872

Filing date: 2009-04-07

Applicants: Alexandre A. Verstak, Anurag Acharya

Inventors: Alexandre A. Verstak, Anurag Acharya

IPC classification: G06F17/30

CPC classification: G06F17/30011, Y10S707/99931, Y10S707/99933

Abstract: An online search system generates an index of documents using index information received from a library. Some documents have restricted access; some documents may not be available online. The search system provides links to documents in the library as well as other sites based on a search, and may include link resolvers received from the library. The search system provides access links to the link resolvers if an identifier, such as a user identification or IP address, matches an affiliation list from the library.

39.

Invention patent application
Search Engine Cache Control (status: in force)

Publication No.: US20110035372A1

Publication date: 2011-02-10

Application No.: US12905922

Filing date: 2010-10-15

Applicants: Benjamin T. Smith, Anurag Acharya

Inventors: Benjamin T. Smith, Anurag Acharya

IPC classification: G06F17/30

CPC classification: G06F12/0875

Abstract: A search query containing one or more terms is received from a client system. In response to receiving the search query, one or more snippets obtained in response to a prior execution of said search query are requested from a cache. For a respective snippet received from the cache, it is determined whether the respective snippet is a current version. In response to a determination that the respective snippet is not the current version, the current version of the respective snippet is obtained from a corresponding document in which one or more terms from said search query are located and the snippet stored in the cache is updated using the obtained current version. Search query results including the respective snippet are transmitted to the client.

40.

Granted invention patent
System for automatically managing duplicate documents when crawling dynamic documents (status: in force)

Publication No.: US07680773B1

Publication date: 2010-03-16

Application No.: US11097687

Filing date: 2005-03-31

Applicants: Anurag Acharya, Arvind Jain, Arup Mukherjee

Inventors: Anurag Acharya, Arvind Jain, Arup Mukherjee

IPC classification: G06F17/30

CPC classification: G06F17/30864

Abstract: A system of reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
