-
公开(公告)号:US20090299962A1
公开(公告)日:2009-12-03
申请号:US12127949
申请日:2008-05-28
申请人: PRATIBHA PERMANDLA , GAURAV SAREEN
发明人: PRATIBHA PERMANDLA , GAURAV SAREEN
IPC分类号: G06F17/30
CPC分类号: G06F17/30864
摘要: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.
摘要翻译: 提供了系统和方法,用于针对诸如URL的特定网页地址,定期更新具有新的或更新的内容(例如元字元或元流)的网络索引。 接收网页地址和相关联的更新信息,例如元词,元流,值和这些元词的web索引中的位置。 为了更新由搜索引擎用于搜索web文档的web索引,检索文档标识并与更新的信息相关联。 由于web索引中的信息是通过文档标识而不是通过网页地址来存储的,所以文档标识可以替换网页地址。 接收的每个元字符与相应的文档标识和相关联的更新信息匹配,这创建了信息的反向格式。 网络索引现在可以被系统更新和存储。
-
公开(公告)号:US20080294602A1
公开(公告)日:2008-11-27
申请号:US11753699
申请日:2007-05-25
申请人: PRATIBHA PERMANDLA , GAURAV SAREEN , GIRISH KUMAR , JUNHUA WANG , ROHIT V. WAD , WILLIAM D. RAMSEY
发明人: PRATIBHA PERMANDLA , GAURAV SAREEN , GIRISH KUMAR , JUNHUA WANG , ROHIT V. WAD , WILLIAM D. RAMSEY
IPC分类号: G06F17/30
CPC分类号: G06F17/30705 , G06F17/30864
摘要: Systems, methods, computer-readable media, and graphical user interfaces for presenting search results having collapsed domains are provided. A search result obtaining module obtains search results based upon a received query. Upon obtaining the search results, search results having the same domain are associated with one another. Thereafter, search result clusters of associated search results are formed. In some embodiments, the search result clusters may be formatted to include desired search result cluster attributes. The search result clusters are presented such that two or more associated search results form a single cluster of search results rather than being presented individually. In some embodiments, an option to view more search results with the same domain may be provided.
摘要翻译: 提供了用于呈现具有折叠域的搜索结果的系统,方法,计算机可读介质和图形用户界面。 搜索结果获取模块基于接收到的查询获得搜索结果。 在获得搜索结果时,具有相同域的搜索结果彼此相关联。 此后,形成关联搜索结果的搜索结果集群。 在一些实施例中,搜索结果集群可以被格式化以包括期望的搜索结果集群属性。 呈现搜索结果集群,使得两个或多个相关联的搜索结果形成单个搜索结果集,而不是单独呈现。 在一些实施例中,可以提供查看具有相同域的更多搜索结果的选项。
-
公开(公告)号:US20110202541A1
公开(公告)日:2011-08-18
申请号:US12705207
申请日:2010-02-12
申请人: PRATIBHA PERMANDLA , YINZHE YU , GAURAV SAREEN , ABHAS KUMAR
发明人: PRATIBHA PERMANDLA , YINZHE YU , GAURAV SAREEN , ABHAS KUMAR
CPC分类号: G06F17/30613
摘要: Systems and methods for performing an updating process to an in-memory index are provided. Upon receiving notice of document modifications covered by an inverted index associated with a search engine, in the form of an update file, a representation of the modification is published onto various index serving machines. Each index serving machine receiving the update file determines if the modifications are applicable to the index serving machine. If an index serving machine determines that it contains mapping information corresponding to the modified documents, the index serving machine utilizes the update file and associated mapping information to update an in-memory index. In embodiments, the in-memory index is used to provide results to user queries in tandem with the inverted index. In some embodiments, an extra in-memory index is maintained that is revised with constantly incoming metadata updates and the existing in-memory index is periodically swapped with the revised in-memory index.
摘要翻译: 提供了用于对存储器内索引进行更新处理的系统和方法。 在接收到与搜索引擎相关联的反向索引覆盖的文档修改的通知时,以更新文件的形式,将修改的表示发布到各种索引服务机器上。 接收更新文件的每个索引服务机器确定修改是否适用于索引服务机器。 如果索引服务机器确定其包含与修改的文档相对应的映射信息,则索引服务机器利用更新文件和相关联的映射信息来更新内存中索引。 在实施例中,使用存储器内索引来提供与反向索引一起的用户查询的结果。 在一些实施例中,维护额外的内存中索引,该额外内存索引通过不断传入的元数据更新进行修改,并且使用修改的内存内索引周期性地交换现有的内存中索引。
-
公开(公告)号:US20110258198A1
公开(公告)日:2011-10-20
申请号:US13169807
申请日:2011-06-27
申请人: WALTER SUN , JAY KUMAR GOYAL , PRATIBHA PERMANDLA , YINZHE YU , JINGFENG LI
发明人: WALTER SUN , JAY KUMAR GOYAL , PRATIBHA PERMANDLA , YINZHE YU , JINGFENG LI
IPC分类号: G06F17/30
CPC分类号: G06F17/30702 , G06F17/30867
摘要: Systems and methods for applying user behavior data to improve serach query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.
摘要翻译: 提供了用于应用用户行为数据以改进serach查询结果排名的系统和方法。 在接收到指示最近的重要用户行为数据可用于与反向索引相关联的文档的更新文件时,更新文件被周期性地且频繁地发布到索引服务器。 在从更新文件中滤除相关更新信息之后,索引服务器提取具有相关用户行为数据的文档的标识符。 更新文件和文档的标识符用于更新包含表示用户行为的元数据的内存中索引。 内存中索引被不断更新并用于响应于用户搜索查询来提供搜索查询结果。 来自内存中索引的搜索查询结果使用用户行为数据进行排序。 因此,与最近的重要用户行为元数据相关联的结果在搜索结果页面上接收突出的位置。
-
-
-