-
Publication No.: US08244701B2
Publication date: 2012-08-14
Application No.: US13169807
Filing date: 2011-06-27
Applicants: Walter Sun, Jay Kumar Goyal, Pratibha Permandla, Yinzhe Yu, Jingfeng Li
Inventors: Walter Sun, Jay Kumar Goyal, Pratibha Permandla, Yinzhe Yu, Jingfeng Li
CPC classification: G06F17/30702, G06F17/30867
Abstract: Systems and methods for applying user behavior data to improve search query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.
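The flow in this abstract (filter an update file down to the documents a server holds, refresh an in-memory index, then rank with the behavior data) can be sketched as follows. This is a minimal illustration, not the patented method: the `BehaviorIndex` class, the numeric behavior score, and the additive `weight` are all assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class BehaviorIndex:
    """Hypothetical in-memory map from document ID to a user-behavior score."""
    owned_docs: Set[str]
    scores: Dict[str, float] = field(default_factory=dict)

    def apply_update(self, update: Dict[str, float]) -> List[str]:
        """Keep only entries for documents this server holds; return their IDs."""
        relevant = {doc: s for doc, s in update.items() if doc in self.owned_docs}
        self.scores.update(relevant)
        return sorted(relevant)

    def rank(self, results: List[Tuple[str, float]], weight: float = 1.0) -> List[str]:
        """Order (doc_id, base_score) pairs by base score plus weighted behavior score."""
        def combined(item: Tuple[str, float]) -> float:
            doc, base = item
            return base + weight * self.scores.get(doc, 0.0)
        return [doc for doc, _ in sorted(results, key=combined, reverse=True)]


index = BehaviorIndex(owned_docs={"d1", "d2", "d3"})
updated = index.apply_update({"d2": 0.9, "d9": 0.5})  # d9 is not held by this server
ranked = index.rank([("d1", 0.6), ("d2", 0.5), ("d3", 0.4)])
```

Here `d2` moves ahead of `d1` because its fresh behavior score outweighs the small gap in base score, while the entry for `d9` is ignored.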
-
Publication No.: US20120059806A1
Publication date: 2012-03-08
Application No.: US13292793
Filing date: 2011-11-09
IPC classification: G06F17/30
CPC classification: G06F17/30336, G06F17/30613
Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.
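One way to picture the DUI mechanism in this abstract is a small overlay keyed by document identifier and relative position, consulted ahead of the merged index at search time. The sketch below rests on several assumptions: the `DuiObject` fields, the dictionary layout, and the "overlay wins" lookup rule are illustrative choices, and the abstract does not say how the association process actually groups the two indexes.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

Key = Tuple[str, int]  # (document ID, relative position)


@dataclass(frozen=True)
class DuiObject:
    """Hypothetical message: document ID, relative position, and metadata."""
    doc_id: str
    position: int
    metadata: str


class OverloadingIndex:
    """Structured records of recent modifications, keyed by (doc_id, position)."""

    def __init__(self) -> None:
        self.records: Dict[Key, str] = {}

    def apply(self, dui: DuiObject) -> None:
        # Only the targeted slot changes; the rest of the index is untouched.
        self.records[(dui.doc_id, dui.position)] = dui.metadata


def lookup(merged: Dict[Key, str], overlay: OverloadingIndex,
           doc_id: str, position: int) -> Optional[str]:
    """Prefer the recent record, else fall back to the merged index."""
    key = (doc_id, position)
    return overlay.records.get(key, merged.get(key))


merged_index = {("doc1", 0): "old title", ("doc1", 1): "old body"}
overlay = OverloadingIndex()
overlay.apply(DuiObject("doc1", 1, "new body"))
```

After the update, a lookup for position 1 of `doc1` returns the new metadata while position 0 still comes from the merged index, which itself is never rewritten.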
-
Publication No.: US20110258198A1
Publication date: 2011-10-20
Application No.: US13169807
Filing date: 2011-06-27
Applicants: WALTER SUN, JAY KUMAR GOYAL, PRATIBHA PERMANDLA, YINZHE YU, JINGFENG LI
Inventors: WALTER SUN, JAY KUMAR GOYAL, PRATIBHA PERMANDLA, YINZHE YU, JINGFENG LI
IPC classification: G06F17/30
CPC classification: G06F17/30702, G06F17/30867
Abstract: Systems and methods for applying user behavior data to improve search query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.
-
Publication No.: US20100205160A1
Publication date: 2010-08-12
Application No.: US12368771
Filing date: 2009-02-10
IPC classification: G06F17/30
CPC classification: G06F17/30336, G06F17/30613
Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.
-
Publication No.: US20090299962A1
Publication date: 2009-12-03
Application No.: US12127949
Filing date: 2008-05-28
Applicants: PRATIBHA PERMANDLA, GAURAV SAREEN
Inventors: PRATIBHA PERMANDLA, GAURAV SAREEN
IPC classification: G06F17/30
CPC classification: G06F17/30864
Abstract: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.
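The inversion step in this abstract (swap each web page address for its document identification, then group by meta word) might look like the sketch below. The URL table, the `(url, meta_word, value)` row shape, and the function name `invert_updates` are assumptions made for illustration; locations in the web index are left out to keep the example short.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical table mapping web page addresses to document identifications.
URL_TO_DOC_ID = {"http://a.example/": 101, "http://b.example/": 102}


def invert_updates(updates: List[Tuple[str, str, str]],
                   url_to_doc_id: Dict[str, int]) -> Dict[str, List[Tuple[int, str]]]:
    """Turn (url, meta_word, value) rows into meta_word -> [(doc_id, value), ...]."""
    inverted: Dict[str, List[Tuple[int, str]]] = defaultdict(list)
    for url, meta_word, value in updates:
        doc_id = url_to_doc_id[url]  # the document ID replaces the web page address
        inverted[meta_word].append((doc_id, value))
    return dict(inverted)


updates = [
    ("http://a.example/", "lang", "en"),
    ("http://b.example/", "lang", "fr"),
    ("http://a.example/", "rating", "4"),
]
inverted = invert_updates(updates, URL_TO_DOC_ID)
```

The result is keyed by meta word rather than by page, which is the "inverted format" the abstract refers to.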
-
Publication No.: US08756206B2
Publication date: 2014-06-17
Application No.: US13292793
Filing date: 2011-11-09
IPC classification: G06F17/30
CPC classification: G06F17/30336, G06F17/30613
Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.
-
Publication No.: US08244700B2
Publication date: 2012-08-14
Application No.: US12705207
Filing date: 2010-02-12
Applicants: Pratibha Permandla, Yinzhe Yu, Guarav Sareen, Abhas Kumar
Inventors: Pratibha Permandla, Yinzhe Yu, Guarav Sareen, Abhas Kumar
CPC classification: G06F17/30613
Abstract: Systems and methods for performing an updating process to an in-memory index are provided. Upon receiving notice of document modifications covered by an inverted index associated with a search engine, in the form of an update file, a representation of the modification is published onto various index serving machines. Each index serving machine receiving the update file determines if the modifications are applicable to the index serving machine. If an index serving machine determines that it contains mapping information corresponding to the modified documents, the index serving machine utilizes the update file and associated mapping information to update an in-memory index. In embodiments, the in-memory index is used to provide results to user queries in tandem with the inverted index. In some embodiments, an extra in-memory index is maintained that is revised with constantly incoming metadata updates and the existing in-memory index is periodically swapped with the revised in-memory index.
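The last sentence of this abstract describes what is commonly called double buffering: updates accumulate in a spare index, which is periodically swapped in for the one serving queries. The sketch below is one possible reading, assuming a hypothetical `SwappableIndex` class, plain dictionaries for both copies, and a lock around writes; none of these details come from the patent.

```python
import threading
from typing import Dict, Optional


class SwappableIndex:
    """Hypothetical pair of in-memory indexes: one serves, one collects updates."""

    def __init__(self) -> None:
        self._active: Dict[str, str] = {}  # answers queries
        self._shadow: Dict[str, str] = {}  # receives incoming metadata updates
        self._lock = threading.Lock()

    def update(self, doc_id: str, metadata: str) -> None:
        with self._lock:
            self._shadow[doc_id] = metadata

    def swap(self) -> None:
        """Promote the revised index and start a fresh copy for later updates."""
        with self._lock:
            self._active = self._shadow
            self._shadow = dict(self._active)

    def get(self, doc_id: str) -> Optional[str]:
        # Readers see whichever index is active at the time of the call.
        return self._active.get(doc_id)


idx = SwappableIndex()
idx.update("d1", "fresh")
before = idx.get("d1")  # update not yet visible to queries
idx.swap()
after = idx.get("d1")   # visible once the revised index is swapped in
```

The design keeps query reads free of partial updates: a query either sees the index as it stood before the swap or the fully revised one.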
-
Publication No.: US20080294602A1
Publication date: 2008-11-27
Application No.: US11753699
Filing date: 2007-05-25
Applicants: PRATIBHA PERMANDLA, GAURAV SAREEN, GIRISH KUMAR, JUNHUA WANG, ROHIT V. WAD, WILLIAM D. RAMSEY
Inventors: PRATIBHA PERMANDLA, GAURAV SAREEN, GIRISH KUMAR, JUNHUA WANG, ROHIT V. WAD, WILLIAM D. RAMSEY
IPC classification: G06F17/30
CPC classification: G06F17/30705, G06F17/30864
Abstract: Systems, methods, computer-readable media, and graphical user interfaces for presenting search results having collapsed domains are provided. A search result obtaining module obtains search results based upon a received query. Upon obtaining the search results, search results having the same domain are associated with one another. Thereafter, search result clusters of associated search results are formed. In some embodiments, the search result clusters may be formatted to include desired search result cluster attributes. The search result clusters are presented such that two or more associated search results form a single cluster of search results rather than being presented individually. In some embodiments, an option to view more search results with the same domain may be provided.
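Collapsing results by domain, as this abstract describes, amounts to grouping result URLs that share a host. The sketch below is illustrative only: it treats the URL hostname as the "domain" and keeps clusters in order of first appearance, both of which are assumptions, and `cluster_by_domain` is a name introduced here.

```python
from typing import Dict, List
from urllib.parse import urlsplit


def cluster_by_domain(urls: List[str]) -> List[List[str]]:
    """Group result URLs by hostname, keeping the order of first appearance."""
    clusters: Dict[str, List[str]] = {}
    for url in urls:
        host = urlsplit(url).hostname or ""
        clusters.setdefault(host, []).append(url)
    return list(clusters.values())


results = [
    "http://a.example/page1",
    "http://b.example/home",
    "http://a.example/page2",
]
clusters = cluster_by_domain(results)
```

The two `a.example` results end up in a single cluster ahead of the `b.example` result, so a results page could show one entry per cluster with an option to expand it.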
-
Publication No.: US08224841B2
Publication date: 2012-07-17
Application No.: US12127949
Filing date: 2008-05-28
Applicants: Pratibha Permandla, Gaurav Sareen
Inventors: Pratibha Permandla, Gaurav Sareen
IPC classification: G06F17/30
CPC classification: G06F17/30864
Abstract: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.
-
Publication No.: US08082258B2
Publication date: 2011-12-20
Application No.: US12368771
Filing date: 2009-02-10
CPC classification: G06F17/30336, G06F17/30613
Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.