Using behavior data to quickly improve search ranking
    1.
    Granted invention patent
    Using behavior data to quickly improve search ranking (in force)

    Publication No.: US08244701B2

    Publication date: 2012-08-14

    Application No.: US13169807

    Filing date: 2011-06-27

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    CPC classification: G06F17/30702 G06F17/30867

    Abstract: Systems and methods for applying user behavior data to improve search query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.

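The flow in the abstract above (update file published to an index server, filtering to the documents that server holds, updating an in-memory index, then ranking with the behavior data) can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the `IndexServer` class, the `(doc_id, signal)` update-file layout, and the additive scoring with a `weight` parameter are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class IndexServer:
    """Hypothetical index server that covers a subset of documents."""
    doc_ids: set[int]  # documents this server is responsible for
    # In-memory index: doc id -> behavior signal (e.g. a click-derived score).
    behavior: dict[int, float] = field(default_factory=dict)

    def apply_update_file(self, update_file: list[tuple[int, float]]) -> int:
        """Keep only entries for documents on this server; return how many applied."""
        applied = 0
        for doc_id, signal in update_file:
            if doc_id in self.doc_ids:
                self.behavior[doc_id] = signal
                applied += 1
        return applied

    def rank(self, candidates: dict[int, float], weight: float = 1.0) -> list[int]:
        """Order candidate doc ids by base score plus weighted behavior signal."""
        def score(doc_id: int) -> float:
            return candidates[doc_id] + weight * self.behavior.get(doc_id, 0.0)
        return sorted(candidates, key=score, reverse=True)


server = IndexServer(doc_ids={1, 2, 3})
# Document 7 is not held by this server, so its entry is filtered out.
applied = server.apply_update_file([(2, 0.9), (7, 0.5)])
# Base relevance scores per document; document 2 gains from its behavior signal.
ranking = server.rank({1: 0.6, 2: 0.5, 3: 0.4})
print(applied, ranking)
```

In this sketch the document with recent behavior data moves ahead of documents with higher base scores, which is the "prominent placement" effect the abstract describes.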

    UPDATING AN INVERTED INDEX IN A REAL TIME FASHION
    2.
    Invention patent application
    UPDATING AN INVERTED INDEX IN A REAL TIME FASHION (in force)

    Publication No.: US20120059806A1

    Publication date: 2012-03-08

    Application No.: US13292793

    Filing date: 2011-11-09

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.

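The idea of a DUI object that targets one portion of an overloading index can be illustrated with a small sketch. The names `DuiObject`, `apply_dui`, and `lookup`, and the nested-dictionary layout (document id, then relative position), are assumptions chosen for this example. Overlaying the overloading index on the merged index at lookup time is only one possible reading of the "association process" mentioned in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DuiObject:
    """Hypothetical message: which document changed, where, and the new content."""
    doc_id: int
    position: int  # relative position of the modified content in the document
    content: str


def apply_dui(overloading_index: dict, dui: DuiObject) -> None:
    """Update only the targeted (doc_id, position) slot, leaving the rest untouched."""
    overloading_index.setdefault(dui.doc_id, {})[dui.position] = dui.content


def lookup(merged_index: dict, overloading_index: dict, doc_id: int) -> dict:
    """Return content by position, with recent modifications overlaid on the merged index."""
    view = dict(merged_index.get(doc_id, {}))
    view.update(overloading_index.get(doc_id, {}))
    return view


merged_index = {10: {0: "old title", 1: "body text"}}
overloading_index: dict = {}
apply_dui(overloading_index, DuiObject(doc_id=10, position=0, content="new title"))
current = lookup(merged_index, overloading_index, 10)
print(current)
```

The merged index is never rewritten here; only the single slot named by the DUI object changes, and searches see the combined view.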

    USING BEHAVIOR DATA TO QUICKLY IMPROVE SEARCH RANKING
    3.
    Invention patent application
    USING BEHAVIOR DATA TO QUICKLY IMPROVE SEARCH RANKING (in force)

    Publication No.: US20110258198A1

    Publication date: 2011-10-20

    Application No.: US13169807

    Filing date: 2011-06-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30702 G06F17/30867

    Abstract: Systems and methods for applying user behavior data to improve search query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.


    UPDATING AN INVERTED INDEX IN A REAL TIME FASHION
    4.
    Invention patent application
    UPDATING AN INVERTED INDEX IN A REAL TIME FASHION (in force)

    Publication No.: US20100205160A1

    Publication date: 2010-08-12

    Application No.: US12368771

    Filing date: 2009-02-10

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.


    DYNAMIC UPDATE OF A WEB INDEX
    5.
    Invention patent application
    DYNAMIC UPDATE OF A WEB INDEX (in force)

    Publication No.: US20090299962A1

    Publication date: 2009-12-03

    Application No.: US12127949

    Filing date: 2008-05-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.

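The conversion described above, from updates keyed by web page address to an inverted format keyed by meta word, might look like the following sketch. The function name `invert_updates`, the `url_to_doc_id` lookup table, the example URLs, and the choice to skip unknown URLs are all assumptions for illustration; the abstract does not specify these details.

```python
from collections import defaultdict


def invert_updates(updates, url_to_doc_id):
    """Turn (url, {meta_word: value}) updates into {meta_word: [(doc_id, value), ...]}."""
    inverted = defaultdict(list)
    for url, meta_words in updates:
        # The index is keyed by document identification, so replace the URL with it.
        doc_id = url_to_doc_id.get(url)
        if doc_id is None:
            continue  # assumption for this sketch: unknown URLs are skipped
        for word, value in meta_words.items():
            inverted[word].append((doc_id, value))
    return dict(inverted)


url_to_doc_id = {"http://example.com/a": 1, "http://example.com/b": 2}
updates = [
    ("http://example.com/a", {"lang": "en", "clicks": 12}),
    ("http://example.com/b", {"lang": "fr"}),
]
inverted = invert_updates(updates, url_to_doc_id)
print(inverted)
```

Each meta word ends up with the list of document identifications it applies to, which is the shape an inverted index expects.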

    Updating an inverted index in a real time fashion
    6.
    Granted invention patent
    Updating an inverted index in a real time fashion (in force)

    Publication No.: US08756206B2

    Publication date: 2014-06-17

    Application No.: US13292793

    Filing date: 2011-11-09

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.


    Rapid update of index metadata
    7.
    Granted invention patent
    Rapid update of index metadata (in force)

    Publication No.: US08244700B2

    Publication date: 2012-08-14

    Application No.: US12705207

    Filing date: 2010-02-12

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    CPC classification: G06F17/30613

    Abstract: Systems and methods for performing an updating process to an in-memory index are provided. Upon receiving notice of document modifications covered by an inverted index associated with a search engine, in the form of an update file, a representation of the modification is published onto various index serving machines. Each index serving machine receiving the update file determines if the modifications are applicable to the index serving machine. If an index serving machine determines that it contains mapping information corresponding to the modified documents, the index serving machine utilizes the update file and associated mapping information to update an in-memory index. In embodiments, the in-memory index is used to provide results to user queries in tandem with the inverted index. In some embodiments, an extra in-memory index is maintained that is revised with constantly incoming metadata updates and the existing in-memory index is periodically swapped with the revised in-memory index.

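The last embodiment in the abstract, an extra in-memory index that absorbs updates and is periodically swapped with the serving index, resembles a double-buffering pattern. A minimal single-threaded sketch follows; the `SwappableIndex` class and its method names are hypothetical, and a real index serving machine would also need synchronization that is omitted here.

```python
class SwappableIndex:
    """Hypothetical double-buffered in-memory index (single-threaded sketch)."""

    def __init__(self):
        self.active = {}  # serves queries
        self.shadow = {}  # receives incoming metadata updates

    def update(self, doc_id, metadata):
        """Apply an incoming metadata update to the extra (shadow) index only."""
        self.shadow[doc_id] = metadata

    def swap(self):
        """Promote the revised index to serving; start a new shadow copy from it."""
        self.active = self.shadow
        self.shadow = dict(self.active)  # copy, so later updates do not touch active

    def get(self, doc_id):
        """Serve a lookup from the active index."""
        return self.active.get(doc_id)


index = SwappableIndex()
index.update(5, {"title": "v2"})
before_swap = index.get(5)        # update not visible to queries yet
index.swap()
after_swap = index.get(5)         # visible after the periodic swap
index.update(5, {"title": "v3"})
after_later_update = index.get(5)  # still the swapped-in version until the next swap
print(before_swap, after_swap, after_later_update)
```

Queries always read a stable snapshot, and the cost of making updates visible is a single reference swap plus a copy for the next round.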

    DOMAIN COLLAPSING OF SEARCH RESULTS
    8.
    Invention patent application
    DOMAIN COLLAPSING OF SEARCH RESULTS (in force)

    Publication No.: US20080294602A1

    Publication date: 2008-11-27

    Application No.: US11753699

    Filing date: 2007-05-25

    IPC classification: G06F17/30

    CPC classification: G06F17/30705 G06F17/30864

    Abstract: Systems, methods, computer-readable media, and graphical user interfaces for presenting search results having collapsed domains are provided. A search result obtaining module obtains search results based upon a received query. Upon obtaining the search results, search results having the same domain are associated with one another. Thereafter, search result clusters of associated search results are formed. In some embodiments, the search result clusters may be formatted to include desired search result cluster attributes. The search result clusters are presented such that two or more associated search results form a single cluster of search results rather than being presented individually. In some embodiments, an option to view more search results with the same domain may be provided.

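Grouping results that share a domain into clusters can be sketched as below. This is an assumption-laden illustration: `collapse_by_domain` is a hypothetical helper, the host name returned by Python's standard `urllib.parse.urlparse` stands in for the "domain", and clusters are ordered by the first appearance of each domain in the ranked list.

```python
from urllib.parse import urlparse


def collapse_by_domain(urls):
    """Group result URLs by host, keeping clusters in order of first appearance."""
    clusters = {}
    for url in urls:
        domain = urlparse(url).netloc  # host part, e.g. "a.com"
        clusters.setdefault(domain, []).append(url)
    return list(clusters.values())


results = ["http://a.com/1", "http://b.com/x", "http://a.com/2"]
clusters = collapse_by_domain(results)
print(clusters)
```

A presentation layer could then show the first entry of each cluster and offer the remaining entries behind a "more results from this domain" option, as the abstract suggests.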

    Dynamic update of a web index
    9.
    Granted invention patent
    Dynamic update of a web index (in force)

    Publication No.: US08224841B2

    Publication date: 2012-07-17

    Application No.: US12127949

    Filing date: 2008-05-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.


    Updating an inverted index in a real time fashion
    10.
    Granted invention patent
    Updating an inverted index in a real time fashion (in force)

    Publication No.: US08082258B2

    Publication date: 2011-12-20

    Application No.: US12368771

    Filing date: 2009-02-10

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.
