UPDATING AN INVERTED INDEX IN A REAL TIME FASHION
    1.
    Type: Patent application
    Status: In force

    Publication No.: US20100205160A1

    Publication date: 2010-08-12

    Application No.: US12368771

    Filing date: 2009-02-10

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.
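
    The update path described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names DuiObject, OverloadingIndex, and lookup, and the choice to key records by (document id, position), are assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DuiObject:
    """Hypothetical message: document id, relative position, new metadata."""
    doc_id: int
    position: int
    metadata: str


@dataclass
class OverloadingIndex:
    """Holds recent modifications keyed by (doc_id, position) (assumed layout)."""
    records: dict = field(default_factory=dict)

    def apply(self, dui: DuiObject) -> None:
        # Only the targeted record is written; the rest of the index is untouched.
        self.records[(dui.doc_id, dui.position)] = dui.metadata


def lookup(merged: dict, overlay: OverloadingIndex, doc_id: int, position: int):
    """Search view grouping both indexes: a recent modification wins."""
    key = (doc_id, position)
    if key in overlay.records:
        return overlay.records[key]
    return merged.get(key)


# The merged index stays read-only; the overlay absorbs the change.
merged_index = {(1, 0): "old title", (1, 1): "old body", (2, 0): "other"}
overlay = OverloadingIndex()
overlay.apply(DuiObject(doc_id=1, position=1, metadata="new body"))
```

    Under these assumptions a search consults the overlay first and falls back to the merged index, so a modification becomes visible without rewriting either index in full.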

    UPDATING AN INVERTED INDEX IN A REAL TIME FASHION
    2.
    Type: Patent application
    Status: In force

    Publication No.: US20120059806A1

    Publication date: 2012-03-08

    Application No.: US13292793

    Filing date: 2011-11-09

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.

    Updating an inverted index in a real time fashion
    3.
    Type: Granted patent
    Status: In force

    Publication No.: US08756206B2

    Publication date: 2014-06-17

    Application No.: US13292793

    Filing date: 2011-11-09

    IPC classification: G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.

    Updating an inverted index in a real time fashion
    4.
    Type: Granted patent
    Status: In force

    Publication No.: US08082258B2

    Publication date: 2011-12-20

    Application No.: US12368771

    Filing date: 2009-02-10

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30336 G06F17/30613

    Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index. Periodically, an association process is invoked for grouping the merged index with the overloading index for search purposes.

    Dynamic update of a web index
    5.
    Type: Granted patent
    Status: In force

    Publication No.: US08224841B2

    Publication date: 2012-07-17

    Application No.: US12127949

    Filing date: 2008-05-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: Systems and methods are provided for regularly updating a web index with new or updated content, such as meta words or meta streams, for a particular web page address, such as a URL. Web page addresses and associated updated information, such as meta words, meta streams, values, and locations in the web index for those meta words are received. In order to update a web index, which is used by search engines to search web documents, a document identification is retrieved and associated with the updated information. As information in the web index is stored by document identification and not by web page addresses, the document identification may replace the web page address. Each meta word received is matched with corresponding document identifications and associated updated information, which creates an inverted format of the information. The web index may now be updated and stored by the system.
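
    The inversion step in this abstract (replace each URL with its document identification, then regroup the updates by meta word) can be sketched as below. The function name invert_updates, the example URLs, and the dictionary shapes are assumptions for illustration only.

```python
from collections import defaultdict


def invert_updates(updates, url_to_doc_id):
    """Turn per-URL meta word updates into per-meta-word postings."""
    inverted = defaultdict(list)
    for url, meta_words in updates.items():
        # The document identification replaces the web page address.
        doc_id = url_to_doc_id[url]
        for word, value in meta_words.items():
            inverted[word].append((doc_id, value))
    return dict(inverted)


# Hypothetical lookup table and incoming updates.
url_to_doc_id = {"http://example.com/a": 7, "http://example.com/b": 9}
updates = {
    "http://example.com/a": {"lang": "en", "category": "news"},
    "http://example.com/b": {"lang": "fr"},
}
inverted = invert_updates(updates, url_to_doc_id)
```

    Each meta word now maps to the documents that carry it, which is the shape an inverted web index needs before the update is stored.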

    Domain collapsing of search results
    6.
    Type: Granted patent
    Status: In force

    Publication No.: US08041709B2

    Publication date: 2011-10-18

    Application No.: US11753699

    Filing date: 2007-05-25

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30705 G06F17/30864

    Abstract: Systems, methods, computer-readable media, and graphical user interfaces for presenting search results having collapsed domains are provided. A search result obtaining module obtains search results based upon a received query. Upon obtaining the search results, search results having the same domain are associated with one another. Thereafter, search result clusters of associated search results are formed. In some embodiments, the search result clusters may be formatted to include desired search result cluster attributes. The search result clusters are presented such that two or more associated search results form a single cluster of search results rather than being presented individually. In some embodiments, an option to view more search results with the same domain may be provided.
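
    A rough sketch of domain collapsing, assuming results arrive as a ranked list of URLs and that a cluster shows at most two results with a count of the remainder. The function name collapse_by_domain, the max_per_cluster parameter, and the output fields are hypothetical.

```python
from urllib.parse import urlparse


def collapse_by_domain(results, max_per_cluster=2):
    """Group ranked result URLs into one cluster per domain."""
    clusters = {}
    for url in results:
        domain = urlparse(url).netloc
        clusters.setdefault(domain, []).append(url)

    collapsed = []
    for domain, urls in clusters.items():
        shown = urls[:max_per_cluster]
        collapsed.append({
            "domain": domain,
            "shown": shown,
            # Count behind a "view more results from this domain" option.
            "more": len(urls) - len(shown),
        })
    return collapsed


ranked = [
    "http://a.com/1",
    "http://b.com/x",
    "http://a.com/2",
    "http://a.com/3",
]
clusters = collapse_by_domain(ranked)
```

    Clusters keep the order in which each domain first appears in the ranking, so the best-ranked domain still leads the page.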

    LEVERAGING LOW-LATENCY MEMORY ACCESS
    7.
    Type: Patent application
    Status: In force

    Publication No.: US20100121865A1

    Publication date: 2010-05-13

    Application No.: US12269877

    Filing date: 2008-11-12

    IPC classification: G06F17/30 G06F9/46

    CPC classification: G06F17/30979 G06F9/5066

    Abstract: Computational units of any task may run in different silos. In an embodiment, a search query may be evaluated efficiently on a non-uniform memory architecture (NUMA) machine, by assigning separate chunks of the index to separate memories. In a NUMA machine, each socket has an attached memory. The latency time is low or high, depending on whether a processor accesses data in its attached memory or a different memory. Copies of an index manager program, which compares a query to an index, run separately on different processors in a NUMA machine. Each instance of the index manager compares the query to the index chunk in the memory attached to the processor on which that instance is running. Thus, each instance of the index manager may compare a query to a particular portion of the index using low-latency accesses, thereby increasing the efficiency of the search.
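
    The partitioning idea can be illustrated with one worker per index chunk, each comparing the query only against its own chunk. This sketch shows the split-and-merge structure only: it uses threads from the Python standard library and does not place memory on NUMA nodes or pin workers to sockets, which would need platform-specific facilities. The names search_chunk and search_all are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def search_chunk(chunk, term):
    """One 'index manager' instance: look the term up in its own chunk only."""
    return sorted(chunk.get(term, []))


def search_all(chunks, term):
    """Run one worker per chunk and merge the partial posting lists."""
    with ThreadPoolExecutor(max_workers=len(chunks)) as pool:
        partials = pool.map(search_chunk, chunks, [term] * len(chunks))
        merged = []
        for postings in partials:
            merged.extend(postings)
    return sorted(merged)


# Two hypothetical chunks; on a NUMA machine each would live in the memory
# attached to the socket that runs its worker.
chunks = [
    {"index": [1, 4], "numa": [4]},
    {"index": [8], "query": [9]},
]
hits = search_all(chunks, "index")
```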

    SUPPORT FOR REVERSE AND STEMMED HIT-HIGHLIGHTING
    8.
    Type: Patent application
    Status: Lapsed

    Publication No.: US20080177717A1

    Publication date: 2008-07-24

    Application No.: US11625076

    Filing date: 2007-01-19

    IPC classification: G06F7/00

    CPC classification: G06F17/3064 Y10S707/99933

    Abstract: Computerized methods and systems for generating a suggested query list with suggested search terms displayed as highlighted text utilizing a user-defined query are provided. Query search terms are received by a user-interface display. Upon inputting query search terms, the user-interface automatically generates a suggested query list. The suggested query list is associated with the query search term and the suggested query list is comprised of at least one suggested search term. A query suggestion architecture determines if the query search term and the suggested search term are a match, and if so, highlights the suggested search term that is not a match. The user interface displays the highlighted terms to assist in refining a search. The present invention further provides a stemming algorithm that extracts the root form of the query search term.
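
    One way to read "reverse" hit-highlighting is that the words of a suggestion which the user did not type are the ones highlighted, with matching done on word stems. The sketch below follows that reading as an assumption. The stemmer is a deliberately naive suffix stripper, not the algorithm of the patent, and the <b> markers are a placeholder for whatever highlighting the interface uses.

```python
def stem(word):
    """Very naive suffix stripping; a real system would use a proper stemmer."""
    word = word.lower()
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def reverse_highlight(query, suggestion):
    """Highlight the suggestion words whose stem is NOT in the query."""
    query_stems = {stem(w) for w in query.split()}
    marked = []
    for w in suggestion.split():
        marked.append(w if stem(w) in query_stems else f"<b>{w}</b>")
    return " ".join(marked)


highlighted = reverse_highlight("camping tents", "camp tent for sale")
```

    Here "camp" and "tent" match the stems of the typed words and stay plain, while "for" and "sale" are marked as the new part of the suggestion.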

    Serving cached query results based on a query portion
    9.
    Type: Patent application
    Status: In force

    Publication No.: US20070203890A1

    Publication date: 2007-08-30

    Application No.: US11363895

    Filing date: 2006-02-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30902

    Abstract: The embodiments contemplate a system and method for obtaining related results for a portion of a query and for generating an updated set of queries for a cache of a server. Other queries beginning with the portion of the query may be identified and obtained from a data structure that includes a server cache and a set of common queries. Once the other queries are identified, results for the other queries are obtained from the server cache or from a back-end database. A set of common queries, which may include deleted and additional queries, may be utilized to generate the updated set of queries for the server. Both missing queries and deleted queries that may belong to the server based on an assignment function are inserted into a queue, which is later delivered to the cache of the server. The transfer may occur during a low-activity or idle state.
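
    The lookup half of this abstract (find other queries that begin with the typed portion, then serve them from the cache or fall back to the back end) might look like the sketch below. The function names, the dictionary-based cache, and the backend callable are assumptions; the queue that refreshes the cache during idle periods is not modeled.

```python
def queries_with_prefix(prefix, cached_results, common_queries):
    """Candidate queries from the server cache and the common-query set."""
    candidates = set(cached_results) | set(common_queries)
    return sorted(q for q in candidates if q.startswith(prefix))


def related_results(prefix, cached_results, common_queries, backend):
    """Serve each candidate from the cache, else from the back-end lookup."""
    out = {}
    for q in queries_with_prefix(prefix, cached_results, common_queries):
        if q in cached_results:
            out[q] = cached_results[q]
        else:
            out[q] = backend(q)
    return out


# Hypothetical cache contents, common queries, and back-end stub.
cache = {"weather seattle": ["r1"], "web index": ["r2"]}
common = ["weather seattle", "weather boston", "news"]
backend_calls = []


def backend(q):
    backend_calls.append(q)
    return [f"backend:{q}"]


results = related_results("wea", cache, common, backend)
```

    Only "weather boston" reaches the back end in this example, since "weather seattle" is already cached.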

    Using anchor text to provide context
    10.
    Type: Granted patent
    Status: In force

    Publication No.: US08458207B2

    Publication date: 2013-06-04

    Application No.: US11522227

    Filing date: 2006-09-15

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30528 G06F17/30864

    Abstract: A search engine can provide referencing information as context for a particular search result when an excerpt from the search result, comprising at least some similar elements to the user's query, is not generated. Referencing information can include one or more anchor texts having similarity to at least some elements of the user's query, the anchor texts being used by referencing pages to link to the page returned as a search result. User selection of the anchor text can enable the user to visit a referencing page using that anchor text to link to the page returned as a search result, and having a high static rank.
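
    A minimal sketch of the fallback described here, assuming word overlap as the similarity measure and a numeric static rank per referencing page. The function name choose_context, the tuple layout, and the example URLs are hypothetical.

```python
def choose_context(query, page_text, anchors):
    """Pick an anchor text as context when no excerpt matches the query.

    anchors: list of (anchor_text, referencing_url, static_rank) tuples.
    Returns None when an ordinary excerpt can be generated, or when no
    anchor text shares a word with the query.
    """
    terms = set(query.lower().split())
    if terms & set(page_text.lower().split()):
        return None  # the page itself contains query words

    def overlap(anchor):
        return len(terms & set(anchor[0].lower().split()))

    matching = [a for a in anchors if overlap(a) > 0]
    # Prefer the closest anchor text, then the higher static rank.
    matching.sort(key=lambda a: (overlap(a), a[2]), reverse=True)
    return matching[0] if matching else None


anchors = [
    ("cheap flights to Paris", "http://ref1.example", 0.2),
    ("cheap flights deals", "http://ref2.example", 0.9),
    ("hotels", "http://ref3.example", 0.99),
]
context = choose_context("cheap flights", "Book your trip today", anchors)
```

    The chosen tuple carries the referencing URL, so the anchor text can be shown as a link back to the high-ranked page that uses it.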
