Library citation integration
    31.
    Granted invention patent
    Library citation integration (in force)

    Publication No.: US07526475B1

    Publication date: 2009-04-28

    Application No.: US11432039

    Filing date: 2006-05-10

    IPC classification: G06F7/00 G06F17/00

    Abstract: An online search system generates an index of documents using index information received from a library. Some documents have restricted access; some documents may not be available online. The search system provides links to documents in the library as well as other sites based on a search, and may include link resolvers received from the library. The search system provides access links to the link resolvers if an identifier, such as a user identification or IP address, matches an affiliation list from the library.
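
    Illustrative sketch (not taken from the patent): the affiliation check described in the abstract could look roughly like the following. The `Library` record, its field names, the CIDR representation of the affiliation list, and the `?id=` resolver URL shape are all assumptions made for this example.

```python
import ipaddress
from dataclasses import dataclass, field


@dataclass
class Library:
    # Hypothetical record; the patent does not specify this layout.
    name: str
    resolver_url: str                                         # link resolver supplied by the library
    affiliated_networks: list = field(default_factory=list)   # CIDR strings (assumed format)
    affiliated_users: set = field(default_factory=set)        # user identifiers


def is_affiliated(library, user_id=None, ip=None):
    """Return True if the user id or the IP address matches the affiliation list."""
    if user_id is not None and user_id in library.affiliated_users:
        return True
    if ip is not None:
        addr = ipaddress.ip_address(ip)
        return any(addr in ipaddress.ip_network(net)
                   for net in library.affiliated_networks)
    return False


def access_links(doc_id, libraries, user_id=None, ip=None):
    """Build resolver links only for libraries the requester is affiliated with."""
    return [f"{lib.resolver_url}?id={doc_id}"
            for lib in libraries
            if is_affiliated(lib, user_id, ip)]
```

    A requester whose address falls outside every listed network, and whose user id is not listed, gets an empty list, so only the ordinary search links would be shown.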

    Query modification
    32.
    Granted invention patent
    Query modification (in force)

    Publication No.: US08819000B1

    Publication date: 2014-08-26

    Application No.: US13461315

    Filing date: 2012-05-01

    IPC classification: G06F17/30

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for query modification. In one aspect, a method includes receiving an original query including a first limitation. First search results responsive to a modified query are obtained, where the first limitation has been omitted from the modified query. One or more common characteristics shared by two or more resources are identified. Each of the two or more resources corresponds to a different highly-ranked result of the first search results. A second modified query including the original query and a second limitation representing the one or more common characteristics is generated. Second search results responsive to the second modified query are obtained. The second search results are provided in a response to the original query.
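
    Illustrative sketch (not taken from the patent): the two-pass flow in the abstract can be mimicked on a toy in-memory corpus. The corpus layout (`id`, `terms`, `attrs`), the term-overlap ranker, the `top_k` cutoff, and the choice to treat an attribute shared by at least two top results as a "common characteristic" are assumptions for this example.

```python
from collections import Counter


def search(corpus, terms, required_attrs=frozenset()):
    """Toy ranker: score is the number of query terms a resource contains.
    Resources lacking any required attribute are filtered out."""
    scored = []
    for res in corpus:
        if not required_attrs <= res["attrs"]:
            continue
        score = len(set(terms) & res["terms"])
        if score:
            scored.append((score, res["id"]))
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [rid for _, rid in scored]


def modify_query(corpus, original_terms, first_limitation, top_k=3):
    by_id = {r["id"]: r for r in corpus}
    # First modified query: the original query with the first limitation omitted.
    relaxed = [t for t in original_terms if t != first_limitation]
    first_results = search(corpus, relaxed)
    # Common characteristics: attributes shared by two or more top-ranked resources.
    counts = Counter(a for rid in first_results[:top_k] for a in by_id[rid]["attrs"])
    common = frozenset(a for a, n in counts.items() if n >= 2)
    # Second modified query: the original query plus the second limitation.
    second_results = search(corpus, original_terms, required_attrs=common)
    return common, second_results
```

    With a corpus in which two of the three top results for the relaxed query share the attribute "animal", the second pass returns only resources carrying that attribute.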

    Search engine cache control
    34.
    Granted invention patent
    Search engine cache control (in force)

    Publication No.: US07840557B1

    Publication date: 2010-11-23

    Application No.: US10845283

    Filing date: 2004-05-12

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F12/0875

    Abstract: A search query containing at least one term is received at a search controller from a query server and preferably normalized and hashed into a representation of the search query. The representation of the search query is transmitted towards a cache containing multiple query result entries. Each query result entry contains a list of documents associated with the previously searched search query. The cache is then searched and query result entries for the search query are sent to the search controller from the cache. Subsequently, it is determined whether the query result entries are current versions for the search query. If the query result entries are not the current versions, then current versions of the query result entries are obtained.
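
    Illustrative sketch (not taken from the patent): a minimal version of the normalize, hash, look up, and check-currency loop. The normalization (lowercasing and collapsing whitespace), the use of SHA-256, the single index version number as the currency test, and the `backend` and `current_version` callables are assumptions for this example.

```python
import hashlib


def query_key(query):
    """Normalize the query and hash it into a fixed-size representation."""
    normalized = " ".join(query.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class QueryCache:
    def __init__(self, backend, current_version):
        self.backend = backend                    # hypothetical: query -> list of documents
        self.current_version = current_version    # hypothetical: () -> index version
        self.entries = {}                         # key -> (version, documents)

    def lookup(self, query):
        key = query_key(query)
        entry = self.entries.get(key)
        version = self.current_version()
        # A cached entry is usable only if it was produced by the current index version.
        if entry is not None and entry[0] == version:
            return entry[1], "hit"
        status = "stale" if entry is not None else "miss"
        docs = self.backend(query)                # obtain the current version of the results
        self.entries[key] = (version, docs)
        return docs, status
```

    Queries that differ only in case or spacing map to the same key, so they share one cache entry.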

    Generating equivalence classes and rules for associating content with document identifiers
    36.
    Granted invention patent
    Generating equivalence classes and rules for associating content with document identifiers (in force)

    Publication No.: US09026566B2

    Publication date: 2015-05-05

    Application No.: US12725381

    Filing date: 2010-03-16

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: A system of reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
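
    Illustrative sketch (not taken from the patent): grouping URLs into equivalence classes once a rule is known. This example assumes document identifiers are URLs, that the cluster name is host plus path, and that the rule is simply a set of content-relevant query parameter names supplied by the caller. How the system generates the rule is not shown here, and the representative is chosen arbitrarily as the shortest, then lexicographically smallest, URL.

```python
from collections import defaultdict
from urllib.parse import urlsplit, parse_qsl


def cluster_name(url):
    """Assumed cluster name: host plus path, ignoring the query string."""
    parts = urlsplit(url)
    return parts.netloc + parts.path


def equivalence_classes(urls, relevant_params):
    """relevant_params maps a cluster name to the set of content-relevant
    parameter names (the cluster's equivalence rule). Clusters without a
    rule keep every parameter, which is the conservative choice."""
    classes = defaultdict(list)
    for url in urls:
        name = cluster_name(url)
        rule = relevant_params.get(name)
        params = parse_qsl(urlsplit(url).query)
        kept = tuple(sorted((k, v) for k, v in params
                            if rule is None or k in rule))
        classes[(name, kept)].append(url)
    # Map each representative identifier to the members of its class.
    return {min(members, key=lambda u: (len(u), u)): sorted(members)
            for members in classes.values()}
```

    Under a rule that marks only `id` as content-relevant, two URLs that differ in a session parameter fall into one class, and a crawler would need to fetch only the representative.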

    Systems and methods for personalizing aggregated news content
    37.
    Granted invention patent
    Systems and methods for personalizing aggregated news content (in force)

    Publication No.: US08676837B2

    Publication date: 2014-03-18

    Application No.: US10748663

    Filing date: 2003-12-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: A system customizes a news document associated with a user of a news aggregation service. The system includes multiple news source servers that store news content and a remote news aggregation server. The news aggregation server creates a customized news document based on one or more personalized search queries received from a user. The news aggregation server fetches the news content from the multiple news source servers, aggregates the news content, and searches the aggregated news content based on the one or more personalized search queries. The news aggregation server provides selected news content to the customized news document based on results of the search.
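
    Illustrative sketch (not taken from the patent): the fetch, aggregate, search, and populate steps reduced to in-memory data. The `sources` mapping stands in for the news source servers, and the article fields (`url`, `title`), the URL-based de-duplication, the all-terms title match, and the `per_section` limit are assumptions for this example.

```python
def aggregate(sources):
    """Merge articles from all sources, dropping repeats of the same URL.
    `sources` maps a source name to a list of article dicts (hypothetical layout)."""
    seen, merged = set(), []
    for name, articles in sources.items():
        for art in articles:
            if art["url"] not in seen:
                seen.add(art["url"])
                merged.append({**art, "source": name})
    return merged


def build_custom_page(sources, personalized_queries, per_section=5):
    """Return one section per personalized query, filled from the aggregated pool."""
    pool = aggregate(sources)
    page = {}
    for query in personalized_queries:
        terms = query.lower().split()
        # Toy search: every query term must appear in the article title.
        hits = [a for a in pool if all(t in a["title"].lower() for t in terms)]
        page[query] = [a["title"] for a in hits[:per_section]]
    return page
```

    Each personalized query becomes one section of the customized document, so a user with the queries "election" and "solar power" would see two sections.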

    Document search in affiliated libraries
    38.
    Granted invention patent
    Document search in affiliated libraries (in force)

    Publication No.: US08473487B1

    Publication date: 2013-06-25

    Application No.: US12419872

    Filing date: 2009-04-07

    IPC classification: G06F17/30

    Abstract: An online search system generates an index of documents using index information received from a library. Some documents have restricted access; some documents may not be available online. The search system provides links to documents in the library as well as other sites based on a search, and may include link resolvers received from the library. The search system provides access links to the link resolvers if an identifier, such as a user identification or IP address, matches an affiliation list from the library.

    Search Engine Cache Control
    39.
    Invention patent application
    Search Engine Cache Control (in force)

    Publication No.: US20110035372A1

    Publication date: 2011-02-10

    Application No.: US12905922

    Filing date: 2010-10-15

    IPC classification: G06F17/30

    CPC classification: G06F12/0875

    Abstract: A search query containing one or more terms is received from a client system. In response to receiving the search query, one or more snippets obtained in response to a prior execution of said search query are requested from a cache. For a respective snippet received from the cache, it is determined whether the respective snippet is a current version. In response to a determination that the respective snippet is not the current version, the current version of the respective snippet is obtained from a corresponding document in which one or more terms from said search query are located and the snippet stored in the cache is updated using the obtained current version. Search query results including the respective snippet are transmitted to the client.
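
    Illustrative sketch (not taken from the patent): snippet-level cache refresh. The per-document version number used as the currency test, the `(version, text)` document store, the fixed-width window around the first matching term, and all class and function names are assumptions for this example.

```python
def make_snippet(text, terms, width=30):
    """Cut a window of `width` characters around the earliest query term found."""
    lower = text.lower()
    positions = [p for p in (lower.find(t.lower()) for t in terms) if p >= 0]
    if not positions:
        return text[:width]
    start = max(0, min(positions) - width // 2)
    return text[start:start + width]


class SnippetCache:
    def __init__(self, documents):
        self.documents = documents   # hypothetical store: doc_id -> (version, text)
        self.cache = {}              # (query, doc_id) -> (version, snippet)

    def snippets_for(self, query, doc_ids):
        terms = query.split()
        results = []
        for doc_id in doc_ids:
            version, text = self.documents[doc_id]
            cached = self.cache.get((query, doc_id))
            # Regenerate the snippet when it is missing or built from an older version.
            if cached is None or cached[0] != version:
                cached = (version, make_snippet(text, terms))
                self.cache[(query, doc_id)] = cached
            results.append((doc_id, cached[1]))
        return results
```

    Only snippets whose source document has changed are rebuilt; the rest are served from the cache unchanged.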

    System for automatically managing duplicate documents when crawling dynamic documents
    40.
    Granted invention patent
    System for automatically managing duplicate documents when crawling dynamic documents (in force)

    Publication No.: US07680773B1

    Publication date: 2010-03-16

    Application No.: US11097687

    Filing date: 2005-03-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: A system of reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
