Keyword-based search engine results using enhanced query strategies
    1.
    发明授权
    Keyword-based search engine results using enhanced query strategies 有权
    基于关键字的搜索引擎结果使用增强查询策略

    公开(公告)号:US08645372B2

    公开(公告)日:2014-02-04

    申请号:US12915213

    申请日:2010-10-29

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30867

    摘要: Enhanced computer- and network-based methods, systems, techniques are provided for retrieving more accurate and responsive search results when searching content for a designated entity using an off-the-shelf keyword-based search engine. For example, the embodiments described herein may be used to improve search results by eliminating off-topic results when presenting queries to an existing keyword-based search engine invoked by means of an API from an intermediating application. Example embodiments provide a Keyword-Based Search Enhancement System (“KBSES”), which enables intermediating applications to obtain information more closely related to user queries by enhancing such queries, on behalf of the user, with disambiguating information when deemed necessary. Based upon a variety of rules and heuristics, which can be modified as well, the KBSES determines whether an entity name in a user's query should be enhanced with additional disambiguating information, and to what extent, to prevent the retrieval of off-topic results.

    摘要翻译: 提供基于计算机和网络的方法,系统和技术,用于在使用现成的基于关键字的搜索引擎搜索指定实体的内容时检索更准确和响应的搜索结果。 例如,本文所述的实施例可以用于通过在向来自中间应用程序的API调用的现有基于关键字的搜索引擎呈现查询时消除脱离主题结果来改进搜索结果。 示例性实施例提供了一种基于关键词的搜索增强系统(“KBSES”),其使中间应用程序能够在认为必要时通过增强这些查询来代替用户消除歧义的信息来获得与用户查询更密切相关的信息。 基于可以修改的各种规则和启发式方法,KBSES确定用户查询中的实体名称是否应增加消除歧义信息,以及在多大程度上防止检索脱离主题的结果。

    KEYWORD-BASED SEARCH ENGINE RESULTS USING ENHANCED QUERY STRATEGIES
    2.
    发明申请
    KEYWORD-BASED SEARCH ENGINE RESULTS USING ENHANCED QUERY STRATEGIES 有权
    基于关键字的搜索引擎结果使用增强的查询策略

    公开(公告)号:US20110119243A1

    公开(公告)日:2011-05-19

    申请号:US12915213

    申请日:2010-10-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867

    摘要: Enhanced computer- and network-based methods, systems, techniques are provided for retrieving more accurate and responsive search results when searching content for a designated entity using an off-the-shelf keyword-based search engine. For example, the embodiments described herein may be used to improve search results by eliminating off-topic results when presenting queries to an existing keyword-based search engine invoked by means of an API from an intermediating application. Example embodiments provide a Keyword-Based Search Enhancement System (“KBSES”), which enables intermediating applications to obtain information more closely related to user queries by enhancing such queries, on behalf of the user, with disambiguating information when deemed necessary. Based upon a variety of rules and heuristics, which can be modified as well, the KBSES determines whether an entity name in a user's query should be enhanced with additional disambiguating information, and to what extent, to prevent the retrieval of off-topic results.

    摘要翻译: 提供基于计算机和网络的方法,系统和技术,用于在使用现成的基于关键字的搜索引擎搜索指定实体的内容时检索更准确和响应的搜索结果。 例如,本文所述的实施例可以用于通过在向来自中间应用程序的API调用的现有基于关键字的搜索引擎呈现查询时消除脱离主题结果来改进搜索结果。 示例性实施例提供了一种基于关键词的搜索增强系统(“KBSES”),其使中间应用程序能够在认为必要时通过增强这些查询来代替用户消除歧义的信息来获得与用户查询更密切相关的信息。 基于可以修改的各种规则和启发式方法,KBSES确定用户查询中的实体名称是否应增加消除歧义信息,以及在多大程度上防止检索脱离主题的结果。

    Category-based content recommendation
    3.
    发明授权
    Category-based content recommendation 有权
    基于类别的内容推荐

    公开(公告)号:US08725739B2

    公开(公告)日:2014-05-13

    申请号:US13286778

    申请日:2011-11-01

    IPC分类号: G06F17/30

    摘要: Techniques for category-based content recommendation are described. Some embodiments provide a content recommendation system (“CRS”) configured to recommend content items (e.g., Web pages, images, videos) that are related to specified categories. In one embodiment, the CRS processes content items to determine entities referenced by the content items, and to determine categories related to the referenced entities. The determined entities and/or categories may be part of a taxonomy that is stored by the CRS. Then, in response to a received request that indicates a category, the CRS determines and provides indications of one or more content items that each have a corresponding category that matches the indicated category. In some embodiments, at least some of these techniques are employed to implement a category-based news service.

    摘要翻译: 描述了基于类别的内容推荐的技术。 一些实施例提供内容推荐系统(“CRS”),其被配置为推荐与指定类别相关的内容项目(例如,网页,图像,视频)。 在一个实施例中,CRS处理内容项目以确定由内容项目引用的实体,并且确定与被引用实体相关的类别。 确定的实体和/或类别可以是由CRS存储的分类法的一部分。 然后,响应于接收到的指示类别的请求,CRS确定并提供每个具有与指示类别匹配的对应类别的一个或多个内容项的指示。 在一些实施例中,使用这些技术中的至少一些来实现基于类别的新闻服务。

    CATEGORY-BASED CONTENT RECOMMENDATION
    4.
    发明申请
    CATEGORY-BASED CONTENT RECOMMENDATION 有权
    基于类别的内容建议

    公开(公告)号:US20120109966A1

    公开(公告)日:2012-05-03

    申请号:US13286778

    申请日:2011-11-01

    IPC分类号: G06F17/30

    摘要: Techniques for category-based content recommendation are described. Some embodiments provide a content recommendation system (“CRS”) configured to recommend content items (e.g., Web pages, images, videos) that are related to specified categories. In one embodiment, the CRS processes content items to determine entities referenced by the content items, and to determine categories related to the referenced entities. The determined entities and/or categories may be part of a taxonomy that is stored by the CRS. Then, in response to a received request that indicates a category, the CRS determines and provides indications of one or more content items that each have a corresponding category that matches the indicated category. In some embodiments, at least some of these techniques are employed to implement a category-based news service.

    摘要翻译: 描述了基于类别的内容推荐的技术。 一些实施例提供内容推荐系统(“CRS”),其被配置为推荐与指定类别相关的内容项目(例如,网页,图像,视频)。 在一个实施例中,CRS处理内容项目以确定由内容项目引用的实体,并且确定与被引用实体相关的类别。 确定的实体和/或类别可以是由CRS存储的分类法的一部分。 然后,响应于接收到的指示类别的请求,CRS确定并提供每个具有与指示类别匹配的对应类别的一个或多个内容项的指示。 在一些实施例中,使用这些技术中的至少一些来实现基于类别的新闻服务。

    Content recommendation based on collections of entities

    公开(公告)号:US09710556B2

    公开(公告)日:2017-07-18

    申请号:US13038192

    申请日:2011-03-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F17/30867

    摘要: Techniques for content recommendation are described. Some embodiments provide a content recommendation system (“CRS”) configured to recommend content items that are related to a collection of entities. A content item may be considered related to a collection of entities based on various factors, including whether and how often the article references or otherwise covers the entities of the collection, the size of the article, other entities that are covered by the article but that are not in the collection, article recency, or article credibility. Recommending content items may also or instead include determining entities that are related to a collection. An entity can be considered related to a collection based on various factors, such as whether the entity is of the same or similar type to entities of the collection, or whether the entity appears in some article in a relationship with one or more entities of the collection.

    CLUSTER-BASED IDENTIFICATION OF NEWS STORIES
    6.
    发明申请
    CLUSTER-BASED IDENTIFICATION OF NEWS STORIES 有权
    新闻故事集群识别

    公开(公告)号:US20120254188A1

    公开(公告)日:2012-10-04

    申请号:US13434600

    申请日:2012-03-29

    IPC分类号: G06F17/30

    摘要: Methods, systems, and techniques for cluster-based content recommendation are described. Some embodiments provide a content recommendation system (“CRS”) configured to recommend news stories about events or occurrences. In some embodiments, a news story about an event includes multiple related content items that each include an account of the event and that each reference one or more entities or categories that are represented by the CRS. In one embodiment, the CRS identifies news stories by generating clusters of related content items. Then, in response to a received query that indicates a keyterm, entity, or category, the CRS determines and provides indications of one or more news stories that are relevant to the received query. In some embodiments, at least some of these techniques are employed to implement a news story recommendation facility in an online news service.

    摘要翻译: 描述了基于群集的内容推荐的方法,系统和技术。 一些实施例提供了被配置为推荐关于事件或事件的新闻故事的内容推荐系统(CRS)。 在一些实施例中,关于事件的新闻故事包括多个相关内容项,每个内容项包括事件的帐户,并且每个引用由CRS表示的一个或多个实体或类别。 在一个实施例中,CRS通过生成相关内容项目的集群来识别新闻故事。 然后,响应于接收到的指示关键字,实体或类别的查询,CRS确定并提供与所接收的查询相关的一个或多个新闻故事的指示。 在一些实施例中,使用这些技术中的至少一些来实现在线新闻服务中的新闻故事推荐设施。

    Method and system for extending keyword searching to syntactically and semantically annotated data
    7.
    发明授权
    Method and system for extending keyword searching to syntactically and semantically annotated data 有权
    将关键字搜索扩展到语法和语义注释数据的方法和系统

    公开(公告)号:US08131540B2

    公开(公告)日:2012-03-06

    申请号:US12401421

    申请日:2009-03-10

    IPC分类号: G06F17/27 G06F7/00 G06F17/30

    摘要: Methods and systems for extending keyword searching techniques to syntactically and semantically annotated data are provided. Example embodiments provide a Syntactic Query Engine (“SQE”) that parses, indexes, and stores a data set as an enhanced document index with document terms as well as information pertaining to the grammatical roles of the terms and ontological and other semantic information. In one embodiment, the enhanced document index is a form of term-clause index, that indexes terms and syntactic and semantic annotations at the clause level. The enhanced document index permits the use of a traditional keyword search engine to process relationship queries as well as to process standard document level keyword searches. In one embodiment, the SQE comprises a Query Processor, a Data Set Preprocessor, a Keyword Search Engine, a Data Set Indexer, an Enhanced Natural Language Parser (“ENLP”), a data set repository, and, in some embodiments, a user interface or an application programming interface.

    摘要翻译: 提供了将关键字搜索技术扩展到语法和语义注释数据的方法和系统。 示例性实施例提供了一种语法查询引擎(“SQE”),其用文档术语解析,索引和存储数据集作为增强文档索引,以及与术语和本体语义信息以及其他语义信息有关的语法角色的信息。 在一个实施例中,增强的文档索引是术语子句索引的形式,其在子句级别对术语和句法和语义注释进行索引。 增强的文档索引允许使用传统的关键字搜索引擎来处理关系查询以及处理标准的文档级关键词搜索。 在一个实施例中,SQE包括查询处理器,数据集预处理器,关键字搜索引擎,数据集索引器,增强自然语言解析器(“ENLP”),数据集存储库,并且在一些实施例中包括用户 接口或应用程序编程接口。

    Method and system for enhanced data searching
    8.
    发明授权
    Method and system for enhanced data searching 有权
    增强数据搜索的方法和系统

    公开(公告)号:US07283951B2

    公开(公告)日:2007-10-16

    申请号:US10007299

    申请日:2001-11-08

    IPC分类号: G06F17/27 G06F17/30

    摘要: Methods and systems for syntactically indexing and searching data sets to achieve more accurate search results are provided. Example embodiments provide a Syntactic Query Engine (“SQE”) that parses, indexes, and stores a data set, as well as processes natural language queries subsequently submitted against the data set. The SQE comprises a Query Preprocessor, a Data Set Preprocessor, a Query Builder, a Data Set Indexer, an Enhanced Natural Language Parser (“ENLP”), a data set repository, and, in some embodiments, a user interface. After preprocessing the data set, the SQE parses the data set and determines the syntactic and grammatical roles of each term to generate enhanced data representations for each object in the data set. The SQE indexes and stores these enhanced data representations in the data set repository. Upon subsequently receiving a query, the SQE parses the query similarly and searches the indexed stored data set to locate data that contains similar terms used in similar grammatical roles. In this manner, the SQE is able to achieve more contextually accurate search results more frequently than using traditional search engines.

    摘要翻译: 提供了用于语法索引和搜索数据集以实现更准确的搜索结果的方法和系统。 示例性实施例提供了解析,索引和存储数据集的语法查询引擎(“SQE”),并且处理随后针对数据集提交的自然语言查询。 SQE包括查询预处理器,数据集预处理器,查询生成器,数据集索引器,增强自然语言解析器(“ENLP”),数据集存储库以及在一些实施例中的用户界面。 在对数据集进行预处理之后,SQE解析数据集并确定每个术语的句法和语法角色,以生成数据集中每个对象的增强数据表示。 SQE索引并将这些增强型数据表示存储在数据集存储库中。 随后接收到查询,SQE类似地解析查询,并搜索索引的存储数据集,以定位包含类似语法角色中使用的类似术语的数据。 以这种方式,SQE能够比使用传统的搜索引擎更频繁地获得更加内容相对准确的搜索结果。

    NLP-based content recommender
    9.
    发明授权
    NLP-based content recommender 有权
    基于NLP的内容推荐器

    公开(公告)号:US08700604B2

    公开(公告)日:2014-04-15

    申请号:US12288349

    申请日:2008-10-16

    IPC分类号: G06F17/30

    摘要: Methods, techniques, and systems for using natural language processing to recommend related content to an associated text segment or document. Example embodiments provide a NLP-based content recommender (“NCR”) which uses NLP-based search techniques, potentially in conjunction with context or other related information, to locate and provide content related to entities that are recognized in the associated material. NCRs may be embedded as widgets, for example on Web pages to assist users in their perusal and search for information, provided by means of browser plug-ins or other application plug-ins, provided in libraries or in standalone environments, or otherwise integrated into other code, programs, or devices. This abstract is provided to comply with rules requiring an abstract, and it is submitted with the intention that it will not be used to interpret or limit the scope or meaning of the claims.

    摘要翻译: 使用自然语言处理方法,技术和系统,将相关内容推荐给相关的文本段或文档。 示例性实施例提供了基于NLP的内容推荐器(“NCR”),其使用基于NLP的搜索技术,潜在地与上下文或其他相关信息相结合来定位和提供与在相关联的材料中被识别的实体相关的内容。 NCR可以作为小部件嵌入,例如在网页上,以帮助用户阅读和搜索信息,通过浏览器插件或其他应用程序插件提供,在库或独立环境中提供,或以其他方式集成 其他代码,程序或设备。 提供本摘要以符合要求摘要的规则,并提交其意图是不会用于解释或限制权利要求书的范围或含义。

    NLP-BASED SYSTEMS AND METHODS FOR PROVIDING QUOTATIONS
    10.
    发明申请
    NLP-BASED SYSTEMS AND METHODS FOR PROVIDING QUOTATIONS 有权
    基于NLP的系统和提供报价的方法

    公开(公告)号:US20110246181A1

    公开(公告)日:2011-10-06

    申请号:US13075799

    申请日:2011-03-30

    IPC分类号: G06F17/27

    摘要: Techniques for providing quotations obtained from text documents using natural language processing techniques are described. Some embodiments provide a content recommendation system (“CRS”) configured to provide quotations by extracting quotations from a corpus text documents, and providing access to the extracted quotations in response to search requests received from users. The CRS may extract quotations by using natural language processing-based techniques to identify one or more entities, such as people, places, objects, concepts, or the like, that are referenced by the extracted quotations. The CRS may then store the extracted quotations along with identified entities, such as quotation speakers and subjects, for later access via search requests.

    摘要翻译: 描述使用自然语言处理技术从文本文档获得报价的技术。 一些实施例提供了一种内容推荐系统(“CRS”),其被配置为通过从语料库文本文档中提取报价来提供报价,并且响应于从用户接收的搜索请求提供对所提取的报价的访问。 CRS可以通过使用基于自然语言处理的技术来提取报价,以识别由提取的报价引用的一个或多个实体,例如人,地点,对象,概念等。 然后,CRS可以将所提取的报价与所识别的实体(例如引号说话者和主题)一起存储,以供稍后通过搜索请求访问。