Bootstrap and adapt a document search engine
    1.
    发明授权
    Bootstrap and adapt a document search engine 有权
    引导和调整文档搜索引擎

    公开(公告)号:US08527534B2

    公开(公告)日:2013-09-03

    申请号:US12726358

    申请日:2010-03-18

    IPC分类号: G06F17/30

    摘要: Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.

    摘要翻译: 采用基于语言建模的建模技术的架构来估计在查询中表达的与用户匹配的文档的概率。 建模技术基于数据挖掘结果,文档的各个部分(例如,主体,标题,URL,锚文本,用户查询)使用不同风格的人类语言。 因此,基于语言的结果可以单独调整以匹配查询语言。 由于该方法是基于适应性的,所以该框架还提供了一种在收集用户数据时逐步修改模型的自然方法。 文档中不同风格的语言可以单独识别和修改。 还使用背景语言模型,在文档具有不完整的数据领域的情况下提供回退方法,并且可以利用知识领域的局部或语义层次结构。

    BOOTSTRAP AND ADAPT A DOCUMENT SEARCH ENGINE
    2.
    发明申请
    BOOTSTRAP AND ADAPT A DOCUMENT SEARCH ENGINE 有权
    BOOTSTRAP并适应文件搜索引擎

    公开(公告)号:US20110231394A1

    公开(公告)日:2011-09-22

    申请号:US12726358

    申请日:2010-03-18

    IPC分类号: G06F17/30

    摘要: Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.

    摘要翻译: 采用基于语言建模的建模技术的架构来估计在查询中表达的与用户匹配的文档的概率。 建模技术基于数据挖掘结果,文档的各个部分(例如,主体,标题,URL,锚文本,用户查询)使用不同风格的人类语言。 因此,基于语言的结果可以单独调整以匹配查询语言。 由于该方法是基于适应性的,所以该框架还提供了一种在收集用户数据时逐步修改模型的自然方法。 文档中不同风格的语言可以单独识别和修改。 还使用背景语言模型,在文档具有不完整的数据领域的情况下提供回退方法,并且可以利用知识领域的局部或语义层次结构。

    ONLINE SPELLING CORRECTION/PHRASE COMPLETION SYSTEM
    4.
    发明申请
    ONLINE SPELLING CORRECTION/PHRASE COMPLETION SYSTEM 审中-公开
    在线传播校正/ PHRASE完成系统

    公开(公告)号:US20120246133A1

    公开(公告)日:2012-09-27

    申请号:US13069526

    申请日:2011-03-23

    IPC分类号: G06F17/30

    CPC分类号: G06F17/273 G06F17/276

    摘要: Online spelling correction/phrase completion is described herein. A computer-executable application receives a phrase prefix from a user, wherein the phrase prefix includes a first character sequence. A transformation probability is retrieved responsive to receipt of the phrase prefix, wherein the transformation probability indicates a probability that a second character sequence has been transformed into a first character sequence. A search is then executed over a trie to locate a most probable phrase completion based at least in part upon the transformation probability.

    摘要翻译: 本文描述了在线拼写校正/短语完成。 计算机可执行应用程序从用户接收短语前缀,其中短语前缀包括第一字符序列。 响应于短语前缀的接收来检索变换概率,其中变换概率表示第二字符序列已被变换为第一字符序列的概率。 然后,通过一个搜索来执行搜索以至少部分地基于转换概率来定位最可能的短语完成。

    Identifying query formulation suggestions for low-match queries
    5.
    发明授权
    Identifying query formulation suggestions for low-match queries 有权
    识别低匹配查询的查询制定建议

    公开(公告)号:US08965872B2

    公开(公告)日:2015-02-24

    申请号:US13172561

    申请日:2011-06-29

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: Systems, methods and computer-storage media are provided for identifying low-match search queries and determining comparable item matches to suggest to the user in response to a low-match query. “Low-match queries” are queries for which an insufficient number of exact item matches are available. In embodiments, exact and/or comparable item matches may be determined via semantic analysis. Also provided are systems, methods and computer-storage media for informing the user, by way of a presented indicator, or the like, that a presented item was selected for presentation based upon a similarity metric rather than being determined an exact match for the input query.

    摘要翻译: 系统,方法和计算机存储介质被提供用于识别低匹配搜索查询并且确定可响应于低匹配查询的用户建议的可比较项目匹配。 “低匹配查询”是对可用的确切项目匹配数量不足的查询。 在实施例中,可以通过语义分析来确定精确和/或可比较的项目匹配。 还提供了系统,方法和计算机存储介质,用于通过所呈现的指示符等向用户通知基于相似性度量而选择呈现的呈现项目,而不是确定输入的精确匹配 查询。

    FEDERATED IMPLICIT SEARCH
    6.
    发明申请
    FEDERATED IMPLICIT SEARCH 有权
    联合隐含搜索

    公开(公告)号:US20110295852A1

    公开(公告)日:2011-12-01

    申请号:US12791000

    申请日:2010-06-01

    IPC分类号: G06F17/30 G06F3/01

    CPC分类号: G06F17/30867 G06Q10/00

    摘要: A resource selection system is described for assisting a user in performing a task that includes multiple actions. At each stage of the task, the system presents a set resources from which the user may select to perform a subsequent action in the task. The system implicitly selects the set of resources based on context information that identifies the user's current informational needs. For example, the context information may be derived from textual information that is being presented on a user device, which the user is presumed to be viewing at the current time. In one implementation, the system selects the set of resources by computing language models for respective domains and respective entities. The system uses the language models to determine the relevance of the context information to each of the domains. The system then selects resources associated with domains that have been assessed as relevant.

    摘要翻译: 描述了一种资源选择系统,用于帮助用户执行包括多个动作的任务。 在任务的每个阶段,系统呈现用户可以从中选择执行任务中的后续动作的集合资源。 系统基于识别用户当前信息需求的上下文信息来隐含地选择资源集合。 例如,可以从呈现在用户设备上的文本信息导出上下文信息,用户被认为在当前时间正在观看。 在一个实现中,系统通过计算各个域和相应实体的语言模型来选择资源集合。 系统使用语言模型来确定上下文信息与每个域的相关性。 然后系统选择与被评估为相关的域相关联的资源。

    Federated implicit search
    7.
    发明授权
    Federated implicit search 有权
    联合隐式搜索

    公开(公告)号:US08359311B2

    公开(公告)日:2013-01-22

    申请号:US12791000

    申请日:2010-06-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867 G06Q10/00

    摘要: A resource selection system is described for assisting a user in performing a task that includes multiple actions. At each stage of the task, the system presents a set resources from which the user may select to perform a subsequent action in the task. The system implicitly selects the set of resources based on context information that identifies the user's current informational needs. For example, the context information may be derived from textual information that is being presented on a user device, which the user is presumed to be viewing at the current time. In one implementation, the system selects the set of resources by computing language models for respective domains and respective entities. The system uses the language models to determine the relevance of the context information to each of the domains. The system then selects resources associated with domains that have been assessed as relevant.

    摘要翻译: 描述了一种资源选择系统,用于帮助用户执行包括多个动作的任务。 在任务的每个阶段,系统呈现用户可以从中选择执行任务中的后续动作的集合资源。 系统基于识别用户当前信息需求的上下文信息来隐含地选择资源集合。 例如,可以从呈现在用户设备上的文本信息导出上下文信息,用户被认为在当前时间正在观看。 在一个实现中,系统通过计算各个域和相应实体的语言模型来选择资源集合。 系统使用语言模型来确定上下文信息与每个域的相关性。 然后系统选择与被评估为相关的域相关联的资源。

    Retrieval of prefix completions by way of walking nodes of a trie data structure
    8.
    发明授权
    Retrieval of prefix completions by way of walking nodes of a trie data structure 有权
    通过步行数据结构的节点检索前缀完成

    公开(公告)号:US09158758B2

    公开(公告)日:2015-10-13

    申请号:US13345750

    申请日:2012-01-09

    申请人: Bo-June Hsu

    发明人: Bo-June Hsu

    IPC分类号: G06F17/30 G06F17/27

    CPC分类号: G06F17/276 G06F17/30327

    摘要: Technologies pertaining to providing completions to proffered prefixes are disclosed herein. A suggested completion to a proffered prefix is retrieved by walking nodes of a trie data structure, wherein a node includes one or more characters that are used to extend a character sequence represented by its parent. Each node in the trie data structure is assigned a score, wherein the score maps to a best score assigned to its descendants. The nodes of the trie data structure are sorted based upon score, and the nodes are walked based upon scores assigned thereto.

    摘要翻译: 本文公开了提供完成前缀的技术。 通过特里数据结构的步行节点检索对提供的前缀的建议完成,其中节点包括用于扩展由其父代表的字符序列的一个或多个字符。 特里数据结构中的每个节点被分配一个分数,其中分数映射到分配给其后代的最佳分数。 基于分数对特里数据结构的节点进行排序,并且基于分配给它们的分数来行进节点。

    GENERATING AND PRESENTING A SUGGESTED SEARCH QUERY
    9.
    发明申请
    GENERATING AND PRESENTING A SUGGESTED SEARCH QUERY 有权
    生成和呈现建议的搜索查询

    公开(公告)号:US20110320470A1

    公开(公告)日:2011-12-29

    申请号:US12824879

    申请日:2010-06-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F17/3064

    摘要: The present invention is directed to presenting a suggested search query. Responsive to receiving a user-devised search parameter, a suggested search query is identified. The user-devised search parameter might have been previously received by a search system, or alternatively, might be a unique query that has not been previously received. A suggested search query might be generated using various techniques, such as by applying an n-gram language model. A classification of the suggested search query is determined, and the suggested search query is presented together with a visual indicator, which signifies the classification.

    摘要翻译: 本发明旨在呈现建议的搜索查询。 响应于接收用户设计的搜索参数,识别出建议的搜索查询。 用户设计的搜索参数可能先前已经被搜索系统接收,或者可以是先前未被接收到的唯一查询。 可以使用各种技术来生成建议的搜索查询,例如通过应用n-gram语言模型。 确定建议的搜索查询的分类,并将建议的搜索查询与视觉指示符一起呈现,这意味着分类。

    Combined speech and alternate input modality to a mobile device
    10.
    发明授权
    Combined speech and alternate input modality to a mobile device 有权
    组合语音和交替输入模式到移动设备

    公开(公告)号:US07941316B2

    公开(公告)日:2011-05-10

    申请号:US11262230

    申请日:2005-10-28

    IPC分类号: G10L15/26

    CPC分类号: G10L15/22

    摘要: A method of entering information into a mobile device includes receiving a multi-word speech input from a user, performing speech recognition on the speech input to obtain a multi-word speech recognition result, and sequentially displaying, in a display, words in the speech recognition result for user confirmation or correction, by adding one word at a time to the display. A next word is only displayed after user confirmation or correct has been received for a previously displayed word that is immediately preceding the next word in the speech recognition result. The method also includes calculating a hypothesis lattice indicative of a plurality of speech recognition hypotheses based on the speech input and, prior to finishing calculating the hypothesis lattice and while continuing to calculate the hypothesis lattice, calculating a preliminary hypothesis lattice indicative of only partial speech recognition hypotheses based on the speech input and outputting the preliminary hypotheses lattice.

    摘要翻译: 将信息输入到移动设备的方法包括从用户接收多字语音输入,在语音输入上执行语音识别以获得多字语音识别结果,并且在显示器中依次显示语音中的单词 用户确认或校正的识别结果,通过一次添加一个单词到显示。 仅在用户确认之后才显示下一个单词,或者在语音识别结果中紧接在下一个单词之前的先前显示的单词已经接收到正确的单词。 该方法还包括基于语音输入来计算指示多个语音识别假设的假设格点,并且在完成计算假设网格之前并在继续计算假设网格的同时,计算指示仅部分语音识别的初步假设点 基于语音输入的假设,并输出初步假设格。