Speech recognition system with display information
    71.
    发明授权
    Speech recognition system with display information 有权
    具有显示信息的语音识别系统

    公开(公告)号:US08364487B2

    公开(公告)日:2013-01-29

    申请号:US12255270

    申请日:2008-10-21

    IPC分类号: G10L15/18

    CPC分类号: G10L15/22 G10L15/19

    摘要: A language processing system may determine a display form of a spoken word by analyzing the spoken form using a language model that includes dictionary entries for display forms of homonyms. The homonyms may include trade names as well as given names and other phrases. The language processing system may receive spoken language and produce a display form of the language while displaying the proper form of the homonym. Such a system may be used in search systems where audio input is converted to a graphical display of a portion of the spoken input.

    摘要翻译: 语言处理系统可以通过使用包括用于同音异构的显示形式的字典条目的语言模型来分析口头表单来确定口语单词的显示形式。 同音异义可能包括商品名称以及给定的名称和其他短语。 语言处理系统可以接收口语,并产生语言的显示形式,同时显示适当形式的同音异义。 这样的系统可以用在搜索系统中,其中音频输入被转换为口语输入的一部分的图形显示。

    Spelling Using a Fuzzy Pattern Search
    72.
    发明申请
    Spelling Using a Fuzzy Pattern Search 审中-公开
    拼写使用模糊模式搜索

    公开(公告)号:US20120323967A1

    公开(公告)日:2012-12-20

    申请号:US13159442

    申请日:2011-06-14

    IPC分类号: G06F17/30

    CPC分类号: G06F16/685 G06F16/93

    摘要: A multimedia system configured to receive user input in the form of a spelled character sequence is provided. In one implementation, a spell mode is initiated, and a user spells a character sequence. The multimedia system performs spelling recognition and recognizes a sequence of character representations having a possible ambiguity resulting from any user and/or system errors. The sequence of character representations with the possible ambiguity yields multiple search keys. The multimedia system performs a fuzzy pattern search by scoring each target item from a finite dataset of target items based on the multiple search keys. One or more relevant items are ranked and presented to the user for selection, each relevant item being a target item that exceeds a relevancy threshold. The user selects the indented character sequence from the one or more relevant items.

    摘要翻译: 提供了被配置为以拼写字符序列的形式接收用户输入的多媒体系统。 在一个实现中,启动拼写模式,并且用户拼写字符序列。 多媒体系统执行拼写识别并识别由任何用户和/或系统错误导致的可能的模糊性的字符表示序列。 具有可能模糊性的字符表示序列产生多个搜索关键字。 多媒体系统通过基于多个搜索关键词从目标物品的有限数据集中对每个目标物品进行评分来执行模糊模式搜索。 将一个或多个相关项目排序并呈现给用户进行选择,每个相关项目是超过相关阈值的目标项目。 用户从一个或多个相关项目中选择缩进的字符序列。

    Word-Dependent Language Model
    73.
    发明申请
    Word-Dependent Language Model 有权
    词语相关语言模型

    公开(公告)号:US20120166196A1

    公开(公告)日:2012-06-28

    申请号:US12977461

    申请日:2010-12-23

    IPC分类号: G10L15/04

    CPC分类号: G10L15/19

    摘要: This document describes word-dependent language models, as well as their creation and use. A word-dependent language model can permit a speech-recognition engine to accurately verify that a speech utterance matches a multi-word phrase. This is useful in many contexts, including those where one or more letters of the expected phrase are known to the speaker.

    摘要翻译: 本文档描述了依赖于字的语言模型,以及它们的创建和使用。 一个与字相关的语言模型可以允许一个语音识别引擎准确地验证一个语音发音是否匹配一个多单词短语。 这在许多情况下是有用的,包括说话者知道预期短语的一个或多个字母的情况。

    Detecting an answering machine using speech recognition
    74.
    发明授权
    Detecting an answering machine using speech recognition 失效
    使用语音识别检测应答机

    公开(公告)号:US08065146B2

    公开(公告)日:2011-11-22

    申请号:US11485011

    申请日:2006-07-12

    IPC分类号: G10L17/00

    摘要: An answering machine detection module is used to determine whether a call recipient is an actual person or an answering machine. The answering machine detection module includes a speech recognizer and a call analysis module. The speech recognizer receives an audible response of the call recipient to a call. The speech recognizer processes the audible response and provides an output indicative of recognized speech. The call analysis module processes the output of the speech recognizer to generate an output indicative of whether the call recipient is a person or an answering machine.

    摘要翻译: 应答机检测模块用于确定呼叫接收者是实际的人还是应答机。 应答机检测模块包括语音识别器和呼叫分析模块。 语音识别器接收呼叫接收者对呼叫的可听见的响应。 语音识别器处理可听见的响应并提供表示识别的语音的输出。 呼叫分析模块处理语音识别器的输出以产生指示呼叫接收者是个人还是应答机的输出。

    Conveying locations in spoken dialog systems
    75.
    发明授权
    Conveying locations in spoken dialog systems 有权
    在口语对话系统中传送位置

    公开(公告)号:US08065078B2

    公开(公告)日:2011-11-22

    申请号:US11836955

    申请日:2007-08-10

    IPC分类号: G01C21/00 G08G1/123

    CPC分类号: G01C21/3644 G01C21/3679

    摘要: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.

    摘要翻译: 通过旅行分散给用户的位置信息的呈现可能导致用户快速地忘记甚至不理解诸如街道号码的位置信息的关键部分。 识别可以由用户目的地附近的交叉点和兴趣点组成,然后可以提供地址,也可以除了地址之外提供,从而增加用户理解和保留,特别是在分心时。 地图数据可以解析为地址,交叉点和兴趣点数据库。 可以访问这些数据库以识别最近的交叉点和兴趣点,然后可以对这些数据进行过滤并随后进行排序以识别一个交点,一个兴趣点或二者,可以呈现给用户以帮助用户理解和保留 位置信息即使分心。

    Adapting a language model to accommodate inputs not found in a directory assistance listing
    76.
    发明授权
    Adapting a language model to accommodate inputs not found in a directory assistance listing 有权
    适应语言模型以适应在目录帮助列表中找不到的输入

    公开(公告)号:US07912707B2

    公开(公告)日:2011-03-22

    申请号:US11642003

    申请日:2006-12-19

    IPC分类号: G06F17/21

    CPC分类号: G10L15/063 G10L15/197

    摘要: A statistical language model is trained for use in a directory assistance system using the data in a directory assistance listing corpus. Calculations are made to determine how important words in the corpus are in distinguishing a listing from other listings, and how likely words are to be omitted or added by a user. The language model is trained using these calculations.

    摘要翻译: 训练统计语言模型,以使用目录援助列表语料库中的数据在目录辅助系统中使用。 进行计算,以确定语料库中的单词在区分列表和其他列表中的重要程度以及用户可能忽略或添加单词的可能性。 使用这些计算训练语言模型。

    Shareable filler model for grammar authoring
    77.
    发明授权
    Shareable filler model for grammar authoring 有权
    用于语法创作的可共享填充模型

    公开(公告)号:US07865357B2

    公开(公告)日:2011-01-04

    申请号:US11375488

    申请日:2006-03-14

    CPC分类号: G10L15/197 G10L15/193

    摘要: A method of forming a shareable filler model (shareable model for garbage words) from a word n-gram model is provided. The word n-gram model is converted into a probabilistic context free grammar (PCFG). The PCFG is modified into a substantially application-independent PCFG, which constitutes the shareable filler model.

    摘要翻译: 提供了从单词n-gram模型形成可共享填充模型(垃圾字的可共享模型)的方法。 单词n-gram模型被转换成概率上下文无关语法(PCFG)。 PCFG被修改为基本上与应用无关的PCFG,其构成可共享填充模型。

    REPLYING TO TEXT MESSAGES VIA AUTOMATED VOICE SEARCH TECHNIQUES
    78.
    发明申请
    REPLYING TO TEXT MESSAGES VIA AUTOMATED VOICE SEARCH TECHNIQUES 有权
    通过自动语音搜索技术回复文字信息

    公开(公告)号:US20100145694A1

    公开(公告)日:2010-06-10

    申请号:US12329406

    申请日:2008-12-05

    IPC分类号: G10L15/26

    摘要: An automated “Voice Search Message Service” provides a voice-based user interface for generating text messages from an arbitrary speech input. Specifically, the Voice Search Message Service provides a voice-search information retrieval process that evaluates user speech inputs to select one or more probabilistic matches from a database of pre-defined or user-defined text messages. These probabilistic matches are also optionally sorted in terms of relevancy. A single text message from the probabilistic matches is then selected and automatically transmitted to one or more intended recipients. Optionally, one or more of the probabilistic matches are presented to the user for confirmation or selection prior to transmission. Correction or recovery of speech recognition errors avoided since the probabilistic matches are intended to paraphrase the user speech input rather than exactly reproduce that speech, though exact matches are possible. Consequently, potential distractions to the user are significantly reduced relative to conventional speech recognition techniques.

    摘要翻译: 自动“语音搜索消息服务”提供基于语音的用户界面,用于从任意语音输入生成文本消息。 具体地,语音搜索消息服务提供语音搜索信息检索过程,其评估用户语音输入以从预定义或用户定义的文本消息的数据库中选择一个或多个概率匹配。 这些概率匹配也可以根据相关性进行排序。 然后选择来自概率匹配的单个文本消息并将其自动发送到一个或多个预期接收者。 可选地,一个或多个概率匹配在发送之前被呈现给用户进行确认或选择。 纠正或恢复语音识别错误避免了,因为概率匹配旨在释义用户语音输入,而不是精确地再现该语音,尽管精确匹配是可能的。 因此,相对于传统的语音识别技术,对用户的潜在干扰显着降低。

    INTRA-LANGUAGE STATISTICAL MACHINE TRANSLATION
    79.
    发明申请
    INTRA-LANGUAGE STATISTICAL MACHINE TRANSLATION 有权
    语言统计机翻译

    公开(公告)号:US20090248422A1

    公开(公告)日:2009-10-01

    申请号:US12058328

    申请日:2008-03-28

    IPC分类号: G10L11/00 G06F17/28

    CPC分类号: G06F17/2818 G06F17/2827

    摘要: Training data may be provided, the training data including pairs of source phrases and target phrases. The pairs may be used to train an intra-language statistical machine translation model, where the intra-language statistical machine translation model, when given an input phrase of text in the human language, can compute probabilities of semantic equivalence of the input phrase to possible translations of the input phrase in the human language. The statistical machine translation model may be used to translate between queries and listings. The queries may be text strings in the human language submitted to a search engine. The listing strings may be text strings of formal names of real world entities that are to be searched by the search engine to find matches for the query strings.

    摘要翻译: 可以提供训练数据,训练数据包括源短语和目标短语对。 这些对可以用于训练语言间统计机器翻译模型,其中语言内统计机器翻译模型在给予人类语言的文本的输入短语时可以计算输入短语的语义等同性的可能性 输入短语在人类语言中的翻译。 统计机器翻译模型可用于在查询和列表之间进行翻译。 查询可以是提交给搜索引擎的人类语言中的文本字符串。 列表字符串可以是要由搜索引擎搜索以查找查询字符串的匹配的真实世界实体的正式名称的文本串。

    ASSOCIATIVE INTERFACE FOR PERSONALIZING VOICE DATA ACCESS
    80.
    发明申请
    ASSOCIATIVE INTERFACE FOR PERSONALIZING VOICE DATA ACCESS 审中-公开
    用于个性化语音数据访问的相关接口

    公开(公告)号:US20090100340A1

    公开(公告)日:2009-04-16

    申请号:US11870039

    申请日:2007-10-10

    IPC分类号: G06F3/16 G06F21/00 G10L13/08

    摘要: The claimed subject matter according to one aspect provides systems and/or methods that effectuate user development, customization, or utilization of dynamically configurable dialogue flow systems. The system can include devices and components that employ data associated with a user to retrieve navigation panes unique with respect to the user, scans the navigation panes and identifies adjustable attributes, utilizes the adjustable attributes to generate voice prompts communicated to the user via handheld devices, the user in reply to the voice prompts utters personalized responses associated with the voice prompts, and based at least on the personalized responses initiates actions associated with the adjustable attributes.

    摘要翻译: 根据一个方面的要求保护的主题提供了实现用户开发,定制或利用可动态配置的对话流系统的系统和/或方法。 系统可以包括使用与用户相关联的数据的设备和组件来检索关于用户唯一的导航窗格,扫描导航窗格并识别可调整的属性,利用可调整属性来产生通过手持设备传送给用户的语音提示, 回复语音提示的用户发出与语音提示相关联的个性化响应,并且至少基于个性化响应启动与可调整属性相关联的动作。