-
Publication No.: US08838449B2
Publication date: 2014-09-16
Application No.: US12977461
Filing date: 2010-12-23
Applicant: Yun-Cheng Ju, Ivan J. Tashev, Chad R. Heinemann
Inventor: Yun-Cheng Ju, Ivan J. Tashev, Chad R. Heinemann
CPC classification: G10L15/19
Abstract: This document describes word-dependent language models, as well as their creation and use. A word-dependent language model can permit a speech-recognition engine to accurately verify that a speech utterance matches a multi-word phrase. This is useful in many contexts, including those where one or more letters of the expected phrase are known to the speaker.
-
Publication No.: US20120166196A1
Publication date: 2012-06-28
Application No.: US12977461
Filing date: 2010-12-23
Applicant: Yun-Cheng Ju, Ivan J. Tashev, Chad R. Heinemann
Inventor: Yun-Cheng Ju, Ivan J. Tashev, Chad R. Heinemann
IPC classification: G10L15/04
CPC classification: G10L15/19
Abstract: This document describes word-dependent language models, as well as their creation and use. A word-dependent language model can permit a speech-recognition engine to accurately verify that a speech utterance matches a multi-word phrase. This is useful in many contexts, including those where one or more letters of the expected phrase are known to the speaker.
-
Publication No.: US20090082037A1
Publication date: 2009-03-26
Application No.: US11860433
Filing date: 2007-09-24
Applicant: Yun-Cheng Ju, Michael Seltzer, Ivan J. Tashev
Inventor: Yun-Cheng Ju, Michael Seltzer, Ivan J. Tashev
IPC classification: H04Q7/20
CPC classification: G01C21/3608, G01C21/3679, H04L67/18
Abstract: A framework for receiving, processing, and re-using personal points of interest (PPOI) information of a user in a location-based application. A telephone dialog system provides location-based information related to the PPOI of a user. For example, the PPOI information can include major intersections that the user may normally travel, gas stations, clubs, etc., based on real-time data obtained via web services. The PPOI information can be acquired using common names and nicknames, which are added to the system lexicon and recognition grammars. Each PPOI is also tagged to the user (or "owner") who defined it. The PPOI information can also be shared to support a community of users. The framework also resolves conflicting PPOI information between multiple users and multiple locations. PPOI information input by one user can be used to extract demographic information and personal preferences, and can be re-used by other users by automatically popping up common names and attributes that other users entered for the same nickname.
-
Publication No.: US07983913B2
Publication date: 2011-07-19
Application No.: US11888275
Filing date: 2007-07-31
Applicant: Michael L. Seltzer, Yun-Cheng Ju, Ivan J. Tashev
Inventor: Michael L. Seltzer, Yun-Cheng Ju, Ivan J. Tashev
IPC classification: G10L15/04
CPC classification: G10L15/1815, G10L15/193
Abstract: In one embodiment, the present system recognizes a user's speech input using an automatically generated probabilistic context-free grammar for street names that maps all pronunciation variations of a street name to a single canonical representation during recognition. A tokenizer expands the representation using position-dependent phonetic tokens, and an intersection classifier classifies an intersection despite the presence of recognition errors and incomplete street names.
-
Publication No.: US20120323967A1
Publication date: 2012-12-20
Application No.: US13159442
Filing date: 2011-06-14
Applicant: Yun-Cheng Ju, Ivan J. Tashev, Xiao Li, Dax Hawkins, Thomas Soemo, Michael H. Kim
Inventor: Yun-Cheng Ju, Ivan J. Tashev, Xiao Li, Dax Hawkins, Thomas Soemo, Michael H. Kim
IPC classification: G06F17/30
CPC classification: G06F16/685, G06F16/93
Abstract: A multimedia system configured to receive user input in the form of a spelled character sequence is provided. In one implementation, a spell mode is initiated, and a user spells a character sequence. The multimedia system performs spelling recognition and recognizes a sequence of character representations having a possible ambiguity resulting from any user and/or system errors. The sequence of character representations with the possible ambiguity yields multiple search keys. The multimedia system performs a fuzzy pattern search by scoring each target item from a finite dataset of target items based on the multiple search keys. One or more relevant items are ranked and presented to the user for selection, each relevant item being a target item that exceeds a relevancy threshold. The user selects the intended character sequence from the one or more relevant items.
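Illustrative sketch for US20120323967A1 (not taken from the patent): the abstract describes expanding an ambiguous spelled sequence into multiple search keys, scoring every target item against those keys, and keeping the items above a relevancy threshold. The Python below is one minimal way this could look. The function names, the use of difflib's similarity ratio as the score, and the 0.6 threshold are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import product


def expand_keys(candidates):
    """Expand per-position character alternatives into every search key.

    `candidates` is a list of lists, one inner list per spelled position,
    e.g. [["b", "d"], ["a"], ["t", "p"]] when "b"/"d" and "t"/"p" were confusable.
    """
    return ["".join(chars) for chars in product(*candidates)]


def fuzzy_search(candidates, targets, threshold=0.6):
    """Score each target by its best similarity to any key and rank the hits.

    The similarity measure (difflib ratio) and the threshold are assumptions.
    """
    keys = expand_keys(candidates)
    scored = []
    for target in targets:
        best = max(SequenceMatcher(None, key, target.lower()).ratio() for key in keys)
        if best >= threshold:
            scored.append((target, best))
    # Highest score first; ties keep the original dataset order.
    return sorted(scored, key=lambda item: -item[1])


if __name__ == "__main__":
    spelled = [["b", "d"], ["a"], ["t", "p"]]
    for title, score in fuzzy_search(spelled, ["batman", "dap", "zelda"]):
        print(f"{title}: {score:.2f}")
```

Taking the maximum over all keys means a single plausible reading of the ambiguous input is enough for an item to surface, which matches the abstract's idea of presenting several relevant items for the user to choose from.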
-
Publication No.: US20090037174A1
Publication date: 2009-02-05
Application No.: US11888275
Filing date: 2007-07-31
Applicant: Michael L. Seltzer, Yun-Cheng Ju, Ivan J. Tashev
Inventor: Michael L. Seltzer, Yun-Cheng Ju, Ivan J. Tashev
IPC classification: G10L15/00
CPC classification: G10L15/1815, G10L15/193
Abstract: In one embodiment, the present system recognizes a user's speech input using an automatically generated probabilistic context-free grammar for street names that maps all pronunciation variations of a street name to a single canonical representation during recognition. A tokenizer expands the representation using position-dependent phonetic tokens, and an intersection classifier classifies an intersection despite the presence of recognition errors and incomplete street names.
-
Publication No.: US09082403B2
Publication date: 2015-07-14
Application No.: US13326659
Filing date: 2011-12-15
CPC classification: G10L15/1822
Abstract: The subject disclosure is directed towards training a classifier for spoken utterances without relying on human assistance. The spoken utterances may be related to a voice menu program for which a speech comprehension component interprets the spoken utterances into voice menu options. The speech comprehension component provides confirmations to some of the spoken utterances in order to accurately assign a semantic label. For each spoken utterance with a denied confirmation, the speech comprehension component automatically generates a pseudo-semantic label that is consistent with the denied confirmation and selected from a set of potential semantic labels, and updates a classification model associated with the classifier using the pseudo-semantic label.
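Illustrative sketch for US09082403B2 (not taken from the patent): the abstract says a denied confirmation leads to a pseudo-semantic label that is consistent with the denial and drawn from a set of potential labels, which is then used to update the model. One simple reading, assumed here, is to drop the denied label and take the best-scoring remaining one. The function names, the score dictionary, and the word-count "model" are hypothetical stand-ins.

```python
def pseudo_semantic_label(scores, denied_label):
    """Return the best-scoring label that is consistent with the denial.

    `scores` maps each potential semantic label to the classifier's score.
    Choosing the runner-up is an assumption; the abstract only requires
    that the chosen label not contradict the denied confirmation.
    """
    remaining = {label: s for label, s in scores.items() if label != denied_label}
    if not remaining:
        return None
    return max(remaining, key=remaining.get)


def update_counts(counts, utterance, label):
    """Toy model update: count how often each word co-occurs with a label."""
    label_counts = counts.setdefault(label, {})
    for word in utterance.lower().split():
        label_counts[word] = label_counts.get(word, 0) + 1
    return counts


if __name__ == "__main__":
    scores = {"billing": 0.5, "support": 0.3, "sales": 0.2}
    # The system asked "Did you mean billing?" and the caller said no.
    label = pseudo_semantic_label(scores, denied_label="billing")
    print(label, update_counts({}, "my internet is down", label))
```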
-
Publication No.: US20120185252A1
Publication date: 2012-07-19
Application No.: US13428917
Filing date: 2012-03-23
Applicant: Ye-Yi Wang, Yun-Cheng Ju, Dong Yu
Inventor: Ye-Yi Wang, Yun-Cheng Ju, Dong Yu
IPC classification: G10L15/04
CPC classification: G10L15/1822
Abstract: A method of generating a confidence measure generator is provided for use in a voice search system, the voice search system including voice search components comprising a speech recognition system, a dialog manager and a search system. The method includes selecting voice search features, from a plurality of the voice search components, to be considered by the confidence measure generator in generating a voice search confidence measure. The method includes training a model, using a computer processor, to generate the voice search confidence measure based on selected voice search features.
-
Publication No.: US20090287626A1
Publication date: 2009-11-19
Application No.: US12200648
Filing date: 2008-08-28
CPC classification: G06F16/3322, G10L15/26
Abstract: A multi-modal search system (and corresponding methodology) is provided. The system employs text, speech, touch and gesture input to establish a search query. Additionally, a subset of the modalities can be used to obtain search results based upon exact or approximate matches to a search result. For example, wildcards, which can either be triggered by the user or inferred by the system, can be employed in the search.
-
Publication No.: US20080172376A1
Publication date: 2008-07-17
Application No.: US11652733
Filing date: 2007-01-12
Applicant: Dong Yu, Alejandro Acero, Yun-Cheng Ju, Ye-Yi Wang
Inventor: Dong Yu, Alejandro Acero, Yun-Cheng Ju, Ye-Yi Wang
IPC classification: G06F17/30
CPC classification: G06F17/30663, Y10S707/99933, Y10S707/99935, Y10S707/99942
Abstract: A computer-implemented method is disclosed for providing a directory assistance service. The method includes generating an indexing file that is a representation of information associated with a collection of listings stored in an index. The indexing file is utilized as a basis for ranking listings in an index based on the strength of association with a query. Based at least in part on the ranking, an output is provided and is indicative of listings in the index that likely correspond to the query. At least one particular listing in the index is excluded from the output without there ever being a comparison of features in the query with features in the one particular listing.
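Illustrative sketch for US20080172376A1 (not taken from the patent): the abstract describes ranking listings through an indexing file such that some listings are excluded without ever being compared with the query. An inverted index has this property, so the Python below uses one as an assumed stand-in. The function names, the whitespace tokenization, and the shared-term-count score are hypothetical.

```python
from collections import defaultdict


def build_index(listings):
    """Map each lowercase term to the set of listing ids that contain it."""
    index = defaultdict(set)
    for listing_id, name in enumerate(listings):
        for term in name.lower().split():
            index[term].add(listing_id)
    return index


def rank(query, listings, index):
    """Rank listings by the number of query terms they share.

    Only listings reachable through the index are ever scored, so a listing
    that shares no term with the query is excluded without being examined.
    """
    scores = defaultdict(int)
    for term in set(query.lower().split()):
        for listing_id in index.get(term, ()):
            scores[listing_id] += 1
    ranked = sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))
    return [listings[listing_id] for listing_id, _ in ranked]


if __name__ == "__main__":
    listings = ["Joe's Pizza Seattle", "Pizza Hut", "Seattle Hardware", "City Library"]
    index = build_index(listings)
    print(rank("pizza seattle", listings, index))
```

In this toy run, "City Library" never enters the scoring loop because none of its terms appear in the query, which is the behavior the last sentence of the abstract points at.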