Abstract:
Systems and methods for responding to spoken language input or multi-modal input are described herein. More specifically, one or more user intents are determined or inferred from the spoken language input or multi-modal input to determine one or more user goals via a dialogue belief tracking system. The systems and methods disclosed herein utilize the dialogue belief tracking system to perform actions based on the determined one or more user goals and allow a device to engage in human-like conversation with a user over multiple turns. Relieving the user of having to explicitly state each intent and goal, while still delivering the desired result, improves the user's ability to accomplish tasks, perform commands, and obtain desired products and/or services. Additionally, the improved response to spoken language inputs improves user interactions with the device.
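The multi-turn behavior described above can be illustrated with a minimal sketch of a slot-filling belief tracker. This is not the patent's implementation; the class, slot names, and the readiness check are all illustrative assumptions about how a belief state might accumulate intents and goal parameters across turns.

```python
# Minimal multi-turn dialogue belief tracker (illustrative sketch).
# Each turn yields an inferred intent and slot values; the tracker folds
# them into a belief state until the goal is complete enough to act on,
# so the user never has to restate every intent and goal explicitly.

class BeliefTracker:
    def __init__(self, required_slots):
        self.required_slots = required_slots  # slots needed before acting
        self.belief = {}                      # accumulated slot -> value

    def update(self, intent, slots):
        """Fold one conversation turn into the belief state."""
        self.belief["intent"] = intent
        self.belief.update(slots)
        return self.belief

    def missing_slots(self):
        return [s for s in self.required_slots if s not in self.belief]

    def ready_to_act(self):
        return not self.missing_slots()

tracker = BeliefTracker(required_slots=["cuisine", "time"])
tracker.update("book_restaurant", {"cuisine": "thai"})   # turn 1
print(tracker.ready_to_act())                            # False: "time" missing
tracker.update("book_restaurant", {"time": "7pm"})       # turn 2
print(tracker.ready_to_act())                            # True: goal complete
```

The device would act (e.g. place the booking) only once `ready_to_act()` holds, otherwise it asks a follow-up question for a missing slot.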
Abstract:
A method or associated system for motion adaptive speech processing includes dynamically estimating a motion profile that is representative of a user's motion based on data from one or more resources, such as sensors and non-speech resources, associated with the user. The method includes effecting processing of a speech signal received from the user, for example, while the user is in motion, the processing taking into account the estimated motion profile to produce an interpretation of the speech signal. Dynamically estimating the motion profile can include computing a motion weight vector using the data from the one or more resources associated with the user, and can further include interpolating a plurality of models using the motion weight vector to generate a motion adaptive model. The motion adaptive model can be used to enhance voice destination entry for the user and re-used for other users who do not provide motion profiles.
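The weight-vector interpolation step can be sketched as follows. The prototype features, the softmax-style weighting, and the toy parameter sets are assumptions for illustration; the abstract only specifies that a motion weight vector computed from sensor data interpolates a plurality of models.

```python
import numpy as np

# Sketch: interpolate per-condition model parameters (e.g. stationary /
# walking / driving models) using a motion weight vector derived from
# sensor features. The similarity-based weighting is an assumption.

def motion_weight_vector(sensor_features, condition_prototypes):
    """Weight each motion condition by closeness to the sensed features."""
    dists = np.linalg.norm(condition_prototypes - sensor_features, axis=1)
    scores = -dists                      # closer prototype -> higher score
    w = np.exp(scores - scores.max())    # softmax for a normalized weight vector
    return w / w.sum()

def interpolate_models(weights, model_params):
    """Weighted combination of the per-condition parameter sets."""
    return np.tensordot(weights, model_params, axes=1)

prototypes = np.array([[0.0, 0.0], [1.0, 0.2], [3.0, 1.0]])  # still/walk/drive
params = np.array([[0.1, 0.1], [0.5, 0.4], [0.9, 0.8]])      # toy model params
w = motion_weight_vector(np.array([1.1, 0.3]), prototypes)   # user is walking
adapted = interpolate_models(w, params)                      # motion adaptive model
```

Because the adapted parameters depend only on the weight vector, the same per-condition models can be reused for other users, matching the abstract's note about users who do not provide motion profiles.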
Abstract:
A voice-to-text model used by a voice-enabled electronic device is updated dynamically and in a context-sensitive manner to facilitate recognition of entities that may be spoken by a user in a voice input directed to the voice-enabled electronic device. The dynamic update to the voice-to-text model may be performed, for example, based upon processing of a first portion of a voice input, e.g., based upon detection of a particular type of voice action, and may be targeted to facilitate the recognition of entities that may occur in a later portion of the same voice input, e.g., entities that are particularly relevant to one or more parameters associated with the detected type of voice action.
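The two-phase idea can be sketched simply: decode the first portion, detect the action type, and augment the recognizer's vocabulary with entities relevant to that action's parameters before the later portion is decoded. The action names, entity lists, and prefix-based detector below are illustrative assumptions, not the patent's mechanism.

```python
# Sketch: after processing the first portion of a voice input, detect the
# voice-action type and load entities relevant to its parameters, so the
# model is biased toward them for the remainder of the same input.

ACTION_ENTITIES = {
    "play_music": ["Bohemian Rhapsody", "Abbey Road"],  # e.g. media library
    "call_contact": ["Alice Smith", "Bob Jones"],       # e.g. contact list
}

def detect_action(first_portion):
    """Toy action-type detector over the already-decoded first portion."""
    if first_portion.startswith("play"):
        return "play_music"
    if first_portion.startswith("call"):
        return "call_contact"
    return None

def biased_vocabulary(base_vocab, first_portion):
    """Return the vocabulary augmented with action-relevant entities."""
    action = detect_action(first_portion)
    return list(base_vocab) + ACTION_ENTITIES.get(action, [])

vocab = biased_vocabulary(["play", "call", "stop"], "play some")
# song titles are now candidates for the later portion of the input
```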
Abstract:
A speech recognition system used for hands-free data entry receives and analyzes speech input to recognize and accept a user's response. Under certain conditions, a user's response might be expected. In these situations, the expected response may modify the behavior of the speech recognition system to improve performance. For example, if the hypothesis of a user's response matches the expected response, then there is a high probability that the user's response was recognized correctly. This information may be used to make adjustments. An expected response may include expected response parts, each part containing expected words. By treating an expected response as the concatenation of expected response parts, each part may be considered independently for the purposes of adjusting an acceptance algorithm, adjusting a model, or recording an apparent error. In this way, the speech recognition system may make modifications based on a wide range of user responses.
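Per-part acceptance adjustment can be sketched as below. The length-based alignment and the specific threshold values are assumptions; the point is only that each expected-response part gets its own acceptance decision, with the threshold relaxed when that part matches expectation.

```python
# Sketch: treat an expected response as a concatenation of parts and
# compare the hypothesis part-by-part, so the acceptance threshold can
# be adjusted independently per part. Threshold values are toy numbers.

def split_hypothesis(hyp_words, parts):
    """Align hypothesis words to expected parts by part length, in order."""
    out, i = [], 0
    for part in parts:
        out.append(hyp_words[i:i + len(part)])
        i += len(part)
    return out

def accept(hyp_words, expected_parts, scores, lowered=0.3, normal=0.7):
    """Accept each part, relaxing the threshold when it matches expectation."""
    results = []
    for hyp_part, exp_part, score in zip(
            split_hypothesis(hyp_words, expected_parts), expected_parts, scores):
        threshold = lowered if hyp_part == exp_part else normal
        results.append(score >= threshold)
    return results

# Expected response: part "one two" then part "alpha". The second part
# scored low, but it matches expectation, so the relaxed threshold accepts it.
print(accept(["one", "two", "alpha"],
             [["one", "two"], ["alpha"]],
             scores=[0.9, 0.5]))   # [True, True]
```

A part that fails its (unrelaxed) threshold could be re-prompted or logged as an apparent error without discarding the parts that were accepted.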
Abstract:
A speech recognition system used in a workflow receives and analyzes speech input to recognize and accept a user's response to a task. Under certain conditions, a user's response might be expected. In these situations, the expected response may modify the behavior of the speech recognition system to improve recognition accuracy. For example, if the hypothesis of a user's response matches the expected response, then there is a high probability that the user's response was recognized correctly. An expected response may include expected words and wildcard words. Wildcard words represent any recognized word in a user's response. By including wildcard words in the expected response, the speech recognition system may make modifications based on a wide range of user responses.
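Wildcard matching reduces to a short comparison. The `"*"` token and positional matching are assumptions for illustration; the abstract only states that a wildcard word stands for any recognized word in the user's response.

```python
# Sketch: match a hypothesis against an expected response that mixes
# literal expected words with wildcard words ("*"), where a wildcard
# matches any single recognized word at that position.

def matches_expected(hypothesis, expected):
    """True if every word is either a wildcard match or a literal match."""
    if len(hypothesis) != len(expected):
        return False
    return all(e == "*" or h == e for h, e in zip(hypothesis, expected))

# Workflow prompt expects "quantity <number> confirmed"; the middle word
# may be any recognized word, so every spoken quantity matches.
print(matches_expected(["quantity", "five", "confirmed"],
                       ["quantity", "*", "confirmed"]))   # True
print(matches_expected(["quantity", "five", "cancel"],
                       ["quantity", "*", "confirmed"]))   # False
```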
Abstract:
An electronic device, a method, and a chip set are provided. The electronic device includes a memory configured to store at least one of audio feature data of audio data and speech recognition data obtained by speech recognition of audio data; and a control module connected to the memory, wherein the control module is configured to update a voice command that is set to execute a function through voice, the function being selected based on at least one of the audio feature data, the speech recognition data, and function execution data executed in relation to the audio data.
Abstract:
Examples of methods and systems for building speech recognition systems from speech recording logs are described. In some examples, a method may be performed by a computing device within a system to generate modified data logs to use as a training data set for an acoustic model for a particular language. A device may receive one or more data logs that comprise one or more recordings of spoken queries and transcribe the recordings. Based on comparisons, the device may identify transcriptions indicative of noise and remove them from the data logs. Further, the device may remove unwanted transcriptions from the data logs and provide the modified data logs as a training data set to one or more acoustic models for particular languages.
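The filtering stages can be sketched as a small pipeline. The noise marker, confidence threshold, and blacklist below are assumed heuristics standing in for whatever comparisons the system actually performs; the structure (transcribe, drop noise, drop unwanted, keep the rest for training) follows the abstract.

```python
# Sketch of the log-cleaning pipeline: drop transcriptions indicative of
# noise (unrecognizable or low-confidence results), drop unwanted ones
# (e.g. blacklisted content), and return the remainder as training data.

NOISE_MARKER = "<unk>"          # assumed marker for unrecognizable audio
UNWANTED = {"offensive-term"}   # assumed blacklist of unwanted content

def clean_logs(transcriptions, min_confidence=0.5):
    """Filter (text, confidence) pairs down to usable training sentences."""
    kept = []
    for text, confidence in transcriptions:
        if not text or NOISE_MARKER in text or confidence < min_confidence:
            continue  # indicative of noise
        if any(word in UNWANTED for word in text.split()):
            continue  # unwanted transcription
        kept.append(text)
    return kept

logs = [("navigate home", 0.92), ("<unk> <unk>", 0.90),
        ("play music", 0.20), ("offensive-term query", 0.95)]
print(clean_logs(logs))   # ['navigate home']
```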
Abstract:
A computer-implemented method, comprising: receiving, by a computing device, audio data for a voice input to the computing device, wherein the voice input corresponds to an unknown stage of a multi-stage voice dialog between the computing device and a user of the computing device; determining an estimate for the unknown stage of the multi-stage voice dialog; providing, to a voice dialog system, (i) the audio data for the voice input to the computing device and (ii) an indication of the estimate for the unknown stage of the multi-stage voice dialog; obtaining, by the computing device and from the voice dialog system, a transcription of the voice input, wherein the transcription was generated by processing the audio data with a model that was biased according to parameters that correspond to a particular prediction for the unknown stage of the multi-stage voice dialog, wherein the voice dialog system is configured to determine the particular prediction for the unknown stage of the multi-stage voice dialog based on (i) the estimate for the unknown stage of the multi-stage voice dialog and (ii) additional information that indicates a context of the voice input; and presenting the transcription of the voice input with the computing device.
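The server-side biasing step can be sketched as hypothesis rescoring. The stage names, bias values, and the rule of preferring context over the client's raw estimate are assumptions; the abstract specifies only that the prediction combines the stage estimate with contextual information and biases the model accordingly.

```python
# Sketch: combine the client's stage estimate with context to predict the
# dialog stage, then rescore recognition hypotheses with stage-specific
# biases before choosing a transcription.

STAGE_BIAS = {
    "confirmation": {"yes": 0.3, "no": 0.3},       # boost likely answers
    "destination":  {"airport": 0.3, "home": 0.3},
}

def predict_stage(estimate, context):
    """Prefer contextual evidence (e.g. the last prompt) over the estimate."""
    return context.get("last_prompt_stage", estimate)

def transcribe(hypotheses, estimate, context):
    """Pick the hypothesis whose score plus stage bias is highest."""
    bias = STAGE_BIAS.get(predict_stage(estimate, context), {})
    return max(hypotheses, key=lambda h: h[1] + bias.get(h[0], 0.0))[0]

hyps = [("yes", 0.5), ("yeah s", 0.6)]            # acoustically ambiguous
ctx = {"last_prompt_stage": "confirmation"}       # contextual information
print(transcribe(hyps, "destination", ctx))       # 'yes' (0.5 + 0.3 > 0.6)
```

Without the contextual stage prediction, the raw acoustic score would win and the garbled hypothesis would be returned, which is exactly the failure the stage-biased model avoids.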