Patent search: ap:("Marcus Alexander Foster" OR "Richard Zarek Cohen") AND inv:"Richard Zarek Cohen" (page 1)

1.

Granted invention patent
Automatically training speech synthesizers (Status: in force)

Publication No.: US08423366B1

Publication date: 2013-04-16

Application No.: US13552484

Filing date: 2012-07-18

Applicant: Marcus Alexander Foster, Richard Zarek Cohen

Inventor: Marcus Alexander Foster, Richard Zarek Cohen

IPC classification: G10L13/00

CPC classification: G10L13/06, G10L15/26, G10L15/30

Abstract: A method includes receiving, by a system, a voice recording associated with a user, transcribing the voice recording into text that includes a group of words, and storing an association between a portion of each respective word and a corresponding portion of the voice recording. The corresponding portion of the voice recording is the portion of the voice recording from which the portion of the respective word was transcribed. The method may also include determining a modification to a speech synthesis voice associated with the user based at least in part on the association.
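The stored association described in this abstract can be pictured as an alignment table that maps each word portion to the time span of the recording it was transcribed from. The following is a minimal sketch under stated assumptions, not the patented implementation: the `Alignment` record, the millisecond spans, and the helper names are all hypothetical, and the recording is modeled as a plain list of samples.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical record linking a word portion to a span of audio."""
    word: str       # transcribed word
    portion: str    # portion of the word (for example a syllable)
    start_ms: int   # where the portion starts in the recording
    end_ms: int     # where the portion ends in the recording


def store_alignments(alignments):
    """Index the alignments by word portion so they can be looked up later."""
    index = {}
    for item in alignments:
        index.setdefault(item.portion, []).append(item)
    return index


def clip_for(index, portion, recording, sample_rate_hz):
    """Return the samples for the first stored occurrence of a word portion."""
    first = index[portion][0]
    start = first.start_ms * sample_rate_hz // 1000
    end = first.end_ms * sample_rate_hz // 1000
    return recording[start:end]
```

A synthesizer-training step could then pull the user's own audio for a given word portion with `clip_for` and compare it against the current synthesis voice; how that comparison drives a modification is not specified in the abstract and is left out here.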

2.

Granted invention patent
Directing dictation into input fields (Status: in force)

Publication No.: US08255218B1

Publication date: 2012-08-28

Application No.: US13245698

Filing date: 2011-09-26

Applicant: Richard Zarek Cohen, Luca Zanolin, Marcus Alexander Foster

Inventor: Richard Zarek Cohen, Luca Zanolin, Marcus Alexander Foster

IPC classification: G10L11/00, G10L15/00, G10L17/00

CPC classification: G10L15/22, G06F3/167

Abstract: In general, this disclosure describes techniques to direct textual characters converted from vocal input into selected graphical user interface input fields. Vocal input may be received. Textual characters may be identified based on the vocal input. A first portion of the textual characters corresponding to a first portion of the vocal input may be graphically inputted into a first input field of a GUI. While receiving the vocal input, a selection of a second input field in the GUI may be accepted after the first portion of the vocal input has been received. After accepting the selection of the second input field, a second portion of the textual characters corresponding to a second portion of the vocal input received after the selection of the second input field may be inputted into the second input field.
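The routing behavior in this abstract can be illustrated with a small event loop: recognized text goes to whichever input field is currently selected, and a selection made partway through the dictation redirects only the text that arrives afterwards. This is a minimal sketch, not the patented implementation; the event tuples and the `route_dictation` function are hypothetical.

```python
def route_dictation(events, first_field):
    """Route recognized text into the input field selected at that moment.

    `events` is a hypothetical, time-ordered stream of tuples:
      ("text", "<recognized characters>")  or  ("select", "<field name>")
    """
    fields = {first_field: []}
    active = first_field
    for kind, value in events:
        if kind == "select":
            # A selection arriving mid-dictation changes the target field.
            active = value
            fields.setdefault(active, [])
        elif kind == "text":
            fields[active].append(value)
        else:
            raise ValueError(f"unknown event kind: {kind!r}")
    return {name: " ".join(parts) for name, parts in fields.items()}
```

For example, dictating a subject line, tapping the body field, and continuing to speak would leave the first portion of the text in the subject field and the second portion in the body field.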

3.

Granted invention patent
Multi hotword robust continuous voice command detection in mobile devices (Status: in force)

Publication No.: US08924219B1

Publication date: 2014-12-30

Application No.: US13586975

Filing date: 2012-08-16

Applicant: Bjorn Erik Bringert, Hugo Barra, Richard Zarek Cohen

Inventor: Bjorn Erik Bringert, Hugo Barra, Richard Zarek Cohen

IPC classification: G10L21/00

CPC classification: G10L15/183, G06F3/04842, G10L15/22, G10L15/32, G10L2015/223

Abstract: In a first speech detection mode, a computing device listens for speech that corresponds to one of a plurality of activation phrases or “hotwords” that cause the computing device to recognize further speech input in a second speech detection mode. Each activation phrase is associated with a respective application. During the first speech detection mode, the computing device compares detected speech to the activation phrases to identify any potential matches. In response to identifying a matching activation phrase with a sufficiently high confidence, the computing device invokes the application associated with the matching activation phrase and enters the second speech detection mode. In the second speech detection mode, the computing device listens for speech input related to the invoked application.
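The two detection modes in this abstract amount to a small state machine. The sketch below is illustrative only and makes several assumptions: the hotword table and the 0.8 threshold are invented, recognized speech is represented as text, and string similarity from the standard library's `difflib.SequenceMatcher` stands in for a recognizer's confidence score.

```python
from difflib import SequenceMatcher

# Hypothetical activation phrases, each tied to one application.
HOTWORDS = {"call": "phone", "navigate to": "maps"}
THRESHOLD = 0.8  # assumed confidence cut-off


def match_hotword(heard, hotwords=HOTWORDS, threshold=THRESHOLD):
    """Return the application for the best-matching hotword, or None."""
    best_phrase, best_score = None, 0.0
    for phrase in hotwords:
        score = SequenceMatcher(None, heard.lower(), phrase).ratio()
        if score > best_score:
            best_phrase, best_score = phrase, score
    if best_score >= threshold:
        return hotwords[best_phrase]
    return None


class Listener:
    """First mode waits for a hotword; second mode collects app input."""

    def __init__(self):
        self.mode = "hotword"
        self.app = None
        self.inputs = []

    def hear(self, utterance):
        if self.mode == "hotword":
            app = match_hotword(utterance)
            if app is not None:
                self.app = app          # invoke the associated application
                self.mode = "app_input"  # enter the second detection mode
        else:
            self.inputs.append((self.app, utterance))
```

Speech that does not match any activation phrase closely enough leaves the listener in the first mode, which is the role the confidence threshold plays in the abstract.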

4.

Granted invention patent
Layered mobile application user interfaces (Status: in force)

Publication No.: US09135914B1

Publication date: 2015-09-15

Application No.: US13621094

Filing date: 2012-09-15

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin

IPC classification: G10L21/00, G10L25/00, G06F3/01, G06F3/16, G10L15/22

CPC classification: G10L15/22, G06F3/016, G06F3/0488, G06F2203/0381, G10L2015/223

Abstract: Disclosed are systems, methods, and devices for providing a layered user interface for one or more applications. A user-interface layer for a voice user interface is generated. The user-interface layer can be based on a markup-language-structured user-interface description for an application configured to execute on a computing device. The user-interface layer can include a command display of one or more voice-accessible commands for the application. The computing device can display at least the user-interface layer of the voice user interface. The computing device can receive an input utterance, obtain input text based upon speech recognition performed upon the input utterance, and determine that the input text corresponds to a voice-accessible command displayed as part of the command display. The computing device can execute the application to perform the command.
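One way to picture a voice layer derived from a markup-structured UI description is to parse the description and expose each button label as a voice-accessible command. The sketch below is an assumption-laden illustration, not the patented implementation: the XML schema (`screen`, `button`, `label`, `id`) is invented for the example, and only the standard library's `xml.etree.ElementTree` is used.

```python
import xml.etree.ElementTree as ET

# Hypothetical UI description; the element and attribute names are invented.
UI_DESCRIPTION = """
<screen app="notes">
  <button id="new" label="New note"/>
  <button id="delete" label="Delete note"/>
  <text id="title" label="Title"/>
</screen>
"""


def build_voice_layer(markup):
    """Map each button label (lower-cased) to the id of the action it triggers."""
    root = ET.fromstring(markup)
    return {b.get("label").lower(): b.get("id") for b in root.iter("button")}


def match_command(input_text, layer):
    """Return the action id if the recognized text is a displayed command."""
    return layer.get(input_text.strip().lower())
```

The keys of the returned dictionary are what a command display would show; recognized text that is not one of those keys matches nothing.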

5.

Granted invention patent
Structuring verbal commands to allow concatenation in a voice interface in a mobile device (Status: in force)

Publication No.: US08452602B1

Publication date: 2013-05-28

Application No.: US13621018

Filing date: 2012-09-15

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Henrique Penha, Simon Tickner, Luca Zanolin, Richard Zarek Cohen, Michael J. LeBeau

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Henrique Penha, Simon Tickner, Luca Zanolin, Richard Zarek Cohen, Michael J. LeBeau

IPC classification: G10L21/00, G10L15/04

CPC classification: G06F3/167, G10L15/22, G10L2015/223, G10L2015/228

Abstract: A spoken utterance includes at least a first level of a multi-level command format, in which the first level identifies an application. The spoken utterance may also include a second level of the multi-level command format, in which the second level identifies an action. In response to receiving the spoken utterance at a computing device, a representation of the application identified by the first level is displayed on a display of the computing device. If the spoken utterance includes the second level of the multi-level command format, the action identified by the second level is initiated. If the spoken utterance does not include the second level of the multi-level command format, the computing device waits for a predetermined period of time and provides at least one of an audible or visual action prompt if the second level is not received within the predetermined period of time.
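The multi-level command format in this abstract can be sketched as a two-step parse: the first word names an application, an optional second word names an action, and a missing action triggers a wait followed by a prompt. Everything below is a hypothetical illustration: the `APPS` table, the `wait_for_action` callback (assumed to return `None` on timeout), and the string-valued steps are not taken from the patent.

```python
# Hypothetical command table: application -> actions it understands.
APPS = {"music": {"play", "pause"}, "camera": {"snap"}}


def parse_command(utterance, apps=APPS):
    """Split an utterance into (application, action); action may be None."""
    words = utterance.lower().split()
    if not words or words[0] not in apps:
        return None
    app = words[0]
    action = words[1] if len(words) > 1 and words[1] in apps[app] else None
    return app, action


def handle(utterance, wait_for_action, timeout_s=3.0):
    """Return the ordered steps a device would take for the utterance."""
    parsed = parse_command(utterance)
    if parsed is None:
        return ["ignored"]
    app, action = parsed
    steps = [f"show:{app}"]  # the application is displayed in every case
    if action is None:
        # Second level missing: wait, then prompt if nothing arrives in time.
        action = wait_for_action(timeout_s)
        if action is None:
            steps.append(f"prompt:{app}")
            return steps
    steps.append(f"run:{app}.{action}")
    return steps
```

Passing the wait as a callback keeps the sketch free of real timers; a device would block on the microphone for the predetermined period instead.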

6.

Granted invention patent
Voice control for asynchronous notifications (Status: in force)

Publication No.: US08468022B2

Publication date: 2013-06-18

Application No.: US13626375

Filing date: 2012-09-25

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Dave Burke, Henrique Penha, Simon Tickner, Richard Zarek Cohen, Luca Zanolin, Michael J. LeBeau

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Dave Burke, Henrique Penha, Simon Tickner, Richard Zarek Cohen, Luca Zanolin, Michael J. LeBeau

IPC classification: G10L21/00

CPC classification: G10L13/02, G06F3/167, G10L15/26, G10L15/265, G10L15/28, G10L2015/221, G10L2015/223, H04L51/24, H04M1/72519, H04M1/72552, H04M3/4936, H04M2250/74

Abstract: A computing device may receive an incoming communication and, in response, generate a notification that indicates that the incoming communication can be accessed using a particular application on the communication device. The computing device may further provide an audio signal indicative of the notification and automatically activate a listening mode. The computing device may receive a voice input during the listening mode, and an input text may be obtained based on speech recognition performed upon the voice input. A command may be detected in the input text. In response to the command, the computing device may generate an output text that is based on at least the notification and provide a voice output that is generated from the output text via speech synthesis. The voice output identifies at least the particular application.

7.

Granted invention patent
Systems and methods for continual speech recognition and detection in mobile computing devices (Status: in force)

Publication No.: US08452597B2

Publication date: 2013-05-28

Application No.: US13621068

Filing date: 2012-09-15

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Dave Burke

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Dave Burke

IPC classification: G10L15/04, G10L15/00, G10L21/00

CPC classification: G10L15/285, G06F3/167, G10L15/26, G10L15/28, G10L17/02, G10L17/22, G10L21/10, G10L2015/088, G10L2015/223, G10L2015/228

Abstract: The present application describes systems, articles of manufacture, and methods for continuous speech recognition for mobile computing devices. One embodiment includes determining whether a mobile computing device is receiving operating power from an external power source or a battery power source, and activating a trigger word detection subroutine in response to determining that the mobile computing device is receiving power from the external power source. In some embodiments, the trigger word detection subroutine operates continually while the mobile computing device is receiving power from the external power source. The trigger word detection subroutine includes determining whether a plurality of spoken words received via a microphone includes one or more trigger words, and in response to determining that the plurality of spoken words includes at least one trigger word, launching an application corresponding to the at least one trigger word included in the plurality of spoken words.
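The embodiment described in this abstract gates trigger word detection on the power source. A minimal sketch of that gating follows; the trigger table, the `"external"` and `"battery"` labels, and the function names are assumptions made for illustration, and spoken words are represented as already-recognized strings.

```python
# Hypothetical trigger words, each mapped to the application it launches.
TRIGGERS = {"navigate": "maps", "call": "dialer"}


def detect_triggers(words, triggers=TRIGGERS):
    """Return the applications for every trigger word among the spoken words."""
    return [triggers[w] for w in words if w in triggers]


def process(power_source, words):
    """Run trigger detection only while the device is on external power."""
    if power_source != "external":
        return []  # on battery, the detection subroutine is not activated
    return detect_triggers([w.lower() for w in words])
```

Restricting the continual listening loop to external power is what keeps the subroutine from draining the battery, which is the trade-off the abstract points at.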

8.

Granted invention patent
Voice application finding and user invoking applications related to a single entity (Status: in force)

Publication No.: US08515766B1

Publication date: 2013-08-20

Application No.: US13631282

Filing date: 2012-09-28

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Marcus Foster

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Marcus Foster

IPC classification: G10L15/08, G10L15/00

CPC classification: G10L15/22, G10L2015/223, G10L2015/228

Abstract: A computing device is configured to initiate actions in response to speech input that includes a name or other indication of an entity, in a first spoken utterance, followed by a user choosing an application related to an entity, in a second spoken utterance. The computing device receives the first spoken utterance, identifies an entity based on the first spoken utterance, and indicates a plurality of available applications related to the identified entity. The computing device then receives the second spoken utterance and identifies a selection of at least one of the available applications based on the second spoken utterance. The computing device then invokes the at least one selected application.
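The two-utterance flow in this abstract (name an entity, then pick one of the applications offered for it) can be sketched with a lookup table. This is an illustration under assumptions, not the patented implementation: the `ENTITY_APPS` table and the word-level matching are hypothetical, and both utterances are represented as recognized text.

```python
# Hypothetical mapping from an entity to the applications related to it.
ENTITY_APPS = {"alice": ["phone", "email", "chat"]}


def available_apps(first_utterance, entity_apps=ENTITY_APPS):
    """Identify an entity in the first utterance and list its applications."""
    for word in first_utterance.lower().split():
        if word in entity_apps:
            return word, entity_apps[word]
    return None, []


def select_apps(second_utterance, apps):
    """Return the offered applications that the second utterance names."""
    spoken = set(second_utterance.lower().split())
    return [app for app in apps if app in spoken]
```

A device would display or speak the list returned by `available_apps` before listening for the second utterance, then invoke whatever `select_apps` returns.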

9.

Invention patent application
Voice Control For Asynchronous Notifications (Status: in force)

Publication No.: US20130085761A1

Publication date: 2013-04-04

Application No.: US13626375

Filing date: 2012-09-25

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Dave Burke, Henrique Penha, Simon Tickner, Richard Zarek Cohen, Luca Zanolin, Michael J. LeBeau

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Dave Burke, Henrique Penha, Simon Tickner, Richard Zarek Cohen, Luca Zanolin, Michael J. LeBeau

IPC classification: G10L21/00

CPC classification: G10L13/02, G06F3/167, G10L15/26, G10L15/265, G10L15/28, G10L2015/221, G10L2015/223, H04L51/24, H04M1/72519, H04M1/72552, H04M3/4936, H04M2250/74

Abstract: A computing device may receive an incoming communication and, in response, generate a notification that indicates that the incoming communication can be accessed using a particular application on the communication device. The computing device may further provide an audio signal indicative of the notification and automatically activate a listening mode. The computing device may receive a voice input during the listening mode, and an input text may be obtained based on speech recognition performed upon the voice input. A command may be detected in the input text. In response to the command, the computing device may generate an output text that is based on at least the notification and provide a voice output that is generated from the output text via speech synthesis. The voice output identifies at least the particular application.

10.

Invention patent application
Systems And Methods For Continual Speech Recognition And Detection In Mobile Computing Devices (Status: in force)

Publication No.: US20130085755A1

Publication date: 2013-04-04

Application No.: US13621068

Filing date: 2012-09-15

Applicant: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Dave Burke

Inventor: Bjorn Erik Bringert, Pawel Pietryka, Peter John Hodgson, Simon Tickner, Henrique Penha, Richard Zarek Cohen, Luca Zanolin, Dave Burke

IPC classification: G10L15/26

CPC classification: G10L15/285, G06F3/167, G10L15/26, G10L15/28, G10L17/02, G10L17/22, G10L21/10, G10L2015/088, G10L2015/223, G10L2015/228

Abstract: The present application describes systems, articles of manufacture, and methods for continuous speech recognition for mobile computing devices. One embodiment includes determining whether a mobile computing device is receiving operating power from an external power source or a battery power source, and activating a trigger word detection subroutine in response to determining that the mobile computing device is receiving power from the external power source. In some embodiments, the trigger word detection subroutine operates continually while the mobile computing device is receiving power from the external power source. The trigger word detection subroutine includes determining whether a plurality of spoken words received via a microphone includes one or more trigger words, and in response to determining that the plurality of spoken words includes at least one trigger word, launching an application corresponding to the at least one trigger word included in the plurality of spoken words.
