Patent search: ap:("Amazon Technologies Inc.") AND inv:"Fred Torok"

41.

Invention application
IDENTIFICATION OF UTTERANCE SUBJECTS (Under examination, published)

Publication (announcement) number: US20150179175A1

Publication (announcement) date: 2015-06-25

Application number: US14642365

Application date: 2015-03-09

Applicant: Amazon Technologies, Inc.

Inventor: Fred Torok, Frédéric Johan Georges Deramat, Vikram Kumar Gundeti

IPC: G10L15/26, G10L15/08

CPC classification number: G10L15/26, G06F17/30684, G06F17/3074, G06F17/30746, G06F17/30778, G10L15/08, G10L15/222, G10L15/30

Abstract: Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance.
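
Illustrative sketch (not taken from the patent text): the marker flow in the abstract above might look roughly like the following. The names `Marker`, `hint_for`, and `resolve_utterance` are hypothetical, and the plain string substitution is an assumption standing in for the ASR/NLU hint handling, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Marker:
    """Hypothetical marker labelling one portion of a TTS presentation."""
    marker_id: str
    content: str  # the content portion this marker identifies


def hint_for(markers: List[Marker], reported_marker_id: str) -> Optional[str]:
    """Return the content portion for the marker the client reported, if any."""
    for marker in markers:
        if marker.marker_id == reported_marker_id:
            return marker.content
    return None


def resolve_utterance(text: str, markers: List[Marker], reported_marker_id: str) -> str:
    """Replace a pronoun that has no antecedent with the hinted content portion.

    Assumption: a real system would pass the hint to ASR/NLU modules rather
    than rewrite the text; substitution is used here only to show the idea.
    """
    hint = hint_for(markers, reported_marker_id)
    if hint is None:
        return text
    pronouns = {"that", "this", "it"}
    return " ".join(hint if word.lower() in pronouns else word for word in text.split())


# The client reports marker "m2" because that portion was playing
# when the user spoke.
presentation = [Marker("m1", "Song A"), Marker("m2", "Song B")]
resolved = resolve_utterance("buy that", presentation, "m2")
```

Here the utterance "buy that" is associated with the second content portion because the client sent marker `m2` along with the captured audio.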


42.

Granted invention patent
Measurement of user perceived latency in a cloud based speech application (In force)

Publication (announcement) number: US09064495B1

Publication (announcement) date: 2015-06-23

Application number: US13889277

Application date: 2013-05-07

Applicant: Amazon Technologies, Inc.

Inventor: Fred Torok, Peter Spalding VanLund

IPC: G10L15/00, G10L15/28

CPC classification number: G10L15/01, G10L15/30, H04L41/5067, H04L43/0852, H04L43/0864

Abstract: In some embodiments, a user device receives a voice signal corresponding to a user utterance. The user device may set a time marker corresponding to a point in time in the voice signal. The voice signal and the time marker may be transmitted to a server device. The server device may perform speech recognition using the voice signal. The server device may determine a time offset corresponding to a difference in time between an end point of the user utterance and a time associated with the time marker. The server device may determine a response to the user utterance. The server device may transmit the time offset and the response to the user device. The user device may use the time offset to determine a user-perceived latency between the end of the user utterance and a beginning of the response.
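
Illustrative sketch (not taken from the patent text): the latency arithmetic described in the abstract above can be written out as follows. The names `ServerReply` and `user_perceived_latency` are hypothetical, and the sketch assumes that the device records the time marker and the start of the response on its own clock, while the server measures the offset within the audio stream.

```python
from dataclasses import dataclass


@dataclass
class ServerReply:
    """Hypothetical server reply: the response plus the time offset."""
    response: str
    offset_s: float  # end of utterance minus the time-marker position, in seconds


def user_perceived_latency(marker_time_s: float,
                           reply: ServerReply,
                           response_start_s: float) -> float:
    """Latency between the end of the utterance and the start of the response.

    marker_time_s and response_start_s are both read from the device clock,
    so no clock synchronization with the server is needed.
    """
    utterance_end_s = marker_time_s + reply.offset_s
    return response_start_s - utterance_end_s


# Marker set at 10.0 s (device clock); the server finds the utterance ended
# 2.5 s after the marker; the response starts playing at 13.25 s.
reply = ServerReply(response="It is sunny.", offset_s=2.5)
latency = user_perceived_latency(10.0, reply, 13.25)
```

With these example values the utterance ends at 12.5 s on the device clock, so the computed latency is 0.75 s.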


43.

Granted invention patent
Identification of utterance subjects (In force)

Publication (announcement) number: US08977555B2

Publication (announcement) date: 2015-03-10

Application number: US13723026

Application date: 2012-12-20

Applicant: Amazon Technologies, Inc.

Inventor: Fred Torok, Frédéric Johan Georges Deramat, Vikram Kumar Gundeti

IPC: G10L21/00, G10L25/00, G10L13/00, G10L15/00, G06F17/30

CPC classification number: G10L15/26, G06F17/30684, G06F17/3074, G06F17/30746, G06F17/30778, G10L15/08, G10L15/222, G10L15/30

Abstract: Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance.

