Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Thomas Schaaf"

1.

发明授权
Voice-assisted scanning 有权

公开(公告)号：US09767501B1

公开(公告)日：2017-09-19

申请号：US14074346

申请日：2013-11-07

Applicant: Amazon Technologies, Inc.

Inventor： Thomas Schaaf , Stan Weidner Salvador

IPC: G06Q30/00 , G06K7/10 , G10L15/26 , G06Q30/06 , G06F17/20

CPC classification number: G06Q30/0623

Abstract: In some cases, a handheld device that includes a microphone and a scanner may be used for voice-assisted scanning. For example, a user may provide a voice input via the microphone and may activate the scanner to scan an item identifier (e.g., a barcode). The handheld device may communicate voice data and item identifier information to a remote system for voice-assisted scanning. The remote system may perform automatic speech recognition (ASR) operations on the voice data and may perform item identification operations based on the scanned identifier. Natural language understanding (NLU) processing may be improved by combining ASR information with item information obtained based on the scanned identifier. An action may be executed based on the likely user intent.

2.

发明申请
SMART CIRCULAR AUDIO BUFFER 有权
Title translation: 智能通信音频缓冲器

公开(公告)号：US20150066494A1

公开(公告)日：2015-03-05

申请号：US14016403

申请日：2013-09-03

Applicant: Amazon Technologies. Inc.

Inventor： Stan Weidner Salvador , Thomas Schaaf

IPC: G10L21/02

CPC classification number: G10L21/0202 , G06F3/16 , G06F3/165 , G06F3/167 , G06F17/2872 , G10L15/00 , G10L17/00 , G10L21/00 , G10L21/0208 , G10L25/78 , H04L1/1874

Abstract: An audio buffer is used to capture audio in anticipation of a user command to do so. Sensors and processor activity may be monitored, looking for indicia suggesting that the user command may be forthcoming. Upon detecting such indicia, a circular buffer is activated. Audio correction may be applied to the audio stored in the circular buffer. After receiving the user command instructing the device to process or record audio, at least a portion of the audio that was stored in the buffer before the command is combined with audio received after the command. The combined audio may then be processed, transmitted or stored.

Abstract translation: 使用音频缓冲器来捕获音频，期望用户命令这样做。可以监控传感器和处理器的活动，寻找表明用户命令可能出现的标记。在检测到这样的标记时，循环缓冲器被激活。音频校正可以应用于存储在循环缓冲器中的音频。在接收到指令设备处理或记录音频的用户命令之后，在命令与命令之后接收的音频组合之后存储在缓冲器中的音频的至少一部分。然后可以处理，发送或存储组合的音频。

3.

发明授权
Smart circular audio buffer 有权

公开(公告)号：US09633669B2

公开(公告)日：2017-04-25

申请号：US14016403

申请日：2013-09-03

Applicant: Amazon Technologies, Inc.

Inventor： Stan Weidner Salvador , Thomas Schaaf

IPC: G10L21/00 , G10L21/02 , G06F17/28 , H04L1/18 , G10L17/00 , G10L25/78 , G10L21/0208

CPC classification number: G10L21/0202 , G06F3/16 , G06F3/165 , G06F3/167 , G06F17/2872 , G10L15/00 , G10L17/00 , G10L21/00 , G10L21/0208 , G10L25/78 , H04L1/1874

Abstract: An audio buffer is used to capture audio in anticipation of a user command to do so. Sensors and processor activity may be monitored, looking for indicia suggesting that the user command may be forthcoming. Upon detecting such indicia, a circular buffer is activated. Audio correction may be applied to the audio stored in the circular buffer. After receiving the user command instructing the device to process or record audio, at least a portion of the audio that was stored in the buffer before the command is combined with audio received after the command. The combined audio may then be processed, transmitted or stored.

4.

发明授权
Controlling offensive content in output 有权
Title translation: 控制产出中的令人反感的内容

公开(公告)号：US09405741B1

公开(公告)日：2016-08-02

申请号：US14223648

申请日：2014-03-24

Applicant: Amazon Technologies, Inc.

Inventor： Thomas Schaaf , Sumedha Arvind Kshirsagar , Roger Alix-Gaudreau , Remus Razvan Mois , Rafal Kuklinski , Derek Christopher Murman

IPC: G06F17/27 , G10L15/08

CPC classification number: G06F17/27 , G06F17/274 , G10L13/00 , G10L15/08 , G10L15/22

Abstract: Features are disclosed for recognizing inappropriate content in an output. The offensive content may be generated as a result of a speech processing error. A system may identify the inappropriate elements of a generated output and select among different appropriate alternatives. The system may be adjusted based on certain user characteristics. The system may be localized based on language and cultural features. The system may modify the generated output based on characteristics such as the tolerance threshold of known persons in the proximity of the system. The tolerance threshold may further be used to personalize and modify available content. Models used by the system may be further trained using input from a user.

Abstract translation: 公开了用于识别输出中的不适当内容的特征。可能由于语音处理错误而产生令人反感的内容。系统可以识别生成的输出的不适当元素，并在不同的适当替代方案中进行选择。可以基于某些用户特征来调整系统。该系统可以基于语言和文化特征进行本地化。系统可以基于诸如在系统附近的已知人员的容许阈值的特性来修改生成的输出。公差阈值还可用于个性化和修改可用内容。可以使用来自用户的输入来进一步训练系统使用的模型。

5.

发明授权
Vehicle voice user interface 有权

公开(公告)号：US11273778B1

公开(公告)日：2022-03-15

申请号：US15808022

申请日：2017-11-09

Applicant: Amazon Technologies, Inc.

Inventor： Hamza Lakhani , Thomas Schaaf , Leah Rose Nicolich-Henkin , Ricardo DeMatos , Mingzhi Yu

IPC: B60R16/03 , B60R16/037 , G06F3/16 , G06K9/00 , G10L13/10 , G10L13/033 , G10L15/30 , G10L25/63 , G10L15/22 , G06N20/00

Abstract: Techniques for engaging a drowsy or otherwise impaired driver of a vehicle in a VUI dialog are described. A vehicle computing system sends data (e.g., raw sensor data and/or an indication that a driver is impaired determined based on the raw sensor data) to a remote server(s). The remote server(s) may separately determine whether the driver is impaired based on the raw sensor data and/or other contextual data. The remote server(s) selects a speechlet to provide output data based on the sensor data, contextual data, and or a level at which the driver is impaired. The remote server(s) then causes the vehicle computing system to present output audio corresponding to output data provided by the speechlet.

6.

发明授权
Voice-assisted scanning 有权

公开(公告)号：US11321756B1

公开(公告)日：2022-05-03

申请号：US15707816

申请日：2017-09-18

Applicant: Amazon Technologies, Inc.

Inventor： Thomas Schaaf , Stan Weidner Salvador

IPC: G06Q30/06 , G06F40/00

Abstract: In some cases, a handheld device that includes a microphone and a scanner may be used for voice-assisted scanning. For example, a user may provide a voice input via the microphone and may activate the scanner to scan an item identifier (e.g., a barcode). The handheld device may communicate voice data and item identifier information to a remote system for voice-assisted scanning. The remote system may perform automatic speech recognition (ASR) operations on the voice data and may perform item identification operations based on the scanned identifier. Natural language understanding (NLU) processing may be improved by combining ASR information with item information obtained based on the scanned identifier. An action may be executed based on the likely user intent.

7.

发明授权
Selective speech recognition scoring using articulatory features 有权
Title translation: 使用发音功能的选择性语音识别评分

公开(公告)号：US09355636B1

公开(公告)日：2016-05-31

申请号：US14027828

申请日：2013-09-16

Applicant: Amazon Technologies, Inc.

Inventor： Jeffrey Cornelius O'Neill , Jeffrey Paul Lilly , Thomas Schaaf

IPC: G10L15/187 , G10L15/14

CPC classification number: G10L15/142 , G10L15/14 , G10L25/48 , G10L25/93

Abstract: Features are provided for selectively scoring portions of user utterances based at least on articulatory features of the portions. One or more articulatory features of a portion of a user utterance can be determined. Acoustic models or subsets of individual acoustic model components (e.g., Gaussians or Gaussian mixture models) can be selected based on the articulatory features of the portion. The portion can then be scored using a selected acoustic model or subset of acoustic model components. The process may be repeated for the multiple portions of the utterance, and speech recognition results can be generated from the scored portions.

Abstract translation: 提供了特征，用于至少基于部分的关节特征来选择性地评分用户话语的部分。可以确定用户话语的一部分的一个或多个发音特征。可以基于该部分的关节特征来选择单个声学模型分量（例如，高斯混合模型或高斯混合模型）的声学模型或子集。然后可以使用选定的声学模型或声学模型组件的子集对该部分进行评分。可以对话语的多个部分重复该过程，并且可以从刻痕部分产生语音识别结果。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification