Patent search ap:("Amazon Technologies Page Inc.") AND inv:"James Garnet Droppo"

1.

发明授权
Acoustic event detection 有权

公开(公告)号：US12087320B1

公开(公告)日：2024-09-10

申请号：US17671194

申请日：2022-02-14

Applicant: Amazon Technologies, Inc.

Inventor： Qin Zhang , Qingming Tang , Ming Sun , Chao Wang , Steve Mark Lorusso , Andrew Thomas Bydlon , James Garnet Droppo , Viktor Rozgic , Sripal Mehta , Yang Liu

IPC: G10L25/51 , G10L15/18 , G10L15/22 , G10L15/30

CPC classification number: G10L25/51 , G10L15/1815 , G10L15/22 , G10L15/30

Abstract: A system may be configured to detect custom acoustic events, where the system generates an acoustic event profile for the custom acoustic event based on a natural language description provided by a user and using an audio sample of the described acoustic event. For example, the user may describe the custom acoustic event as “dog bark.” The system may ask the user questions to refine the description (e.g., dog breed, dog gender, age, etc.). Using an audio sample of the refined description, the system may then determine that audio captured in the user's environment is a potential sample of the custom acoustic event. Such captured audio may be presented to the user for confirmation, and then may be used to detect future occurrences of the custom acoustic event in the user's environment.

2.

发明公开
ACOUSTIC EVENT DETECTION 审中-公开

公开(公告)号：US20240071408A1

公开(公告)日：2024-02-29

申请号：US18243804

申请日：2023-09-08

Applicant: Amazon Technologies, Inc.

Inventor： Qingming Tang , Chieh-Chi Kao , Qin Zhang , Ming Sun , Chao Wang , Sumit Garg , Rong Chen , James Garnet Droppo , Chia-Jung Chang

IPC: G10L25/51 , G06N3/045 , G06N3/08 , G10L25/21 , G10L25/30

CPC classification number: G10L25/51 , G06N3/045 , G06N3/08 , G10L25/21 , G10L25/30 , G10L15/08

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

3.

发明授权
Acoustic event detection 有权

公开(公告)号：US11790932B2

公开(公告)日：2023-10-17

申请号：US17547644

申请日：2021-12-10

Applicant: Amazon Technologies, Inc.

Inventor： Qingming Tang , Chieh-Chi Kao , Qin Zhang , Ming Sun , Chao Wang , Sumit Garg , Rong Chen , James Garnet Droppo , Chia-Jung Chang

IPC: G10L25/51 , G10L25/21 , G10L25/30 , G06N3/08 , G06N3/045 , G10L15/08 , G10L15/22

CPC classification number: G10L25/51 , G06N3/045 , G06N3/08 , G10L25/21 , G10L25/30 , G10L15/08 , G10L15/22 , G10L2015/088 , G10L2015/223

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

4.

发明授权
Synthetic speech processing 有权

公开(公告)号：US11580955B1

公开(公告)日：2023-02-14

申请号：US17218740

申请日：2021-03-31

Applicant: Amazon Technologies, Inc.

Inventor： Yixiong Meng , Roberto Barra Chicote , Grzegorz Beringer , Zeya Chen , Jie Liang , James Garnet Droppo , Chia-Hao Chang , Oguz Hasan Elibol

IPC: G10L13/08 , G10L13/027 , G10L15/06 , G10L13/033 , G10L19/008 , G10L13/047

Abstract: A speech-processing system receives input data representing text. A first encoder processes segments of the text to determine embedding data representing the text, and a second encoder processes corresponding audio data to determine prosodic data corresponding to the text. The embedding and prosodic data is processed to create output data including a representation of speech corresponding to the text and prosody.

5.

发明申请
NATURAL LANGUAGE GENERATION 有权

公开(公告)号：US20250104693A1

公开(公告)日：2025-03-27

申请号：US18474484

申请日：2023-09-26

Applicant: Amazon Technologies, Inc.

Inventor： Constantinos Papayiannis , Roberto Barra Chicote , Trevor Michael Wood , James Garnet Droppo

IPC: G10L13/10 , G10L13/047 , G10L25/18

Abstract: Techniques for using a language model (e.g., a large language model (LLM)) to generate a natural language response to a user input and prosody information (e.g., voice characteristics associated with a synthetic voice to output the natural language response to the user) are described. The prosody information may correspond to a natural language (e.g., text or tokenized) description, a spectrogram, and/or a latent representation of the voice characteristic(s) associated with the natural language response. In some embodiments, the natural language response and the prosody information may be generated by different portions of layers of the language model. In such embodiments, the output of the layer(s) of the language model configured to generate the natural language response may be provided to the layer(s) of the language model configured to generate the prosody information and the output may be used to generate the prosody information, and vice versa.

6.

发明授权
Endpointing in speech processing 有权

公开(公告)号：US12211517B1

公开(公告)日：2025-01-28

申请号：US17475699

申请日：2021-09-15

Applicant: Amazon Technologies, Inc.

Inventor： Roland Maximilian Rolf Maas , Bjorn Hoffmeister , Ariya Rastrow , James Garnet Droppo , Veerdhawal Pande , Maarten Van Segbroeck , Gautam Tiwari , Andrew Smith , Eli Joshua Fidler

IPC: G10L25/78 , G06N3/045 , G10L15/26 , G10L25/30

Abstract: A speech-processing system may determine potential endpoints in a user's speech. Such endpoint prediction may include determining a potential endpoint in a stream of audio data, and may additionally including determining an endpoint score representing a likelihood that the potential endpoint represents an end of speech representing a complete user input. When the potential endpoint has been determined, the system may publish a transcript of speech that preceded the potential endpoint, and send it to downstream components. The system may continue to transcribe audio data and determine additional potential endpoints while the downstream components process the transcript. The downstream components may determine whether the transcript is complete; e.g., represents the entirety of the user input. Final endpoint determinations may be made based on the results of the downstream processing including automatic speech recognition, natural language understanding, etc.

7.

发明公开
ACOUSTIC EVENT DETECTION 审中-公开

公开(公告)号：US20230186939A1

公开(公告)日：2023-06-15

申请号：US17547644

申请日：2021-12-10

Applicant: Amazon Technologies, Inc.

Inventor： Qingming Tang , Chieh-Chi Kao , Qin Zhang , Ming Sun , Chao Wang , Sumit Garg , Rong Chen , James Garnet Droppo , Chia-Jung Chang

IPC: G10L25/51 , G10L25/21 , G10L25/30 , G06N3/08 , G06N3/04

CPC classification number: G10L25/51 , G10L25/21 , G10L25/30 , G06N3/08 , G06N3/0454 , G10L15/22

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification