Patent search ap:("Amazon Technologies Inc.") AND inv:"Chieh-Chi Kao"

1.

Granted invention patent
Audio event detection (in force)

Publication (announcement) No.: US10803885B1

Publication (announcement) date: 2020-10-13

Application No.: US16023923

Application date: 2018-06-29

Applicant: Amazon Technologies, Inc.

Inventor: Chieh-Chi Kao, Chao Wang, Weiran Wang, Ming Sun

IPC: G10L25/78, G10L25/51, G10L15/22, G10L15/16

Abstract: An audio event detection system that processes audio data into audio feature data and processes the audio feature data using pre-configured candidate interval lengths to identify top candidate regions of the feature data that may include an audio event. The feature data from the top candidate regions are then scored by a classifier, where the score indicates a likelihood that the candidate region corresponds to a desired audio event. The scores are compared to a threshold, and if the threshold is satisfied, the top scoring candidate region is determined to include an audio event.
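The two-stage flow in this abstract (rank candidate regions of pre-configured lengths, score the top ones with a classifier, compare against a threshold) can be illustrated with a minimal Python sketch. The scorer `mean_value`, the parameters `top_k` and `threshold`, and the sliding-window enumeration are assumptions made for illustration only; the abstract does not specify how candidates are ranked or which classifier is used.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Frames = Sequence[Sequence[float]]
Region = Tuple[int, int]  # (start_frame, end_frame), end exclusive


def mean_value(frames: Frames) -> float:
    """Hypothetical stand-in scorer: average feature value over a region."""
    values = [v for frame in frames for v in frame]
    return sum(values) / len(values) if values else 0.0


def detect_event(
    features: Frames,
    interval_lengths: Sequence[int],
    propose_fn: Callable[[Frames], float] = mean_value,
    classify_fn: Callable[[Frames], float] = mean_value,
    top_k: int = 3,
    threshold: float = 0.5,
) -> Optional[Tuple[Region, float]]:
    # Stage 1: enumerate every region of each pre-configured length
    # and rank the regions by the (assumed) proposal score.
    candidates: List[Tuple[float, Region]] = []
    for length in interval_lengths:
        for start in range(len(features) - length + 1):
            region = (start, start + length)
            candidates.append((propose_fn(features[start:start + length]), region))
    candidates.sort(key=lambda c: c[0], reverse=True)
    top = [region for _, region in candidates[:top_k]]
    if not top:
        return None

    # Stage 2: score the top candidates with the classifier and keep
    # the best one only if its score satisfies the threshold.
    scored = [(classify_fn(features[s:e]), (s, e)) for s, e in top]
    best_score, best_region = max(scored, key=lambda c: c[0])
    return (best_region, best_score) if best_score >= threshold else None
```

With one-dimensional toy features such as `[[0.0], [0.1], [0.9], [1.0], [0.1], [0.0]]` and interval lengths `[2, 3]`, the sketch returns the region covering the two high-valued frames; raising the threshold above that region's score makes it return `None`.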

2.

Granted invention patent
Acoustic event detection (in force)

Publication (announcement) No.: US11302329B1

Publication (announcement) date: 2022-04-12

Application No.: US16914589

Application date: 2020-06-29

Applicant: Amazon Technologies, Inc.

Inventor: Ming Sun, Spyridon Matsoukas, Venkata Naga Krishna Chaitanya Puvvada, Chao Wang, Chieh-Chi Kao

IPC: G10L15/22, G10L15/08

Abstract: A system may include an acoustic event detection component for detecting acoustic events, which may be non-speech sounds. Upon detection of a command to detect a new sound, a device may prompt a user to cause occurrence of the sound one or more times. The acoustic event detection component may then be reconfigured, using audio data corresponding to the occurrences, to detect future occurrences of the event.

3.

Granted invention patent
Self-supervised federated learning (in force)

Publication (announcement) No.: US12039998B1

Publication (announcement) date: 2024-07-16

Application No.: US17665129

Application date: 2022-02-04

Applicant: Amazon Technologies, Inc.

Inventor: Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic, Spyridon Matsoukas, Chao Wang

IPC: G10L25/78, G06N3/045, G06N3/08, G10L25/21

CPC classification number: G10L25/78, G06N3/045, G06N3/08, G10L25/21

Abstract: An acoustic event detection system may employ self-supervised federated learning to update encoder and/or classifier machine learning models. In an example operation, an encoder may be pre-trained to extract audio feature data from an audio signal. A decoder may be pre-trained to predict a subsequent portion of audio data (e.g., a subsequent frame of audio data represented by log filterbank energies). The encoder and decoder may be trained using self-supervised learning to improve the decoder's predictions and, by extension, the quality of the audio feature data generated by the encoder. The system may apply federated learning to share encoder updates across user devices. The system may fine-tune the classifier to improve inferences based on the improved audio feature data. The system may distribute classifier updates to the user device(s) to update the on-device classifier.
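The abstract says encoder updates are shared across user devices through federated learning but does not name an aggregation rule. As one possible illustration, the sketch below uses plain federated averaging (weights averaged in proportion to each device's local example count). The function name `federated_average`, the `Weights` layout, and the choice of federated averaging itself are assumptions, not details taken from the patent.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical layout: parameter name -> flat list of weight values.
Weights = Dict[str, List[float]]


def federated_average(updates: Sequence[Tuple[Weights, int]]) -> Weights:
    """Average encoder weights from several devices.

    Each update is (encoder weights after local self-supervised training,
    number of local examples used). Devices with more examples get
    proportionally more influence on the shared encoder.
    """
    if not updates:
        raise ValueError("at least one device update is required")
    total = sum(n for _, n in updates)
    if total <= 0:
        raise ValueError("total example count must be positive")

    reference = updates[0][0]
    averaged: Weights = {}
    for name, values in reference.items():
        averaged[name] = [
            sum(weights[name][i] * n for weights, n in updates) / total
            for i in range(len(values))
        ]
    return averaged
```

For example, averaging `{"enc": [1.0, 2.0]}` from a device with 1 example and `{"enc": [3.0, 6.0]}` from a device with 3 examples weights the second device three times as heavily as the first.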

4.

Published invention application
ACOUSTIC EVENT DETECTION (under examination, published)

Publication (announcement) No.: US20230186939A1

Publication (announcement) date: 2023-06-15

Application No.: US17547644

Application date: 2021-12-10

Applicant: Amazon Technologies, Inc.

Inventor: Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Chao Wang, Sumit Garg, Rong Chen, James Garnet Droppo, Chia-Jung Chang

IPC: G10L25/51, G10L25/21, G10L25/30, G06N3/08, G06N3/04

CPC classification number: G10L25/51, G10L25/21, G10L25/30, G06N3/08, G06N3/0454, G10L15/22

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.
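The arrangement described here, two detectors that consume the same acoustic feature data and whose results are combined, might look roughly like the following Python sketch. The `Detector` signature, the per-event score dictionaries, and the `threshold` parameter are hypothetical; the abstract does not say how either component reports its output.

```python
from typing import Callable, Dict, List, Sequence

Features = Sequence[float]
# Hypothetical interface: a detector maps features to per-event scores.
Detector = Callable[[Features], Dict[str, float]]


def detect_events(
    features: Features,
    predefined_detector: Detector,
    custom_detector: Detector,
    threshold: float = 0.5,
) -> List[str]:
    """Run both AED components on the same features and merge the results."""
    scores: Dict[str, float] = {}
    # The feature data is computed once upstream and passed to both components.
    scores.update(predefined_detector(features))
    scores.update(custom_detector(features))
    # Report every event, predetermined or custom, whose score passes the threshold.
    return sorted(name for name, score in scores.items() if score >= threshold)
```

Sharing one feature vector between both components avoids running feature extraction twice on the same audio.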

5.

Granted invention patent
Media presence detection (in force)

Publication (announcement) No.: US11069352B1

Publication (announcement) date: 2021-07-20

Application No.: US16278440

Application date: 2019-02-18

Applicant: Amazon Technologies, Inc.

Inventor: Qingming Tang, Ming Sun, Chieh-Chi Kao, Chao Wang, Viktor Rozgic

IPC: G10L15/22, G10L25/78, G10L15/16, G10L15/02

Abstract: Described herein is a system for media presence detection in audio. The system analyzes audio data to recognize whether a given audio segment contains sounds from a media source as a way of differentiating recorded media source sounds from other live sounds. In exemplary embodiments, the system includes a hierarchical model architecture for processing audio data segments, where individual audio data segments are processed by a trained machine learning model operating locally, and another trained machine learning model provides historical and contextual information to determine a score indicating the likelihood that the audio data segment contains sounds from a media source.

6.

Published invention application
ACOUSTIC EVENT DETECTION (under examination, published)

Publication (announcement) No.: US20240071408A1

Publication (announcement) date: 2024-02-29

Application No.: US18243804

Application date: 2023-09-08

Applicant: Amazon Technologies, Inc.

Inventor: Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Chao Wang, Sumit Garg, Rong Chen, James Garnet Droppo, Chia-Jung Chang

IPC: G10L25/51, G06N3/045, G06N3/08, G10L25/21, G10L25/30

CPC classification number: G10L25/51, G06N3/045, G06N3/08, G10L25/21, G10L25/30, G10L15/08

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

7.

Granted invention patent
Acoustic event detection (in force)

Publication (announcement) No.: US11790932B2

Publication (announcement) date: 2023-10-17

Application No.: US17547644

Application date: 2021-12-10

Applicant: Amazon Technologies, Inc.

Inventor: Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Chao Wang, Sumit Garg, Rong Chen, James Garnet Droppo, Chia-Jung Chang

IPC: G10L25/51, G10L25/21, G10L25/30, G06N3/08, G06N3/045, G10L15/08, G10L15/22

CPC classification number: G10L25/51, G06N3/045, G06N3/08, G10L25/21, G10L25/30, G10L15/08, G10L15/22, G10L2015/088, G10L2015/223

Abstract: A system may include a first acoustic event detection (AED) component configured to detect a predetermined set of acoustic events, and include a second AED component configured to detect custom acoustic events that a user configures a device to detect. The first and second AED components are configured to perform task-specific processing, and may receive as input the same acoustic feature data corresponding to audio data that potentially represents occurrence of one or more events. Based on processing by the first and second AED components, a device may output data indicating that one or more acoustic events occurred, where the acoustic events may be a predetermined acoustic event and/or a custom acoustic event.

8.

Granted invention patent
Audio event detection (in force)

Publication (announcement) No.: US10418957B1

Publication (announcement) date: 2019-09-17

Application No.: US16023990

Application date: 2018-06-29

Applicant: Amazon Technologies, Inc.

Inventor: Weiran Wang, Chao Wang, Chieh-Chi Kao

IPC: H03G3/32, G10L25/78, G06N5/04

Abstract: An audio event detection system that subsamples input audio data using a series of recurrent neural networks to create data of a coarser time scale than the audio data. Data frames corresponding to the coarser time scale may then be upsampled to data frames that match the finer time scale of the original audio data frames. The resulting data frames are then scored with a classifier to determine a likelihood that the individual frames correspond to an audio event. Each frame is then weighted by its score and a composite weighted frame is created by summing the weighted frames and dividing by the cumulative score. The composite weighted frame is then scored by the classifier. The resulting score is taken as an overall score indicating a likelihood that the input audio data includes an audio event.
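The pooling step in this abstract (weight each frame by its score, sum, divide by the cumulative score, then score the composite frame) can be sketched in a few lines of Python. The recurrent subsampling and upsampling stages are omitted, and `sigmoid_of_mean` is a hypothetical stand-in for the trained classifier, so this shows only the score-weighted averaging, under those assumptions.

```python
import math
from typing import Callable, List, Sequence

Frame = Sequence[float]


def sigmoid_of_mean(frame: Frame) -> float:
    """Hypothetical stand-in classifier: logistic function of the frame mean."""
    mean = sum(frame) / len(frame)
    return 1.0 / (1.0 + math.exp(-mean))


def composite_frame(frames: Sequence[Frame], score_fn: Callable[[Frame], float]) -> List[float]:
    """Score-weighted average of the frames, as described in the abstract."""
    if not frames:
        raise ValueError("at least one frame is required")
    scores = [score_fn(frame) for frame in frames]
    total = sum(scores)  # cumulative score used as the normalizer
    if total == 0:
        raise ValueError("cumulative score is zero; composite frame is undefined")
    dim = len(frames[0])
    return [
        sum(score * frame[i] for score, frame in zip(scores, frames)) / total
        for i in range(dim)
    ]


def overall_score(
    frames: Sequence[Frame],
    score_fn: Callable[[Frame], float] = sigmoid_of_mean,
) -> float:
    """Score the composite frame with the same classifier to get one overall score."""
    return score_fn(composite_frame(frames, score_fn))
```

Because frames with higher scores contribute more to the composite, a short event surrounded by background frames is diluted less than it would be under a plain average.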
