Patent search ap:("NEC Laboratories America Page Inc.") AND inv:"Bo Dong"

1.

发明申请
SEQUENCE MODELS FOR AUDIO SCENE RECOGNITION 有权

公开(公告)号：US20210065735A1

公开(公告)日：2021-03-04

申请号：US16997314

申请日：2020-08-19

Applicant: NEC Laboratories America, Inc.

Inventor： Cristian Lumezanu , Yuncong Chen , Dongjin Song , Takehiko Mizuguchi , Haifeng Chen , Bo Dong

IPC: G10L25/51 , G10L25/24

Abstract: A method is provided. Intermediate audio features are generated from an input acoustic sequence. Using a nearest neighbor search, segments of the input acoustic sequence are classified based on the intermediate audio features to generate a final intermediate feature as a classification for the input acoustic sequence. Each segment corresponds to a respective different acoustic window. The generating step includes learning the intermediate audio features from Multi-Frequency Cepstral Component (MFCC) features extracted from the input acoustic sequence. The generating step includes dividing the same scene into the different acoustic windows having varying MFCC features. The generating step includes feeding the MFCC features of each of the different acoustic windows into respective LSTM units such that a hidden state of each respective LSTM unit is passed through an attention layer to identify feature correlations between hidden states at different time steps corresponding to different ones of the different acoustic windows.

2.

发明授权
Sequence models for audio scene recognition 有权

公开(公告)号：US10930301B1

公开(公告)日：2021-02-23

申请号：US16997314

申请日：2020-08-19

Applicant: NEC Laboratories America, Inc.

Inventor： Cristian Lumezanu , Yuncong Chen , Dongjin Song , Takehiko Mizuguchi , Haifeng Chen , Bo Dong

IPC: G10L25/51 , G10L25/24 , G10L25/18 , G10L25/21

Abstract: A method is provided. Intermediate audio features are generated from an input acoustic sequence. Using a nearest neighbor search, segments of the input acoustic sequence are classified based on the intermediate audio features to generate a final intermediate feature as a classification for the input acoustic sequence. Each segment corresponds to a respective different acoustic window. The generating step includes learning the intermediate audio features from Multi-Frequency Cepstral Component (MFCC) features extracted from the input acoustic sequence. The generating step includes dividing the same scene into the different acoustic windows having varying MFCC features. The generating step includes feeding the MFCC features of each of the different acoustic windows into respective LSTM units such that a hidden state of each respective LSTM unit is passed through an attention layer to identify feature correlations between hidden states at different time steps corresponding to different ones of the different acoustic windows.

3.

发明申请
AUDIO SCENE RECOGNITION USING TIME SERIES ANALYSIS 有权

公开(公告)号：US20210065734A1

公开(公告)日：2021-03-04

申请号：US16997249

申请日：2020-08-19

Applicant: NEC Laboratories America, Inc.

Inventor： Cristian Lumezanu , Yuncong Chen , Dongjin Song , Takehiko Mizuguchi , Haifeng Chen , Bo Dong

IPC: G10L25/51 , G10L25/24 , G10L25/18 , G10L25/21

Abstract: A method is provided. Intermediate audio features are generated from respective segments of an input acoustic time series for a same scene. Using a nearest neighbor search, respective segments of the input acoustic time series are classified based on the intermediate audio features to generate a final intermediate feature as a classification for the input acoustic time series. Each respective segment corresponds to a respective different acoustic window. The generating step includes learning the intermediate audio features from Multi-Frequency Cepstral Component (MFCC) features extracted from the input acoustic time series, dividing the same scene into the different windows having varying MFCC features, and feeding the MFCC features of each window into respective LSTM units such that a hidden state of each respective LSTM unit is passed through an attention layer to identify feature correlations between hidden states at different time steps corresponding to different ones of the different windows.

4.

发明授权
Audio scene recognition using time series analysis 有权

公开(公告)号：US11355138B2

公开(公告)日：2022-06-07

申请号：US16997249

申请日：2020-08-19

Applicant: NEC Laboratories America, Inc.

Inventor： Cristian Lumezanu , Yuncong Chen , Dongjin Song , Takehiko Mizuguchi , Haifeng Chen , Bo Dong

IPC: G10L25/51 , G10L25/24 , G10L25/18 , G10L25/21

Abstract: A method is provided. Intermediate audio features are generated from respective segments of an input acoustic time series for a same scene. Using a nearest neighbor search, respective segments of the input acoustic time series are classified based on the intermediate audio features to generate a final intermediate feature as a classification for the input acoustic time series. Each respective segment corresponds to a respective different acoustic window. The generating step includes learning the intermediate audio features from Multi-Frequency Cepstral Component (MFCC) features extracted from the input acoustic time series, dividing the same scene into the different windows having varying MFCC features, and feeding the MFCC features of each window into respective LSTM units such that a hidden state of each respective LSTM unit is passed through an attention layer to identify feature correlations between hidden states at different time steps corresponding to different ones of the different windows.

Patent Agency Ranking