-
公开(公告)号:US20180240472A1
公开(公告)日:2018-08-23
申请号:US15960140
申请日:2018-04-23
Applicant: Cirrus Logic, Inc.
Inventor: Earl Vickers , Fredrick D. Geiger , Erik Sherwood
IPC: G10L21/0264 , G10L25/60 , G10L21/0224 , G10L25/84 , G10L25/78 , G10L25/30 , G10L15/06
CPC classification number: G10L21/0264 , G10L21/0224 , G10L25/30 , G10L25/60 , G10L25/78 , G10L25/84 , G10L2015/0636
Abstract: A “running range normalization” method includes computing running estimates of the range of values of features useful for voice activity detection (VAD) and normalizing the features by mapping them to a desired range. Running range normalization includes computation of running estimates of the minimum and maximum values of VAD features and normalizing the feature values by mapping the original range to a desired range. Smoothing coefficients are optionally selected to directionally bias a rate of change of at least one of the running estimates of the minimum and maximum values. The normalized VAD feature parameters are used to train a machine learning algorithm to detect voice activity and to use the trained machine learning algorithm to isolate or enhance the speech component of the audio data.
-
公开(公告)号:US09953661B2
公开(公告)日:2018-04-24
申请号:US14866824
申请日:2015-09-25
Applicant: Cirrus Logic Inc.
Inventor: Earl Vickers
IPC: G10L21/02 , G10L21/0264 , G10L21/0224 , G10L25/60 , G10L25/84 , G10L25/78 , G10L25/30 , G10L15/06
CPC classification number: G10L21/0264 , G10L21/0224 , G10L25/30 , G10L25/60 , G10L25/78 , G10L25/84 , G10L2015/0636
Abstract: A “running range normalization” method includes computing running estimates of the range of values of features useful for voice activity detection (VAD) and normalizing the features by mapping them to a desired range. Running range normalization includes computation of running estimates of the minimum and maximum values of VAD features and normalizing the feature values by mapping the original range to a desired range. Smoothing coefficients are optionally selected to directionally bias a rate of change of at least one of the running estimates of the minimum and maximum values. The normalized VAD feature parameters are used to train a machine learning algorithm to detect voice activity and to use the trained machine learning algorithm to isolate or enhance the speech component of the audio data.
-