VOICE OR SPEECH RECOGNITION IN NOISY ENVIRONMENTS

    Publication No.: US20230197085A1

    Publication Date: 2023-06-22

    Application No.: US17997243

    Filing Date: 2020-06-22

    Abstract: Embodiments include methods for voice/speech recognition in noisy environments executed by a processor of a computing device. In various embodiments, voice or speech recognition may be executed by a processor of a computing device, which may include determining a voice recognition model to use for voice and/or speech recognition based on a location where an audio input is received and performing voice and/or speech recognition on the audio input using the determined voice recognition model. Some embodiments may receive, from a computing device, an audio input and location information associated with a location where the audio input was recorded. The received audio input may be used to generate a voice recognition model associated with the location where the audio input was recorded for use in voice and/or speech recognition. The generated voice recognition model associated with the location may be provided to the computing device.
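The location-based model selection described in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `LocationModel` record, the `select_model` helper, the great-circle distance test, the 0.5 km radius, and the `"generic"` fallback model name.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class LocationModel:
    """Hypothetical record tying a recognition model name to a place."""
    name: str
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def select_model(lat, lon, models, default="generic", max_km=0.5):
    """Return the name of the nearest location model within max_km, else default."""
    best = min(models, key=lambda m: haversine_km(lat, lon, m.lat, m.lon), default=None)
    if best is None or haversine_km(lat, lon, best.lat, best.lon) > max_km:
        return default  # assumed fallback: a location-independent model
    return best.name


models = [
    LocationModel("train_station", 52.5251, 13.3694),
    LocationModel("office", 52.5200, 13.4050),
]
chosen = select_model(52.5250, 13.3695, models)
```

A real system would then run recognition with the chosen model; how models are keyed to locations (coordinates, place identifiers, or something else) is left open by the abstract.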

    Dynamic threshold for speaker verification

    Publication No.: US09972323B2

    Publication Date: 2018-05-15

    Application No.: US15599578

    Filing Date: 2017-05-19

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a dynamic threshold for speaker verification are disclosed. In one aspect, a method includes the actions of receiving, for each of multiple utterances of a hotword, a data set including at least a speaker verification confidence score, and environmental context data. The actions further include selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context. The actions further include selecting a particular data set from among the subset of data sets based on one or more selection criteria. The actions further include selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score. The actions further include providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context.
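The sequence of actions in this abstract (filter data sets by environmental context, pick one data set by a selection criterion, use its confidence score as the threshold) can be sketched as follows. The abstract does not state the selection criterion; the one used here, picking the score at a target rejection rate within the context, is an assumption, as are the names `UtteranceRecord` and `threshold_for_context`.

```python
from dataclasses import dataclass


@dataclass
class UtteranceRecord:
    """Hypothetical data set for one hotword utterance."""
    score: float   # speaker verification confidence score
    context: str   # environmental context label, e.g. "noisy"


def threshold_for_context(records, context, reject_rate=0.1):
    """Pick a per-context threshold from observed confidence scores.

    Assumed selection criterion: sort the context's scores in ascending
    order and take the score at the position matching reject_rate, so that
    roughly that fraction of past utterances would fall below the threshold.
    Returns None when no data exists for the context.
    """
    subset = sorted((r for r in records if r.context == context), key=lambda r: r.score)
    if not subset:
        return None
    index = min(int(reject_rate * len(subset)), len(subset) - 1)
    return subset[index].score


records = [
    UtteranceRecord(0.8, "noisy"),
    UtteranceRecord(0.5, "noisy"),
    UtteranceRecord(0.7, "noisy"),
    UtteranceRecord(0.6, "noisy"),
    UtteranceRecord(0.95, "quiet"),
]
noisy_threshold = threshold_for_context(records, "noisy", reject_rate=0.5)
```

The returned value would then be provided to the verifier for utterances captured in the same context, so a noisy setting can use a more lenient threshold than a quiet one.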

    SYSTEM AND METHOD FOR DETECTING SYNTHETIC SPEAKER VERIFICATION

    Publication No.: US20180075851A1

    Publication Date: 2018-03-15

    Application No.: US15804220

    Filing Date: 2017-11-06

    Abstract: Disclosed herein are systems, methods, and tangible computer-readable media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrates little variance over time or are the same, and verifying the plurality of speech samples if the plurality of speech samples demonstrates sufficient variance over time. One embodiment further adds that each of the plurality of speech samples is collected at different times or in different contexts. In other embodiments, variance is based on a pre-determined threshold or the threshold for variance is adjusted based on a need for authentication certainty. In another embodiment, if the initial comparison is inconclusive, additional speech samples are received.
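The variance check in this abstract can be sketched under simplifying assumptions. Here each speech sample is assumed to be already reduced to a fixed-length feature vector, variation is measured as the mean pairwise Euclidean distance, and the `min_variation` value is an arbitrary illustrative threshold. None of these choices comes from the patent.

```python
import math
from itertools import combinations


def mean_pairwise_distance(samples):
    """Average Euclidean distance over every pair of feature vectors."""
    pairs = list(combinations(samples, 2))
    if not pairs:
        raise ValueError("need at least two samples to compare")
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


def check_samples(samples, min_variation=0.05):
    """Return "deny" for near-identical samples, "verify" otherwise.

    Natural speech varies between repetitions of the same phrase, so
    samples that barely differ are treated as replayed or synthetic.
    """
    if mean_pairwise_distance(samples) < min_variation:
        return "deny"
    return "verify"


replayed = [[0.1, 0.2, 0.3], [0.1, 0.2, 0.3], [0.1, 0.2, 0.3]]
live = [[0.1, 0.2, 0.3], [0.2, 0.1, 0.35], [0.05, 0.3, 0.25]]
replayed_result = check_samples(replayed)
live_result = check_samples(live)
```

Raising `min_variation` corresponds to the embodiment in which the threshold is adjusted to the required authentication certainty. The embodiment that requests additional samples after an inconclusive comparison is not modeled here.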

    REVERBERATION COMPENSATION FOR FAR-FIELD SPEAKER RECOGNITION

    Publication No.: US20180053512A1

    Publication Date: 2018-02-22

    Application No.: US15242882

    Filing Date: 2016-08-22

    Applicant: INTEL CORPORATION

    Abstract: Techniques are provided for reverberation compensation for far-field speaker recognition. A methodology implementing the techniques according to an embodiment includes receiving an authentication audio signal associated with speech of a user and extracting features from the authentication audio signal. The method also includes scoring results of application of one or more speaker models to the extracted features. Each of the speaker models is trained based on a training audio signal processed by a reverberation simulator to simulate selected far-field environmental effects to be associated with that speaker model. The method further includes selecting one of the speaker models, based on the score, and mapping the selected speaker model to a known speaker identification or label that is associated with the user.
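The score, select, and map steps in this abstract can be sketched as below. This is only an illustration under stated assumptions: each speaker model is reduced to a single centroid vector, cosine similarity stands in for the scoring function, and the `SpeakerModel` record with its `environment` field is hypothetical. The reverberation simulator used to train the models is outside the scope of the sketch.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeakerModel:
    """Hypothetical model: one per (speaker, simulated environment) pair."""
    speaker: str
    environment: str
    centroid: tuple


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def identify(features, models):
    """Score every model, select the best, and map it to its speaker label."""
    best = max(models, key=lambda m: cosine(features, m.centroid))
    return best.speaker, best.environment, cosine(features, best.centroid)


models = [
    SpeakerModel("alice", "near_field", (1.0, 0.0)),
    SpeakerModel("alice", "large_room", (0.7, 0.7)),
    SpeakerModel("bob", "near_field", (0.0, 1.0)),
]
speaker, environment, score = identify((0.6, 0.8), models)
```

Because several models can map to the same speaker, the selected model's environment is a side product: the far-field variant wins when the input resembles reverberant speech, and the label returned is still the known speaker.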

    DYNAMIC THRESHOLD FOR SPEAKER VERIFICATION

    Publication No.: US20170345430A1

    Publication Date: 2017-11-30

    Application No.: US15599578

    Filing Date: 2017-05-19

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a dynamic threshold for speaker verification are disclosed. In one aspect, a method includes the actions of receiving, for each of multiple utterances of a hotword, a data set including at least a speaker verification confidence score, and environmental context data. The actions further include selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context. The actions further include selecting a particular data set from among the subset of data sets based on one or more selection criteria. The actions further include selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score. The actions further include providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context.