Voice identification enrollment
    Granted invention patent

    Publication number: US11152006B2

    Publication date: 2021-10-19

    Application number: US16020911

    Filing date: 2018-06-27

    Abstract: Examples are disclosed that relate to voice identification enrollment. One example provides a method of voice identification enrollment comprising, during a meeting in which two or more human speakers speak at different times, determining whether one or more conditions of a protocol for sampling meeting audio used to establish human speaker voiceprints are satisfied, and in response to determining that the one or more conditions are satisfied, selecting a sample of meeting audio according to the protocol, the sample representing an utterance made by one of the human speakers. The method further comprises establishing, based at least on the sample, a voiceprint of the human speaker.
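
The enrollment flow in this abstract can be illustrated with a minimal sketch. The abstract does not say what the protocol's conditions are, so the three checks below (no overlapping speech, a minimum duration, a minimum signal-to-noise ratio), the thresholds, the `Segment` type, and the averaging of feature vectors into a voiceprint are all assumptions made for illustration, not the patented method.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    """One stretch of meeting audio attributed to a single speaker (hypothetical type)."""
    speaker_id: str
    duration_s: float
    snr_db: float
    overlapping_speech: bool
    features: List[float]  # stand-in for an acoustic feature vector


# Hypothetical protocol thresholds; the abstract does not specify any.
MIN_DURATION_S = 2.0
MIN_SNR_DB = 15.0


def conditions_satisfied(seg: Segment) -> bool:
    """Check the assumed sampling conditions for one segment."""
    return (
        not seg.overlapping_speech
        and seg.duration_s >= MIN_DURATION_S
        and seg.snr_db >= MIN_SNR_DB
    )


def enroll(segments: List[Segment]) -> Dict[str, List[float]]:
    """Build one voiceprint per speaker by averaging the accepted samples."""
    sums: Dict[str, List[float]] = {}
    counts: Dict[str, int] = {}
    for seg in segments:
        if not conditions_satisfied(seg):
            continue  # segment is not sampled
        acc = sums.setdefault(seg.speaker_id, [0.0] * len(seg.features))
        for i, value in enumerate(seg.features):
            acc[i] += value
        counts[seg.speaker_id] = counts.get(seg.speaker_id, 0) + 1
    return {spk: [v / counts[spk] for v in acc] for spk, acc in sums.items()}
```

A speaker whose segments never meet the conditions simply receives no voiceprint, which reflects the abstract's point that sampling happens only once the conditions are satisfied.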

    Playback Device Supporting Concurrent Voice Assistants

    Publication number: US20210289607A1

    Publication date: 2021-09-16

    Application number: US17101949

    Filing date: 2020-11-23

    Applicant: Sonos, Inc.

    Inventor: Dayn Wilberding

    Abstract: Disclosed herein are example techniques to support multiple voice assistant services. An example implementation may involve a playback device capturing audio from the one or more microphones into one or more buffers as a sound data stream, monitoring the sound data stream for a wake word associated with a specific voice assistant service, and monitoring the sound data stream for a wake word associated with the media playback system. The playback device generates a second wake-word event corresponding to a voice input when sound data matching the wake word associated with the media playback system is detected in a portion of the sound data stream. The playback device determines that the voice input includes sound data matching one or more playback commands and sends sound data representing the voice input to a voice assistant associated with the media playback system for processing of the second voice input.
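
The concurrent monitoring described in this abstract can be sketched as a small routing loop. This is an illustration under heavy simplification: the sound data stream is modeled as a list of already-recognized tokens, the wake words and command set are invented, and forwarding input that follows the external service's wake word to that service is an assumption the abstract does not spell out.

```python
from typing import List, Tuple

# Hypothetical wake words and playback commands, not taken from the patent.
VAS_WAKE_WORD = "assistant"    # wake word of a specific voice assistant service
LOCAL_WAKE_WORD = "player"     # wake word of the media playback system
PLAYBACK_COMMANDS = {"play", "pause", "skip"}


def route(tokens: List[str]) -> List[Tuple[str, List[str]]]:
    """Scan one token stream for both wake words and route each voice input."""
    wake_words = (VAS_WAKE_WORD, LOCAL_WAKE_WORD)
    events: List[Tuple[str, List[str]]] = []
    i = 0
    while i < len(tokens):
        if tokens[i] not in wake_words:
            i += 1
            continue
        wake = tokens[i]
        # The voice input runs until the next wake word or the end of the stream.
        j = i + 1
        while j < len(tokens) and tokens[j] not in wake_words:
            j += 1
        voice_input = tokens[i + 1:j]
        if wake == LOCAL_WAKE_WORD:
            # Only input containing a playback command goes to the local assistant.
            if any(t in PLAYBACK_COMMANDS for t in voice_input):
                events.append(("local_assistant", voice_input))
        else:
            events.append(("external_vas", voice_input))  # assumed behavior
        i = j
    return events
```

Both wake words are checked against the same stream in a single pass, which mirrors the abstract's description of one captured sound data stream being monitored for two different wake words.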

    Neural networks for speaker verification

    Publication number: US11107478B2

    Publication date: 2021-08-31

    Application number: US16752007

    Filing date: 2020-01-24

    Applicant: Google LLC

    IPC classification: G10L17/18 G10L17/04 G10L17/02

    Abstract: This document generally describes systems, methods, devices, and other techniques related to speaker verification, including (i) training a neural network for a speaker verification model, (ii) enrolling users at a client device, and (iii) verifying identities of users based on characteristics of the users' voices. Some implementations include a computer-implemented method. The method can include receiving, at a computing device, data that characterizes an utterance of a user of the computing device. A speaker representation can be generated, at the computing device, for the utterance using a neural network on the computing device. The neural network can be trained based on a plurality of training samples that each: (i) include data that characterizes a first utterance and data that characterizes one or more second utterances, and (ii) are labeled as a matching speakers sample or a non-matching speakers sample.
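
The training-sample structure described in this abstract (a first utterance, one or more second utterances, and a matching or non-matching label) can be illustrated as follows. This is only a sketch of how such samples might be assembled from a labeled corpus: the `TrainingSample` type, the `build_samples` function, and the exhaustive pairing strategy are assumptions, and utterances are represented by plain strings rather than audio features.

```python
from dataclasses import dataclass
from itertools import permutations
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class TrainingSample:
    """One labeled sample in the shape the abstract describes (hypothetical type)."""
    first_utterance: str
    second_utterances: Tuple[str, ...]
    matching: bool  # True for matching speakers, False for non-matching speakers


def build_samples(utterances_by_speaker: Dict[str, List[str]]) -> List[TrainingSample]:
    """Assemble matching and non-matching samples from per-speaker utterances."""
    samples: List[TrainingSample] = []
    # Matching samples: hold out one utterance, pair it with the same speaker's others.
    for utts in utterances_by_speaker.values():
        for k, first in enumerate(utts):
            rest = tuple(utts[:k] + utts[k + 1:])
            if rest:
                samples.append(TrainingSample(first, rest, True))
    # Non-matching samples: pair each utterance with another speaker's utterances.
    for a, b in permutations(utterances_by_speaker, 2):
        others = tuple(utterances_by_speaker[b])
        if not others:
            continue
        for first in utterances_by_speaker[a]:
            samples.append(TrainingSample(first, others, False))
    return samples
```

In a real system the second utterances would play the role of simulated enrollment data and the first utterance the role of a verification attempt; the sketch only shows the labeling, not the network or its loss.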

    VOICEPRINT RECOGNITION METHOD AND APPARATUS

    Publication number: US20210225380A1

    Publication date: 2021-07-22

    Application number: US16300444

    Filing date: 2018-02-27

    Inventors: Wenyu WANG, Yuan HU

    Abstract: The present disclosure provides a voiceprint recognition method and apparatus, comprising: recognizing, by voiceprint recognition on an obtained command speech, the class of the user who issued the command speech; using a speech recognition model corresponding to the user class to perform speech recognition on the command speech, to obtain the command described by the command speech; and providing resources according to the user class and the command. The present disclosure avoids problems of conventional voiceprint recognition methods in the prior art, in which a client needs to participate in voiceprint recognition, the user's ID needs to be further recognized through a voiceprint training process, and user satisfaction is low. While the user speaks naturally, this "ordinary" speech can be processed and the work of voiceprint recognition completed at the same time.

    PHONEME-BASED SPEAKER MODEL ADAPTATION METHOD AND DEVICE

    Publication number: US20210193153A1

    Publication date: 2021-06-24

    Application number: US17273542

    Filing date: 2019-08-09

    Inventor: Chisang JUNG

    IPC classification: G10L17/08 G10L17/02 G10L17/04

    Abstract: The present disclosure relates to a speaker model adaptation method and device for enhancing text-independent speaker recognition performance. Specifically, the disclosure relates to a method and a device whereby, for the adaptation of a speaker model pre-stored in an electronic device, text-independent speaker recognition performance is improved by considering variations in the amount of speaker characteristics information per phoneme unit.

    METHOD AND APPARATUS FOR AUTHENTICATING SPEAKER

    Publication number: US20210193151A1

    Publication date: 2021-06-24

    Application number: US16813564

    Filing date: 2020-03-09

    Inventor: Jungmin SONG

    Abstract: A speaker voice authentication method and apparatus according to an embodiment of the present disclosure prevent a third party from attempting speaker authentication using a recorded file by distinguishing an actual voice of a speaker from a recorded file obtained by recording the voice of the speaker. Further, at the time of voice authentication, voice recognition artificial intelligence technology is selectively utilized to allow the speaker to perform voice authentication through only one utterance, and the voice of the speaker may be received in an Internet of Things (IoT) environment using a 5G network.

    EAR-WORN ELECTRONIC DEVICE INCORPORATING ANNOYANCE MODEL DRIVEN SELECTIVE ACTIVE NOISE CONTROL

    Publication number: US20210174818A1

    Publication date: 2021-06-10

    Application number: US17125566

    Filing date: 2020-12-17

    Abstract: A system comprises an ear-worn electronic device configured to be worn by a wearer. The ear-worn electronic device comprises a processor and memory coupled to the processor. The memory is configured to store an annoying sound dictionary representative of a plurality of annoying sounds pre-identified by the wearer. A microphone is coupled to the processor and configured to monitor an acoustic environment of the wearer. A speaker or a receiver is coupled to the processor. The processor is configured to identify different background noises present in the acoustic environment, determine which of the background noises correspond to one or more of the plurality of annoying sounds, and attenuate the one or more annoying sounds in an output signal provided to the speaker or receiver.
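
The selective attenuation in this abstract can be sketched with a dictionary lookup. The sketch assumes the hard part is already done: the acoustic environment has been separated into labeled components, which the abstract does not describe. The `mix_output` function, the component labels, and the fixed gain of 0.25 are hypothetical choices for illustration.

```python
from typing import Dict, List, Set

# Hypothetical gain applied to sounds found in the wearer's annoying sound dictionary.
ATTENUATION_GAIN = 0.25


def mix_output(components: Dict[str, List[float]], annoying: Set[str]) -> List[float]:
    """Mix labeled sound components, attenuating those the wearer marked as annoying.

    `components` maps a sound label to its samples and is assumed to come from
    an upstream classification and separation stage that is not shown here.
    """
    length = max((len(samples) for samples in components.values()), default=0)
    output = [0.0] * length
    for label, samples in components.items():
        gain = ATTENUATION_GAIN if label in annoying else 1.0
        for i, sample in enumerate(samples):
            output[i] += gain * sample
    return output
```

Sounds that are not in the dictionary pass through at unit gain, so only the noises the wearer pre-identified are reduced in the signal sent to the speaker or receiver.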