Speaker Identification Accuracy
    Patent application

    Publication No.: US20230015169A1

    Publication date: 2023-01-19

    Application No.: US17933164

    Filing date: 2022-09-19

    Applicant: Google LLC

    IPC classification: G10L17/06

    Abstract: A method of generating an accurate speaker representation for an audio sample includes receiving a first audio sample from a first speaker and a second audio sample from a second speaker. The method includes dividing a respective audio sample into a plurality of audio slices. The method also includes, based on the plurality of slices, generating a set of candidate acoustic embeddings where each candidate acoustic embedding includes a vector representation of acoustic features. The method further includes removing a subset of the candidate acoustic embeddings from the set of candidate acoustic embeddings. The method additionally includes generating an aggregate acoustic embedding from the remaining candidate acoustic embeddings in the set of candidate acoustic embeddings after removing the subset of the candidate acoustic embeddings.
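
The slice, embed, prune, and aggregate steps in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how embeddings are computed or which candidates are removed, so `toy_embedding` is a hypothetical stand-in for a real acoustic encoder, and the pruning rule (drop the candidates farthest from the centroid) and the `drop_fraction` parameter are assumptions, not the claimed method.

```python
import math

def slice_audio(samples, slice_len):
    """Split samples into consecutive non-overlapping slices, dropping a short tail."""
    return [samples[i:i + slice_len]
            for i in range(0, len(samples) - slice_len + 1, slice_len)]

def toy_embedding(audio_slice):
    """Hypothetical encoder: (mean, RMS energy, zero-crossing rate) of one slice."""
    n = len(audio_slice)
    mean = sum(audio_slice) / n
    rms = math.sqrt(sum(x * x for x in audio_slice) / n)
    crossings = sum(1 for a, b in zip(audio_slice, audio_slice[1:]) if a * b < 0)
    return [mean, rms, crossings / (n - 1)]

def centroid(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def aggregate_embedding(candidates, drop_fraction=0.25):
    """Assumed pruning rule: remove the candidates farthest from the centroid,
    then average the remaining candidates into one aggregate embedding."""
    center = centroid(candidates)
    ranked = sorted(candidates, key=lambda e: euclidean(e, center))
    keep = max(1, len(ranked) - int(len(ranked) * drop_fraction))
    return centroid(ranked[:keep])

def speaker_representation(samples, slice_len, drop_fraction=0.25):
    """Full pipeline: slice the audio, embed each slice, prune, aggregate."""
    candidates = [toy_embedding(s) for s in slice_audio(samples, slice_len)]
    return aggregate_embedding(candidates, drop_fraction)
```

Pruning before averaging keeps a single noisy slice (for example, one that contains another speaker or silence) from pulling the aggregate away from the bulk of the candidates.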

    SYSTEMS AND METHODS FOR ENABLING VOICE-BASED TRANSACTIONS AND VOICE-BASED COMMANDS

    Publication No.: US20220406313A1

    Publication date: 2022-12-22

    Application No.: US17808224

    Filing date: 2022-06-22

    Applicant: LISNR

    Abstract: Aspects of the present disclosure involve processing audio signals to determine the presence and proximity of a user to a computing device, such as a voice-controlled computing device located within an environment. When the proximity of the user in comparison to the computing device is within an acceptable threshold, a voice command is detected that is associated with the user of a plurality of users located in the environment. In some instances, a device command is generated based on the voice command. The device command is executed, for example, at the computing device.

    User Identification with Voiceprints on Online Social Networks

    Publication No.: US20220405625A1

    Publication date: 2022-12-22

    Application No.: US17896623

    Filing date: 2022-08-26

    Abstract: In one embodiment, a method includes, by one or more computing devices of an online social network, receiving, from a client system of a first user and from a second user, a biometric input used to identify the second user, sending, to the client system, a personal identifier for presentation to the second user, receiving, from the client system in response to the presentation of the personal identifier to the second user, an audio input from the second user, determining, based on a comparison of the audio input to a voiceprint of the second user, wherein the voiceprint comprises audio data for auditory identification of the second user, whether the audio input comprises the personal identifier spoken by the second user, and authenticating the second user to access an online account associated with the second user via the client system if the audio input is determined to be spoken by the second user and comprise the personal identifier spoken by the second user.
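
The final decision in this abstract combines two checks: the audio must match the second user's voiceprint, and it must contain the personal identifier that was presented. A minimal sketch of that conjunction follows. It assumes an upstream model has already produced a speaker embedding and a transcript for the audio input; the cosine-similarity comparison, the `threshold` value, and all function names are hypothetical and are not taken from the application.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def authenticate(audio_embedding, transcript, voiceprint, personal_identifier,
                 threshold=0.8):
    """Grant access only if BOTH conditions hold:
    1. the audio is close enough to the enrolled voiceprint (assumed metric), and
    2. the spoken words match the personal identifier that was presented."""
    speaker_ok = cosine_similarity(audio_embedding, voiceprint) >= threshold
    phrase_ok = transcript.strip().lower() == personal_identifier.strip().lower()
    return speaker_ok and phrase_ok
```

Requiring the freshly presented identifier in addition to the voice match is what makes a replayed recording of the user's voice insufficient on its own.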

    Secure nonscheduled video visitation system

    Publication No.: US11528450B2

    Publication date: 2022-12-13

    Application No.: US17228053

    Filing date: 2021-04-12

    Inventor: Stephen Lee Hodge

    Abstract: Described are methods and systems in which the censorship and supervision tasks normally performed by secured facility personnel are augmented or automated entirely by a Secure Nonscheduled Video Visitation System. In embodiments, the Secure Nonscheduled Video Visitation System performs voice biometrics, speech recognition, non-verbal audio classification, fingerprint and other biometric authentication, image object classification, facial recognition, body joint location determination analysis, and/or optical character recognition on the video visitation data. The Secure Nonscheduled Video Visitation utilizes these various analysis techniques in concert to determine if all rules and regulations enforced by the jurisdiction operating the secured facility are being followed by the parties to the video visitation session.

    SPEAKER AUTHENTICATION SYSTEM, METHOD, AND PROGRAM

    Publication No.: US20220375476A1

    Publication date: 2022-11-24

    Application No.: US17764288

    Filing date: 2019-10-17

    Applicant: NEC Corporation

    Inventor: Satoru MOMIYAMA

    IPC classification: G10L17/06 G10L17/02 G10L25/18

    Abstract: Provided is a speaker authentication system capable of achieving robustness against adversarial examples. A data storage unit 112 stores data related to voice of a speaker. A plurality of voice processing units 11 respectively perform speaker authentication based on input voice and the data stored in the data storage unit 112. A post-processing unit 116 specifies one speaker authentication result based on speaker authentication results obtained respectively by the plurality of the voice processing units 11. A method or parameters of the pre-processing applied to the voice in each voice processing unit 11 are different for each voice processing unit 11.
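
The arrangement in this abstract, several voice processing units that differ only in their pre-processing, followed by one post-processing step, can be sketched as below. The abstract does not state how the post-processing unit picks the single result, so the majority vote (with ties rejected) is an assumption, and the three pre-processing functions and the nearest-template matcher are hypothetical placeholders for real signal processing.

```python
from collections import Counter

def identity(voice):
    return list(voice)

def clip(voice, limit=0.5):
    """Hypothetical pre-processing: limit each sample to [-limit, limit]."""
    return [max(-limit, min(limit, x)) for x in voice]

def smooth(voice):
    """Hypothetical pre-processing: average each pair of neighbouring samples."""
    return [(a + b) / 2 for a, b in zip(voice, voice[1:])]

def make_unit(preprocess, enrolled):
    """One voice processing unit: its own pre-processing, then nearest template."""
    def unit(voice):
        features = preprocess(voice)
        def distance(speaker):
            template = preprocess(enrolled[speaker])
            return sum(abs(a - b) for a, b in zip(features, template))
        return min(enrolled, key=distance)
    return unit

def post_process(results):
    """Assumed rule: majority vote over the units' results; a tie returns None."""
    counts = Counter(results).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None
    return counts[0][0]

def authenticate(voice, units):
    """Run every unit on the same input and reduce to one result."""
    return post_process([unit(voice) for unit in units])
```

Because an adversarial perturbation is typically tuned against one processing path, running differently pre-processed copies and voting makes it harder for a single crafted input to fool every unit at once.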

    Method for Information Processing, Device, and Computer Storage Medium

    Publication No.: US20220363282A1

    Publication date: 2022-11-17

    Application No.: US17860998

    Filing date: 2022-07-08

    Abstract: A method for information processing, a device, and a computer readable storage medium are provided. The method includes the following. At an in-vehicle electronic device, in response to a determination that a received voice input is associated with switching a source device for switching a rendered content shown at the in-vehicle electronic device, voiceprint information of the voice input is obtained. A user identity corresponding to the voiceprint information is identified. A mapping table between user identities and connected electronic devices or connectable electronic devices is searched for a first electronic device corresponding to the identified user identity. In response to searching out the first electronic device, a first channel for transmission of rendered content between the in-vehicle electronic device and the first electronic device is established, so as to render a first presenting content which is received from the first electronic device through the first channel.
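
The lookup flow in this abstract (voiceprint, then user identity, then mapping table, then channel) can be sketched as follows. Everything concrete here is an assumption for illustration: the voiceprint is treated as a plain vector compared by cosine similarity, the mapping table is a dictionary from user identity to a list of device names, and the returned dictionary merely stands in for establishing a real transmission channel.

```python
import math

def cosine_similarity(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def identify_user(voiceprint, enrolled, threshold=0.8):
    """Return the best-matching enrolled user, or None if no score reaches threshold."""
    best_user, best_score = None, threshold
    for user_id, reference in enrolled.items():
        score = cosine_similarity(voiceprint, reference)
        if score >= best_score:
            best_user, best_score = user_id, score
    return best_user

def find_source_device(user_id, mapping_table, reachable):
    """Search the mapping table for the user's first connected or connectable device."""
    for device in mapping_table.get(user_id, []):
        if device in reachable:
            return device
    return None

def handle_switch_request(voiceprint, enrolled, mapping_table, reachable):
    """Identify the speaker, look up their device, and describe the channel to open."""
    user_id = identify_user(voiceprint, enrolled)
    if user_id is None:
        return None
    device = find_source_device(user_id, mapping_table, reachable)
    if device is None:
        return None
    # Placeholder for actually establishing the content channel.
    return {"user": user_id, "device": device, "channel": "open"}
```

Keying the table on user identity rather than on the device means the same spoken request ("switch to my phone") resolves to a different source device depending on who said it.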

    Headset for acoustic authentication of a user

    Publication No.: US11494473B2

    Publication date: 2022-11-08

    Application No.: US16614764

    Filing date: 2018-05-18

    Applicant: Plantronics, Inc.

    Inventor: Shridhar K Mukund

    Abstract: A headset for acoustic authentication of a user is provided, the headset comprising at least a first microphone, a second microphone, a controllable filter, and an authenticator. The first microphone is arranged to obtain a first input signal. The second microphone is arranged to obtain a second input signal. The controllable filter is configured to receive the first input signal and the second input signal and to determine at least one filter transfer function from the received first input signal and the second input signal. The authenticator is configured to determine a current user acoustic signature from the at least one filter transfer function and to compare the current user acoustic signature with a predefined user acoustic signature and to authenticate the user based on the comparison of the current user acoustic signature with the predefined user acoustic signature.