-
Publication No.: US20230015169A1
Publication Date: 2023-01-19
Application No.: US17933164
Filing Date: 2022-09-19
Applicant: Google LLC
Inventor(s): Yeming Fang, Quan Wang, Pedro Jose Moreno Mengibar, Ignacio Lopez Moreno, Gang Feng, Fang Chu, Jin Shi, Jason William Pelecanos
IPC Classification: G10L17/06
Abstract: A method of generating an accurate speaker representation for an audio sample includes receiving a first audio sample from a first speaker and a second audio sample from a second speaker. The method includes dividing a respective audio sample into a plurality of audio slices. The method also includes, based on the plurality of slices, generating a set of candidate acoustic embeddings where each candidate acoustic embedding includes a vector representation of acoustic features. The method further includes removing a subset of the candidate acoustic embeddings from the set of candidate acoustic embeddings. The method additionally includes generating an aggregate acoustic embedding from the remaining candidate acoustic embeddings in the set of candidate acoustic embeddings after removing the subset of the candidate acoustic embeddings.
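The abstract above does not say how the removed subset is chosen or how the remaining embeddings are aggregated. The following is a minimal sketch under stated assumptions: candidates are ranked by cosine similarity to their centroid, the least similar fraction is dropped, and the rest are averaged. The names `split_into_slices`, `aggregate_embedding`, and `drop_fraction` are hypothetical, and the model that turns a slice into an embedding is out of scope here.

```python
import math


def split_into_slices(samples, slice_len):
    """Divide an audio sample (a list of values) into consecutive slices."""
    return [samples[i:i + slice_len] for i in range(0, len(samples), slice_len)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def aggregate_embedding(candidates, drop_fraction=0.25):
    """Drop the candidates least similar to the centroid, then average the rest.

    Assumption: the selection rule (centroid similarity) and the aggregation
    (plain mean) are illustrative choices, not taken from the patent.
    """
    if not 0.0 <= drop_fraction < 1.0:
        raise ValueError("drop_fraction must be in [0, 1)")
    centroid = mean_vector(candidates)
    ranked = sorted(candidates, key=lambda v: cosine(v, centroid), reverse=True)
    n_drop = int(len(ranked) * drop_fraction)
    kept = ranked[:len(ranked) - n_drop]
    return mean_vector(kept)


# Three consistent candidates and one outlier; the outlier is removed.
candidates = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
print(aggregate_embedding(candidates))
```

In this toy example the outlier `[0.0, 1.0]` has the lowest similarity to the centroid, so it is excluded before averaging.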
-
Publication No.: US11554750B2
Publication Date: 2023-01-17
Application No.: US16199101
Filing Date: 2018-11-23
Inventor(s): Ann Claudia Chapin
IPC Classification: B60R25/25, B60R25/24, B60R25/102, G05D1/00, G05D1/02, B60R25/20, G10L17/06, B60R25/23, G10L17/00, G06V40/16, B60R25/10
Abstract: A system for limiting vehicle operation is disclosed. The system may comprise one or more memory units for storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may comprise determining that a key device is inside the vehicle; performing an authentication; setting the vehicle in a first vehicle mode; starting the vehicle based on the determination and the first authentication; and limiting vehicle operation based on the first vehicle mode.
-
Publication No.: US20220406313A1
Publication Date: 2022-12-22
Application No.: US17808224
Filing Date: 2022-06-22
Applicant: LISNR
Abstract: Aspects of the present disclosure involve processing audio signals to determine the presence and proximity of a user to a computing device, such as a voice-controlled computing device located within an environment. When the proximity of the user in comparison to the computing device is within an acceptable threshold, a voice command is detected that is associated with the user of a plurality of users located in the environment. In some instances, a device command is generated based on the voice command. The device command is executed, for example, at the computing device.
-
Publication No.: US20220405625A1
Publication Date: 2022-12-22
Application No.: US17896623
Filing Date: 2022-08-26
Applicant: Meta Platforms, Inc.
IPC Classification: G06N7/00, H04L67/306, H04L43/16, H04L67/02, G06Q50/00, G10L17/22, G10L17/06, H04W12/63, H04L67/52
Abstract: In one embodiment, a method includes, by one or more computing devices of an online social network, receiving, from a client system of a first user and from a second user, a biometric input used to identify the second user, sending, to the client system, a personal identifier for presentation to the second user, receiving, from the client system in response to the presentation of the personal identifier to the second user, an audio input from the second user, determining, based on a comparison of the audio input to a voiceprint of the second user, wherein the voiceprint comprises audio data for auditory identification of the second user, whether the audio input comprises the personal identifier spoken by the second user, and authenticating the second user to access an online account associated with the second user via the client system if the audio input is determined to be spoken by the second user and comprise the personal identifier spoken by the second user.
-
Publication No.: US11528450B2
Publication Date: 2022-12-13
Application No.: US17228053
Filing Date: 2021-04-12
Inventor(s): Stephen Lee Hodge
IPC Classification: H04N7/15, G06Q50/26, G06V20/52, G06V40/50, G06V40/16, G10L17/06, G10L17/22, G10L17/00, G10L15/00, G06F21/32, G06V30/10, G06V40/10, G06F21/31
Abstract: Described are methods and systems in which the censorship and supervision tasks normally performed by secured facility personnel are augmented or automated entirely by a Secure Nonscheduled Video Visitation System. In embodiments, the Secure Nonscheduled Video Visitation System performs voice biometrics, speech recognition, non-verbal audio classification, fingerprint and other biometric authentication, image object classification, facial recognition, body joint location determination analysis, and/or optical character recognition on the video visitation data. The Secure Nonscheduled Video Visitation System utilizes these various analysis techniques in concert to determine if all rules and regulations enforced by the jurisdiction operating the secured facility are being followed by the parties to the video visitation session.
-
Publication No.: US20220375476A1
Publication Date: 2022-11-24
Application No.: US17764288
Filing Date: 2019-10-17
Applicant: NEC Corporation
Inventor(s): Satoru MOMIYAMA
Abstract: Provided is a speaker authentication system capable of achieving robustness against adversarial examples. A data storage unit 112 stores data related to voice of a speaker. A plurality of voice processing units 11 respectively perform speaker authentication based on input voice and the data stored in the data storage unit 112. A post-processing unit 116 specifies one speaker authentication result based on speaker authentication results obtained respectively by the plurality of the voice processing units 11. A method or parameters of the pre-processing applied to the voice in each voice processing unit 11 are different for each voice processing unit 11.
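The structure described above (several voice processing units, each with its own pre-processing, followed by a post-processing unit that settles on one result) can be sketched as a small ensemble. This is only an illustration under assumptions: the abstract names neither the pre-processing methods nor the rule for combining results, so quantization and clipping stand in for pre-processing, cosine similarity against an enrolled vector stands in for speaker authentication, and a majority vote stands in for post-processing. All function names are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identity(voice):
    """Pre-processing variant 1: pass the input through unchanged."""
    return list(voice)


def quantize(step):
    """Pre-processing variant 2: round each value to a multiple of `step`."""
    return lambda voice: [round(x / step) * step for x in voice]


def clip(limit):
    """Pre-processing variant 3: clamp each value to [-limit, limit]."""
    return lambda voice: [max(-limit, min(limit, x)) for x in voice]


def make_unit(preprocess, threshold=0.9):
    """Build one voice processing unit with its own pre-processing step."""
    def unit(voice, enrolled):
        return cosine(preprocess(voice), enrolled) >= threshold
    return unit


def majority_vote(results):
    """Post-processing (assumed): return the most common unit decision."""
    return Counter(results).most_common(1)[0][0]


def authenticate(voice, enrolled, units):
    """Run every unit on the same input and combine their decisions."""
    return majority_vote([unit(voice, enrolled) for unit in units])


# An odd number of units avoids ties in the vote.
units = [make_unit(identity), make_unit(quantize(0.5)), make_unit(clip(0.8))]
enrolled = [1.0, 0.5, -0.5]
print(authenticate([1.0, 0.5, -0.5], enrolled, units))
print(authenticate([-1.0, 0.5, 0.5], enrolled, units))
```

The intuition behind varying the pre-processing is that a perturbation crafted against one processing path is less likely to fool every path at once; whether that holds depends on the actual transforms used.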
-
Publication No.: US20220363282A1
Publication Date: 2022-11-17
Application No.: US17860998
Filing Date: 2022-07-08
Inventor(s): Zhenkai Ying, Hongren Shi, Ming Yu
Abstract: A method for information processing, a device, and a computer readable storage medium are provided. The method includes the following. At an in-vehicle electronic device, in response to a determination that a received voice input is associated with switching a source device for switching a rendered content shown at the in-vehicle electronic device, voiceprint information of the voice input is obtained. A user identity corresponding to the voiceprint information is identified. A mapping table between user identities and connected electronic devices or connectable electronic devices is searched for a first electronic device corresponding to the identified user identity. In response to finding the first electronic device, a first channel for transmission of rendered content between the in-vehicle electronic device and the first electronic device is established, so as to render a first presenting content which is received from the first electronic device through the first channel.
-
Publication No.: US20220358934A1
Publication Date: 2022-11-10
Application No.: US17621766
Filing Date: 2019-06-28
Applicant: NEC Corporation
Abstract: A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different types of spectrograms from speech data and integrates the different types of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it as either genuine or spoof.
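The abstract above does not state which spectrogram types are combined. As a rough sketch of the "multi-channel" idea only, the code below assumes two types (a magnitude spectrogram and a log-power spectrogram computed from the same frames) and stacks them as channels, much like colour channels in an image. The function names, frame length, and hop size are hypothetical, the DFT is a naive one chosen for self-containment rather than speed, and the classifier stage is not shown.

```python
import cmath
import math


def frames(signal, frame_len, hop):
    """Split a signal into overlapping, fixed-length frames."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def dft_magnitudes(frame):
    """Magnitudes of the non-negative-frequency DFT bins of one frame."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def magnitude_spectrogram(signal, frame_len=8, hop=4):
    """Channel 1 (assumed type): per-frame magnitude spectrum."""
    return [dft_magnitudes(f) for f in frames(signal, frame_len, hop)]


def log_power_spectrogram(magnitudes, eps=1e-10):
    """Channel 2 (assumed type): power in decibels, with a small floor."""
    return [[10.0 * math.log10(m * m + eps) for m in row] for row in magnitudes]


def multi_channel_spectrogram(signal, frame_len=8, hop=4):
    """Stack both spectrogram types; layout is channels x frames x bins."""
    mag = magnitude_spectrogram(signal, frame_len, hop)
    return [mag, log_power_spectrogram(mag)]


# A sinusoid with two cycles per 8-sample frame concentrates energy in bin 2.
signal = [math.sin(2 * math.pi * 2 * t / 8) for t in range(16)]
stacked = multi_channel_spectrogram(signal)
print(len(stacked), len(stacked[0]), len(stacked[0][0]))
```

The resulting nested list could then be fed to any classifier that accepts multi-channel 2-D input; that choice is left open here because the source does not specify it.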
-
Publication No.: US11496842B2
Publication Date: 2022-11-08
Application No.: US17171226
Filing Date: 2021-02-09
Inventor(s): Yonatan Wexler, Amnon Shashua
IPC Classification: H04R25/00, G06F3/16, G06K9/62, G10L17/04, G10L17/06, G10L17/18, G10L21/003, G10L21/034, G10L25/51, H04R1/08, G03B31/00, G06F1/16, G10L21/0272, H04N7/18, G10L17/00, H04N5/225, H04N5/38, G10L15/26, G06V20/10, G06V40/10, G06V40/16, G06V40/20
Abstract: A system may include a camera configured to capture images from an environment of a user and a microphone configured to capture sounds from an environment of the user. The system may also include a processor programmed to: receive the images; identify a representation of a first individual and a representation of a second individual in the images; receive, from the microphone, a first audio signal associated with a voice of the first individual and a second audio signal associated with a voice of the second individual; detect amplification criteria indicative of a voice amplification priority between the first individual and the second individual; selectively amplify the first audio signal relative to the second audio signal when the amplification criteria indicate that the first individual has voice amplification priority over the second individual; and cause transmission of the selectively amplified first audio signal to a hearing interface device.
-
Publication No.: US11494473B2
Publication Date: 2022-11-08
Application No.: US16614764
Filing Date: 2018-05-18
Applicant: Plantronics, Inc.
Inventor(s): Shridhar K Mukund
IPC Classification: G06F21/32, G10L17/06, G10L21/0232, G10L21/0272, H04R1/08, H04R1/10, H04R1/40, H04R3/00, H04W12/06, G10L17/00, G10L21/0216
Abstract: A headset for acoustic authentication of a user is provided, the headset comprising at least a first microphone, a second microphone, a controllable filter, and an authenticator. The first microphone is arranged to obtain a first input signal. The second microphone is arranged to obtain a second input signal. The controllable filter is configured to receive the first input signal and the second input signal and to determine at least one filter transfer function from the received first input signal and the second input signal. The authenticator is configured to determine a current user acoustic signature from the at least one filter transfer function and to compare the current user acoustic signature with a predefined user acoustic signature and to authenticate the user based on the comparison of the current user acoustic signature with the predefined user acoustic signature.