摘要:
Method for text-dependent Speaker Recognition using a speaker adapted Universal Background Model, wherein the speaker adapted Universal Background Model is a speaker adapted Hidden Markov Model comprising channel correction.
摘要:
A tamper-resistant element for use in speaker recognition, the tamper-resistant element being adapted for storing data representing speaker information based on speaker recognition enrollment data and for checking whether information based on a speaker recognition testing signal matches the speaker information. The tamper-resistant element is also adapted for carrying out a data integrity check. Also, a system including such a tamper-resistant element and method for speaker recognition.
摘要:
Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first pass-blind diarization is on a per-frame basis and the second pass-blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.
摘要:
A method for assessing suicide risk for a human subject including receiving recorded voice data of the subject; and classifying the subject as suicidal or non-suicidal based upon a computerized analysis of one or more nonverbal characteristics of the speech data, especially features associated with a breathy phonation type. The analysis of the nonverbal characteristics of the voice data can include an analysis of acoustic characteristics of speech, and/or an analysis of prosodic and voice quality-related features of the voice data. Related apparatus, systems, techniques and articles are also described.
摘要:
A device and method for pass-phrase modeling for speaker verification and a speaker verification system are provided. The device comprises a front end which receives enrollment speech from a target speaker, and a template generation unit which generates a pass-phrase template with a general speaker model based on the enrollment speech. With the device, method and system of the present disclosure, by taking the rich variations contained in a general speaker model into account, the robust pass-phrase modeling is ensured even the enrollment data is insufficient, even just one pass-phrase is available from a target speaker.