DEVICE AND METHOD FOR PASS-PHRASE MODELING FOR SPEAKER VERIFICATION, AND VERIFICATION SYSTEM
    1.
    发明申请
    DEVICE AND METHOD FOR PASS-PHRASE MODELING FOR SPEAKER VERIFICATION, AND VERIFICATION SYSTEM 有权
    用于演讲者验证的PASS-PHR建模的设备和方法,以及验证系统

    公开(公告)号:US20130238334A1

    公开(公告)日:2013-09-12

    申请号:US13989577

    申请日:2010-12-10

    IPC分类号: G10L17/04

    CPC分类号: G10L17/04 G10L17/16

    摘要: A device and method for pass-phrase modeling for speaker verification and a speaker verification system are provided. The device comprises a front end which receives enrollment speech from a target speaker, and a template generation unit which generates a pass-phrase template with a general speaker model based on the enrollment speech. With the device, method and system of the present disclosure, by taking the rich variations contained in a general speaker model into account, the robust pass-phrase modeling is ensured even the enrollment data is insufficient, even just one pass-phrase is available from a target speaker.

    摘要翻译: 提供了一种用于说话人验证和扬声器验证系统的通行短语建模的装置和方法。 该装置包括从目标扬声器接收登记语音的前端,以及基于登记语音生成具有一般说话者模型的密码短语模板的模板生成单元。 利用本公开的装置,方法和系统,通过考虑到一般说话者模型中包含的丰富变体,即使登记数据不足,也确保了鲁棒的密码短语建模,即使只有一个通行短语可从 目标演讲者

    ANCHOR MODEL ADAPTATION DEVICE, INTEGRATED CIRCUIT, AV (AUDIO VIDEO) DEVICE, ONLINE SELF-ADAPTATION METHOD, AND PROGRAM THEREFOR
    2.
    发明申请
    ANCHOR MODEL ADAPTATION DEVICE, INTEGRATED CIRCUIT, AV (AUDIO VIDEO) DEVICE, ONLINE SELF-ADAPTATION METHOD, AND PROGRAM THEREFOR 审中-公开
    ANCHOR型号适配器件,集成电路,AV(音频视频)器件,在线自适应方法及其程序

    公开(公告)号:US20120093327A1

    公开(公告)日:2012-04-19

    申请号:US13379827

    申请日:2011-04-19

    IPC分类号: H04R29/00

    CPC分类号: G10L25/57 G10L2015/0631

    摘要: The present invention provides a device that performs online self-adaption of anchor models for an acoustic space, and a method thereof, the anchor models being used for categorization of an AV stream which is performed based on an audio stream in the AV stream. The device divides an input audio stream into audio segments, each being estimated to have a single acoustic feature, and estimates a single probability model for each audio segment. Then, the device performs clustering on the estimated probability models and probability models stored therein, thereby generating a new anchor model.

    摘要翻译: 本发明提供一种执行用于声学空间的锚模型的在线自适应的装置及其方法,所述锚模型用于基于AV流中的音频流执行的AV流的分类。 该设备将输入音频流划分成音频段,每个音频段被估计具有单个声学特征,并且估计每个音频段的单个概率模型。 然后,设备对存储在其中的估计概率模型和概率模型进行聚类,从而生成新的锚模型。

    Modeling device and method for speaker recognition, and speaker recognition system
    3.
    发明授权
    Modeling device and method for speaker recognition, and speaker recognition system 有权
    扬声器识别的建模装置和方法,以及扬声器识别系统

    公开(公告)号:US09595260B2

    公开(公告)日:2017-03-14

    申请号:US13989508

    申请日:2010-12-10

    IPC分类号: G10L17/04

    CPC分类号: G10L17/04

    摘要: A modeling device comprises a front end which receives enrollment speech data from each target speaker, a reference anchor set generation unit which generates a reference anchor set using the enrollment speech data based on an anchor space, and a voice print generation unit which generates voice prints based on the reference anchor set and the enrollment speech data. By taking the enrollment speech and speaker adaptation technique into account, anchor models with a smaller size can be generated, so reliable and robust speaker recognition with a smaller size reference anchor set is possible.

    摘要翻译: 建模装置包括从每个目标讲话者接收登记语音数据的前端,基于锚定空间使用登记语音数据生成参考锚集合的参考锚集合生成单元,以及生成语音打印的语音打印生成单元 基于参考锚集和登记语音数据。 通过考虑入场语音和说话人适应技术,可以生成尺寸更小的锚模型,因此可以使用较小尺寸的参考锚集合进行可靠和鲁棒的说话人识别。

    Device and method for pass-phrase modeling for speaker verification, and verification system
    4.
    发明授权
    Device and method for pass-phrase modeling for speaker verification, and verification system 有权
    用于讲话者验证的通行短语建模的设备和方法,以及验证系统

    公开(公告)号:US09257121B2

    公开(公告)日:2016-02-09

    申请号:US13989577

    申请日:2010-12-10

    CPC分类号: G10L17/04 G10L17/16

    摘要: A device and method for pass-phrase modeling for speaker verification and a speaker verification system are provided. The device comprises a front end which receives enrollment speech from a target speaker, and a template generation unit which generates a pass-phrase template with a general speaker model based on the enrollment speech. With the device, method and system of the present disclosure, by taking the rich variations contained in a general speaker model into account, the robust pass-phrase modeling is ensured even the enrollment data is insufficient, even just one pass-phrase is available from a target speaker.

    摘要翻译: 提供了一种用于说话人验证和扬声器验证系统的通行短语建模的装置和方法。 该装置包括从目标扬声器接收登记语音的前端,以及基于登记语音生成具有一般说话者模型的密码短语模板的模板生成单元。 利用本公开的装置,方法和系统,通过考虑到一般说话者模型中包含的丰富变体,即使登记数据不足,也确保了鲁棒的密码短语建模,即使只有一个通行短语可从 目标演讲者

    MODELING DEVICE AND METHOD FOR SPEAKER RECOGNITION, AND SPEAKER RECOGNITION SYSTEM
    5.
    发明申请
    MODELING DEVICE AND METHOD FOR SPEAKER RECOGNITION, AND SPEAKER RECOGNITION SYSTEM 有权
    扬声器识别的建模装置和方法,以及扬声器识别系统

    公开(公告)号:US20130253931A1

    公开(公告)日:2013-09-26

    申请号:US13989508

    申请日:2010-12-10

    IPC分类号: G10L17/04

    CPC分类号: G10L17/04

    摘要: A modeling device and method for speaker recognition and a speaker recognition system are provided. The modeling device comprises a front end which receives enrollment speech data from each target speaker, a reference anchor set generation unit which generates a reference anchor set using the enrollment speech data based on an anchor space, and a voice print generation unit which generates voice prints based on the reference anchor set and the enrollment speech data. With the present disclosure, by taking the enrollment speech and speaker adaptation technique into account, anchor models with smaller size can be generated, so reliable and robust speaker recognition with smaller size reference anchor set is possible. It brings great advantages for computation speed improvement and great memory reduction.

    摘要翻译: 提供了一种用于说话者识别的建模装置和方法以及一个说话者识别系统。 该建模装置包括从每个目标讲话者接收登记语音数据的前端,基于锚定空间使用登记语音数据生成参考锚集的参考锚集合生成单元,以及生成语音打印的语音打印生成单元 基于参考锚集和登记语音数据。 通过本公开,通过考虑入场语音和说话者适应技术,可以生成具有较小尺寸的锚模型,因此具有较小尺寸参考锚集的可靠和鲁棒的说话者识别是可能的。 它为计算速度提高和记忆减少带来巨大的优势。