一种基于因子分析的说话人分段聚类方法及系统

Invention Publication

CN107342077A 一种基于因子分析的说话人分段聚类方法及系统无效 - 驳回

Please log in to see more content

Patent Title: 一种基于因子分析的说话人分段聚类方法及系统
Patent Title (English): Speaker segmented clustering method and system based on factor analysis
Application No.: CN201710395341.7

Application Date: 2017-05-27
Publication No.: CN107342077A

Publication Date: 2017-11-10
Inventor: 计哲 , 颜永红 , 安茂波 , 陈燕妮 , 苗权 , 李鹏 , 张震 , 万辛
Applicant: 国家计算机网络与信息安全管理中心
Applicant Address: 北京市朝阳区裕民路甲3号
Assignee: 国家计算机网络与信息安全管理中心
Current Assignee: 国家计算机网络与信息安全管理中心
Current Assignee Address: 北京市朝阳区裕民路甲3号
Agency: 北京君尚知识产权代理事务所
Agent 邱晓锋
Main IPC: G10L15/06
IPC: G10L15/06 ; G10L15/07 ; G10L15/14 ; G10L17/04 ; G10L17/14

Abstract:

本发明涉及一种基于因子分析的说话人分段聚类方法及系统。该方法包括：1)提取训练语音的声学特征，训练高斯混合通用背景模型，进而训练总变化因子模型和高斯概率线性判别分析模型；2)对测试语音进行分段并提取语音片段的声学特征；3)依据高斯混合通用背景模型和总变化因子模型将提取的声学特征映射为总变化量因子，加载高斯概率线性判别分析模型，根据总变化量因子计算任意两语音片段之间的对数似然比得分；4)选择得分最高的两类进行合并，根据层次聚类的方法逐步迭代至收敛，最终输出说话人分段聚类结果。本发明将总变化因子的不确定性引入到高斯概率线性判别分析模型进行训练和打分，能够提升短时语音片段上的基于因子分析的系统性能。

Abstract(English):

The invention relates to a speaker segmented clustering method and system based on factor analysis. The speaker segmented clustering method based on factor analysis comprises steps of 1) extracting acoustic characteristics of training voice, training a gauss mixed general background model, training a total changing factor model and a gauss probability linear determination analysis model, (2) performing segmentation on testing voice and extracting acoustic characteristics of voice segments, (3) mapping the extracted voice characteristics as total changing amount factors according to the gauss mixed general background model and the total changing factor model, loading a gauss probability linear determination analysis model, calculating a log-likelihood ratio score between any two voice segments according to the total changing amount factors, (4) choosing two kinds of segments having highest scores and performing combination, performing gradual iteration until convergence according to a level clustering method and finally outputting a segmented clustering result of the speaker. The speaker segmented clustering method and system based on factor analysis introduce uncertainty of the total changing factor into the gauss probability linear determination model to perform training and marking, and can improve system performance of analyzing based factors on a short-time voice segment.

Information query

Chinese Patent Announcement Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）