Invention Application
- Patent Title: METHOD AND DEVICE FOR ACOUSTIC LANGUAGE MODEL TRAINING
- Patent Title (中): 用于语音语言模型训练的方法和装置
- Application No.: PCT/CN2013/085948
- Application Date: 2013-10-25
- Publication No.: WO2014117548A1
- Publication Date: 2014-08-07
- Inventor: LU, Duling; LI, Lu; RAO, Feng; CHEN, Bo; LU, Li; ZHANG, Xiang; WANG, Eryu; YUE, Shuai
- Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Applicant Address: Room 403, East Block 2, SEG Park, Zhenxing Road, Futian District, Shenzhen, Guangdong 518044, CN
- Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee Address: Room 403, East Block 2, SEG Park, Zhenxing Road, Futian District, Shenzhen, Guangdong 518044, CN
- Agency: ADVANCE CHINA I.P. LAW OFFICE
- Priority: CN201310040085.1, 2013-02-01
- Main IPC: G10L15/06
- IPC: G10L15/06
Abstract:
A method and a device for training an acoustic language model are disclosed. The method includes: conducting word segmentation on training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement on the initial word segmentation data, to obtain first word segmentation data containing word class labels; using the first word segmentation data to train a first language model containing word class labels; using the first language model to conduct word segmentation on the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and, when the second word segmentation data meets one or more predetermined criteria, using the second word segmentation data to train the acoustic language model.
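The steps in the abstract can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the unigram model, the Viterbi-style segmenter, the `lexicon` and `word_class` inputs, the `<NAME>` label, and every function name are hypothetical choices made for illustration. The abstract does not say what the "predetermined criteria" are, so the sketch assumes one possible criterion: the segmentation no longer changes between rounds.

```python
import math
from collections import Counter


def train_unigram(segmented):
    """Maximum-likelihood unigram model over tokens (words or class labels)."""
    counts = Counter(tok for sent in segmented for tok in sent)
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}


def segment(text, model, lexicon, word_class=None, max_len=6, floor=1e-8):
    """Viterbi-style segmentation of an unspaced string.

    If word_class is given, a matched word is emitted as its class label
    and scored with the label's probability.
    """
    word_class = word_class or {}
    n = len(text)
    best = [(-math.inf, None, None)] * (n + 1)  # (score, previous index, token)
    best[0] = (0.0, None, None)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            piece = text[start:end]
            # Single characters are always allowed so a path always exists.
            if len(piece) > 1 and piece not in lexicon:
                continue
            token = word_class.get(piece, piece)
            score = best[start][0] + math.log(model.get(token, floor))
            if score > best[end][0]:
                best[end] = (score, start, token)
    tokens, pos = [], n
    while pos > 0:
        _, prev, token = best[pos]
        tokens.append(token)
        pos = prev
    return tokens[::-1]


def train_pipeline(corpus, initial_model, lexicon, word_class, max_rounds=5):
    # Step 1: segment with the initial model (no class labels).
    initial = [segment(s, initial_model, lexicon) for s in corpus]
    # Step 2: word class replacement gives the first labeled segmentation.
    current = [[word_class.get(w, w) for w in sent] for sent in initial]
    for _ in range(max_rounds):
        # Step 3: train a language model that contains class labels.
        model = train_unigram(current)
        # Step 4: re-segment the raw corpus with the class-labeled model.
        updated = [segment(s, model, lexicon, word_class) for s in corpus]
        # Step 5 (assumed criterion): stop once the segmentation is stable.
        if updated == current:
            break
        current = updated
    # Train the final model on the last segmentation produced.
    return train_unigram(current), current


# Toy usage with made-up data.
lexicon = {"call", "alice", "bob"}
word_class = {"alice": "<NAME>", "bob": "<NAME>"}
initial_model = {"call": 0.4, "alice": 0.3, "bob": 0.3}
model, data = train_pipeline(["callalice", "callbob"], initial_model, lexicon, word_class)
print(data)  # [['call', '<NAME>'], ['call', '<NAME>']]
```

In this toy run the first re-segmentation already matches the class-replaced data, so the loop stops after one round. A real system would more plausibly use a higher-order n-gram model with smoothing, and could apply a different stopping criterion.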