APPARATUS AND METHOD FOR CONSTRUCTING MULTILINGUAL ACOUSTIC MODEL AND COMPUTER READABLE RECORDING MEDIUM FOR STORING PROGRAM FOR PERFORMING THE METHOD
    1.
    Invention application
    Status: under examination (published)

    Publication No.: US20140149104A1

    Publication date: 2014-05-29

    Application No.: US14087490

    Filing date: 2013-11-22

    CPC classification number: G06F17/289 G10L15/00

    Abstract: An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.

    VOICE RECOGNITION SERVER AND CONTROL METHOD THEREOF
    4.
    Invention application
    Status: under examination (published)

    Publication No.: US20170076716A1

    Publication date: 2017-03-16

    Application No.: US15063872

    Filing date: 2016-03-08

    CPC classification number: G10L15/19 G10L15/1822 G10L2015/223

    Abstract: Provided herein is a voice recognition server and a control method thereof, the method including determining an index value for each of a plurality of training texts; setting a group for each of the plurality of training texts based on the index values of the plurality of training texts, and matching a function corresponding to each group and storing the matched results; in response to receiving a user's uttered voice from a user terminal apparatus, determining an index value from the received uttered voice; and searching a group corresponding to the index value determined from the received uttered voice, and performing the function corresponding to the uttered voice, thereby providing a voice recognition result of a variety of user's uttered voices suitable to the user's intentions.
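The grouping and lookup flow in this abstract can be illustrated with a minimal sketch. The abstract does not say how an index value is computed, so `index_value` below is a purely hypothetical stand-in (the sorted set of lowercase words), and the utterance is assumed to have already been converted to text. The names `build_groups` and `handle_utterance` are likewise illustrative, not taken from the application.

```python
from collections import defaultdict


def index_value(text):
    # Hypothetical index: the sorted set of lowercase words, so that
    # word order and repeated words do not change the value.
    return tuple(sorted(set(text.lower().split())))


def build_groups(training):
    """Group training texts by index value and match one function per group.

    training: iterable of (training_text, function_name) pairs.
    Returns (members, functions), both keyed by index value.
    """
    members = defaultdict(list)
    functions = {}
    for text, function_name in training:
        key = index_value(text)
        members[key].append(text)
        # The first function seen for a group is the one stored for it.
        functions.setdefault(key, function_name)
    return dict(members), functions


def handle_utterance(functions, recognized_text):
    # Determine the index value of the utterance and look up its group;
    # returns None when no group matches.
    return functions.get(index_value(recognized_text))
```

With this toy index, "turn up the volume" and "the volume turn up" fall into the same group, so either phrasing resolves to the same stored function.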

    SPEECH SIGNAL PROCESSING METHOD AND SPEECH SIGNAL PROCESSING APPARATUS
    5.
    Invention application
    Status: under examination (published)

    Publication No.: US20160133249A1

    Publication date: 2016-05-12

    Application No.: US14936043

    Filing date: 2015-11-09

    CPC classification number: G10L15/063 G10L15/04 G10L15/30 G10L2015/0635

    Abstract: A speech signal processing method of a user terminal includes: receiving a speech signal, detecting a personalized information section including personal information in the speech signal, performing data processing on the personalized information section of the speech signal by using a personalized model generated based on the personal information, and receiving, from a server, a result of the data processing performed by the server on a general information section of the speech signal that is different than the personalized information section of the speech signal.
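One way to picture the split between on-device and server processing is as a partition of the signal's time axis. The sketch below assumes the personalized information sections have already been detected and are given as (start, end) times in seconds; how detection works is not described in the abstract. The function name `split_sections` is hypothetical. It returns the merged personalized sections (to be processed locally) and the remaining general sections (to be sent to the server).

```python
def split_sections(duration, personalized):
    """Partition [0, duration] into personalized and general intervals.

    personalized: list of (start, end) times in seconds, possibly
    overlapping or unsorted. Returns (merged_personalized, general).
    """
    merged = []
    for start, end in sorted(personalized):
        # Clamp each interval to the signal and skip empty ones.
        start, end = max(0.0, start), min(duration, end)
        if start >= end:
            continue
        if merged and start <= merged[-1][1]:
            # Overlaps or touches the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))

    # The general sections are the gaps between personalized sections.
    general = []
    cursor = 0.0
    for start, end in merged:
        if start > cursor:
            general.append((cursor, start))
        cursor = end
    if cursor < duration:
        general.append((cursor, duration))
    return merged, general
```

For a 10-second signal with personalized sections at 2 to 4 s and 3 to 5 s, the personalized part merges into a single 2 to 5 s section and the general part is everything before and after it.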

    METHOD OF RECOGNIZING SPEECH AND ELECTRONIC DEVICE THEREOF
    6.
    Invention application
    Status: under examination (published)

    Publication No.: US20140019131A1

    Publication date: 2014-01-16

    Application No.: US13940848

    Filing date: 2013-07-12

    CPC classification number: G10L15/18 G10L15/05 G10L15/142

    Abstract: A method of recognizing a speech and an electronic device thereof are provided. The method includes: segmenting a speech signal into a plurality of sections at preset time intervals; performing a phoneme recognition with respect to one of the plurality of sections of the speech signal by using a first acoustic model; extracting a candidate word of the one of the plurality of sections of the speech signal by using the phoneme recognition result; and performing a speech recognition with respect to the one of the plurality of sections of the speech signal by using the candidate word.
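The first step, segmenting a speech signal into sections at preset time intervals, can be sketched as follows. This is only an illustration under stated assumptions: the signal is taken to be a sequence of samples at a known sample rate, and `segment_signal` is a hypothetical name. The later stages (phoneme recognition, candidate word extraction, and speech recognition) are not modeled here.

```python
def segment_signal(samples, sample_rate, interval_seconds):
    """Split a sample sequence into consecutive fixed-duration sections.

    The last section may be shorter if the signal length is not a
    multiple of the section length.
    """
    section_len = int(round(sample_rate * interval_seconds))
    if section_len <= 0:
        raise ValueError("interval must cover at least one sample")
    return [
        samples[i:i + section_len]
        for i in range(0, len(samples), section_len)
    ]
```

Each returned section could then be passed to the recognizer independently, which is what lets recognition begin before the whole utterance has been received.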

