Voice messaging system with unified pitch and voice tracking

Granted invention patent

US4696038A Voice messaging system with unified pitch and voice tracking (expired)



Title: Voice messaging system with unified pitch and voice tracking
Application number: US484718

Filing date: 1983-04-13
Publication number: US4696038A

Publication date: 1987-09-22
Inventors: George R. Doddington, Bruce G. Secrest
Applicants: George R. Doddington, Bruce G. Secrest
Applicant address: Dallas, TX
Patent holder: Texas Instruments Incorporated
Current patent holder: Texas Instruments Incorporated
Current patent holder address: Dallas, TX
Main classification: G10L11/06
IPC classification: G10L11/06; G10L19/06; G10L5/00


Abstract:

This voice messaging system provides an LPC analyzer in combination with a pitch extractor wherein LPC parameters and a residual signal organized in a sequence of speech data frames are provided by the LPC analyzer as an output representative of an analog speech signal. The pitch extractor is operably associated with the LPC analyzer and produces a plurality of pitch candidates for each of the speech data frames in the sequence thereof. Dynamic programming is performed on the plurality of pitch candidates for each speech data frame and also with respect to a voiced/unvoiced decision of the speech data for each frame by tracking both pitch and voicing from frame to frame to provide an optimal pitch value and also an optimal voicing decision. During dynamic programming, a cumulative penalty for a sequence of frame pitch/voicing decisions is accumulated by defining a transition error between each pitch candidate of a current speech data frame and each pitch candidate of the preceding frame, and defining a cumulative error for each pitch candidate of the current frame equal to the transition error between the pitch candidate of the current frame plus the cumulative error of an optimally identified pitch candidate in the preceding frame to locate the track providing optimal pitch and voicing decisions based upon the lowest cumulative penalty. An encoder then encodes the LPC parameters as generated by the LPC analyzer and the optimal pitch and voicing decisions for each speech data frame for subsequent use in providing an audible synthesized speech output substantially identical to the original speech input.
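
The dynamic programming step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not give a formula for the transition error, so everything specific here is an assumption made for illustration: the function names, the use of a pitch value of 0 to mark an unvoiced candidate, the log-ratio transition error, the fixed penalty for a voicing change, and the per-candidate local error that stands in for whatever goodness measure the pitch extractor supplies.

```python
import math

UNVOICED = 0.0  # assumption: a pitch of 0 marks the "unvoiced" candidate


def transition_error(prev_pitch, cur_pitch, voicing_switch_penalty=1.0):
    """Hypothetical transition error between candidates of adjacent frames."""
    prev_voiced = prev_pitch != UNVOICED
    cur_voiced = cur_pitch != UNVOICED
    if prev_voiced and cur_voiced:
        # Penalize pitch jumps by the size of the log ratio.
        return abs(math.log(cur_pitch / prev_pitch))
    if prev_voiced != cur_voiced:
        # Penalize a change of voicing state with a fixed cost.
        return voicing_switch_penalty
    return 0.0  # unvoiced to unvoiced


def track_pitch_and_voicing(frames):
    """Pick one candidate per frame with the lowest cumulative penalty.

    frames: list of frames, each a list of (pitch, local_error) candidates.
    Returns the chosen pitch per frame (UNVOICED for unvoiced frames).
    """
    if not frames:
        return []
    cum = [local for _, local in frames[0]]  # cumulative error per candidate
    back = []                                # back pointers per frame
    for t in range(1, len(frames)):
        new_cum, pointers = [], []
        for pitch, local in frames[t]:
            # Cumulative error of each predecessor plus the transition error.
            costs = [cum[j] + transition_error(prev_pitch, pitch)
                     for j, (prev_pitch, _) in enumerate(frames[t - 1])]
            best = min(range(len(costs)), key=costs.__getitem__)
            new_cum.append(costs[best] + local)
            pointers.append(best)
        cum = new_cum
        back.append(pointers)
    # Trace back from the candidate with the lowest cumulative penalty.
    idx = min(range(len(cum)), key=cum.__getitem__)
    path = [idx]
    for pointers in reversed(back):
        idx = pointers[idx]
        path.append(idx)
    path.reverse()
    return [frames[t][i][0] for t, i in enumerate(path)]


# Made-up candidates: (pitch, local_error) per frame.
frames = [
    [(100.0, 0.1), (50.0, 0.3), (UNVOICED, 0.9)],
    [(102.0, 0.2), (51.0, 0.15), (UNVOICED, 0.9)],
    [(101.0, 0.1), (200.0, 0.05), (UNVOICED, 0.9)],
    [(UNVOICED, 0.1), (98.0, 1.5)],
]
print(track_pitch_and_voicing(frames))  # → [100.0, 102.0, 101.0, 0.0]
```

In the third frame of this made-up input, the candidate at 200 has the smallest local error, yet the tracker keeps 101 because the jump from 102 to 200 would cost more than it saves. The last frame is tracked as unvoiced in the same pass, which is the sense in which pitch and voicing are decided together.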

