Granted invention patent
- Patent title: System and method for enriching spoken language translation with prosodic information
- Patent title (Chinese): 用韵律信息丰富口语翻译的系统和方法
- Application number: US12241660
- Filing date: 2008-09-30
- Publication number: US08571849B2
- Publication date: 2013-10-29
- Inventors: Srinivas Bangalore, Vivek Kumar Rangarajan Sridhar
- Applicants: Srinivas Bangalore, Vivek Kumar Rangarajan Sridhar
- Applicant address: US GA Atlanta
- Assignee: AT&T Intellectual Property I, L.P.
- Current assignee: AT&T Intellectual Property I, L.P.
- Current assignee address: US GA Atlanta
- Main classification: G06F17/28
- IPC classification: G06F17/28
Abstract:
Disclosed herein are systems, methods, and computer-readable media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method includes receiving speech for translation to a target language, generating pitch accent labels representing segments of the received speech which are prosodically prominent, and injecting pitch accent labels with word tokens within the translation engine to create enriched target language output text. A further step of synthesizing speech in the target language based on the prosody-enriched target language output text may be added. An automatic prosody labeler can generate pitch accent labels. An automatic prosody labeler can exploit lexical, syntactic, and prosodic information of the speech. A maximum entropy model may be used to determine which segments of the speech are prosodically prominent. A pitch accent label can include an indication of certainty that a respective segment of the speech is prosodically prominent and/or an indication of prosodic prominence of a respective segment of speech.
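As an illustration only, the step of combining pitch accent labels with word tokens could be sketched as below. This is not the patented implementation: the function name `enrich_tokens`, the `word|LABEL` token format, the `ACC`/`NONE` label names, and the 0.5 threshold are all assumptions made for the example, and the probabilities stand in for the output of a prosody labeler such as the maximum entropy model mentioned in the abstract.

```python
from typing import List, Sequence


def enrich_tokens(
    words: Sequence[str],
    accent_probs: Sequence[float],
    threshold: float = 0.5,  # hypothetical cutoff, not taken from the patent
) -> List[str]:
    """Attach a pitch accent label to each word token.

    accent_probs[i] is an assumed per-word probability that words[i] is
    prosodically prominent (for example, from a trained classifier).
    """
    if len(words) != len(accent_probs):
        raise ValueError("words and accent_probs must have the same length")

    enriched = []
    for word, prob in zip(words, accent_probs):
        # Label format "word|ACC" / "word|NONE" is an illustrative convention.
        label = "ACC" if prob >= threshold else "NONE"
        enriched.append(f"{word}|{label}")
    return enriched


# Example input: three words with made-up prominence probabilities.
tokens = enrich_tokens(["I", "want", "coffee"], [0.1, 0.4, 0.9])
```

In a pipeline of this kind, the enriched tokens would be passed to the translation engine in place of plain word tokens, so that prominence information can carry through to the target language text.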