Interactive debugging and tuning method for CTTS voice building
    1.
    Granted patent
    Status: in force

    Publication No.: US07487092B2

    Publication date: 2009-02-03

    Application No.: US10688041

    Filing date: 2003-10-17

    IPC classification: G10L13/08

    CPC classification: G10L13/033

    Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

    Method for detecting misaligned phonetic units for a concatenative text-to-speech voice
    2.
    Granted patent
    Status: in force

    Publication No.: US07280967B2

    Publication date: 2007-10-09

    Application No.: US10630113

    Filing date: 2003-07-30

    IPC classification: G10L13/08 G10L15/00

    CPC classification: G10L13/06

    Abstract: A method of filtering phonetic units to be used within a concatenative text-to-speech (CTTS) voice. Initially, a normality threshold can be established. At least one phonetic unit that has been automatically extracted from a speech corpus in order to construct the CTTS voice can be received. An abnormality index can be calculated for the phonetic unit. Then, the abnormality index can be compared to the established normality threshold. If the abnormality index exceeds the normality threshold, the phonetic unit can be marked as a suspect phonetic unit. If the abnormality index does not exceed the normality threshold, the phonetic unit can be marked as a verified phonetic unit. The concatenative text-to-speech voice can be built using the verified phonetic units.
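
The filtering step in this abstract can be sketched roughly as below. The abstract does not say how the abnormality index is computed, so this sketch assumes, purely for illustration, an absolute z-score of a unit's duration among units sharing its phoneme label. The names `PhoneticUnit`, `abnormality_index`, and `filter_units` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class PhoneticUnit:
    label: str          # phoneme label, e.g. "AE"
    duration_ms: float  # duration of the automatically extracted segment


def abnormality_index(unit, peers):
    """Assumed index: absolute z-score of the unit's duration among its peers."""
    durations = [p.duration_ms for p in peers]
    spread = pstdev(durations)
    if spread == 0:
        return 0.0  # all peers identical, so nothing looks abnormal
    return abs(unit.duration_ms - mean(durations)) / spread


def filter_units(units, normality_threshold):
    """Split units into (verified, suspect) by comparing index to threshold."""
    verified, suspect = [], []
    for unit in units:
        peers = [u for u in units if u.label == unit.label]
        index = abnormality_index(unit, peers)
        # Exceeding the threshold marks the unit as suspect; otherwise verified.
        if index > normality_threshold:
            suspect.append(unit)
        else:
            verified.append(unit)
    return verified, suspect


# Five plausible segments and one that is far too long (likely misaligned).
units = [PhoneticUnit("AE", d) for d in (80, 82, 79, 81, 78, 300)]
verified, suspect = filter_units(units, normality_threshold=2.0)
```

In this toy run only the 300 ms segment is marked suspect; a voice build would then proceed from the `verified` list alone.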

    Interactive debugging and tuning of methods for CTTS voice building
    3.
    Granted patent
    Status: in force

    Publication No.: US07853452B2

    Publication date: 2010-12-14

    Application No.: US12327579

    Filing date: 2008-12-03

    IPC classification: G10L13/06

    CPC classification: G10L13/033

    Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

    INTERACTIVE DEBUGGING AND TUNING OF METHODS FOR CTTS VOICE BUILDING
    4.
    Patent application
    Status: in force

    Publication No.: US20090083037A1

    Publication date: 2009-03-26

    Application No.: US12327579

    Filing date: 2008-12-03

    IPC classification: G10L13/08

    CPC classification: G10L13/033

    Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

    Interactive debugging and tuning method for CTTS voice building
    5.
    Patent application
    Status: in force

    Publication No.: US20050086060A1

    Publication date: 2005-04-21

    Application No.: US10688041

    Filing date: 2003-10-17

    IPC classification: G10L11/00 G10L13/02

    CPC classification: G10L13/033

    Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

    Object specific language extension interface for a multi-level data structure
    6.
    Granted patent
    Status: lapsed

    Publication No.: US07464065B2

    Publication date: 2008-12-09

    Application No.: US11284150

    Filing date: 2005-11-21

    IPC classification: G06F17/00 G06N5/02

    CPC classification: G10L13/04 G06F17/2775

    Abstract: A computerized method (300) and software product (200) is provided for querying and modifying a Multi-Level Data Structure (106) stored in a Text-to-Speech (100) engine of a data processing system having a Central Processing Unit (202), a processing system memory (203), and an operating system (201), using an application program written in an interpretive programming language. The method includes the steps of initializing (302) by means of the CPU implementing a set of commands, a data processing environment for processing the application program, processing (306) the application program, where the processing includes identifying a marked command that encapsulates a DPMS program, and upon identifying a marked command, operating (318) on the MLDS using a DPMS interpreter for producing a result from the MLDS, the result available to the application program during execution of the application program.

    System and method for word-sense disambiguation by recursive partitioning
    7.
    Granted patent
    Status: in force

    Publication No.: US08099281B2

    Publication date: 2012-01-17

    Application No.: US11145656

    Filing date: 2005-06-06

    Applicant: Philip Gleason

    Inventor: Philip Gleason

    CPC classification: G10L13/08

    Abstract: A device and related methods for word-sense disambiguation during a text-to-speech conversion are provided. The device, for use with a computer-based system capable of converting text data to synthesized speech, includes an identification module for identifying a homograph contained in the text data. The device also includes an assignment module for assigning a pronunciation to the homograph using a statistical test constructed from a recursive partitioning of training samples, each training sample being a word string containing the homograph. The recursive partitioning is based on determining for each training sample an order and a distance of each word indicator relative to the homograph in the training sample. An absence of one of the word indicators in a training sample is treated as equivalent to the absent word indicator being more than a predefined distance from the homograph.
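
The order-and-distance feature described in this abstract, including its treatment of absent indicators, might look something like the sketch below. The helper names (`indicator_feature`, `pronounce_bass`), the `FAR` sentinel, the example indicator word, and the hand-written split standing in for a learned partition are all assumptions made for illustration; the abstract does not specify them.

```python
FAR = "far"  # sentinel: indicator is absent or beyond the predefined distance


def indicator_feature(tokens, homograph, indicator, max_distance):
    """Return (order, distance) of the nearest indicator, or FAR.

    order is "before" or "after" relative to the homograph; distance is
    counted in tokens. An absent indicator is deliberately mapped to the
    same value as one lying more than max_distance tokens away.
    """
    target = tokens.index(homograph)
    offsets = [i - target for i, tok in enumerate(tokens) if tok == indicator]
    if not offsets:
        return FAR
    nearest = min(offsets, key=abs)
    if abs(nearest) > max_distance:
        return FAR
    return ("before" if nearest < 0 else "after", abs(nearest))


def pronounce_bass(tokens, max_distance=3):
    """Hand-written stand-in for a single split of a partition tree."""
    if indicator_feature(tokens, "bass", "guitar", max_distance) != FAR:
        return "B EY S"   # musical sense
    return "B AE S"       # fish sense


near = indicator_feature("he played the bass guitar".split(), "bass", "guitar", 3)
absent = indicator_feature("he caught a bass".split(), "bass", "guitar", 3)
distant = indicator_feature("guitar a b c d e bass".split(), "bass", "guitar", 3)
```

Mapping both the absent and the distant case to one value means a split on this feature never has to handle missing data separately, which appears to be the point of the equivalence stated in the abstract.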

    Object specific language extension interface for a multi-level data structure
    8.
    Patent application
    Status: lapsed

    Publication No.: US20070118489A1

    Publication date: 2007-05-24

    Application No.: US11284150

    Filing date: 2005-11-21

    IPC classification: G06F15/18

    CPC classification: G10L13/04 G06F17/2775

    Abstract: A computerized method (300) and software product (200) is provided for querying and modifying a Multi-Level Data Structure (106) stored in a Text-to-Speech (100) engine of a data processing system having a Central Processing Unit (202), a processing system memory (203), and an operating system (201), using an application program written in an interpretive programming language. The method includes the steps of initializing (302) by means of the CPU implementing a set of commands, a data processing environment for processing the application program, processing (306) the application program, where the processing includes identifying a marked command that encapsulates a DPMS program, and upon identifying a marked command, operating (318) on the MLDS using a DPMS interpreter for producing a result from the MLDS, the result available to the application program during execution of the application program.

    System and method for word-sense disambiguation by recursive partitioning
    9.
    Patent application
    Status: in force

    Publication No.: US20060277045A1

    Publication date: 2006-12-07

    Application No.: US11145656

    Filing date: 2005-06-06

    Applicant: Philip Gleason

    Inventor: Philip Gleason

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: A device and related methods for word-sense disambiguation during a text-to-speech conversion are provided. The device, for use with a computer-based system capable of converting text data to synthesized speech, includes an identification module for identifying a homograph contained in the text data. The device also includes an assignment module for assigning a pronunciation to the homograph using a statistical test constructed from a recursive partitioning of training samples, each training sample being a word string containing the homograph. The recursive partitioning is based on determining for each training sample an order and a distance of each word indicator relative to the homograph in the training sample. An absence of one of the word indicators in a training sample is treated as equivalent to the absent word indicator being more than a predefined distance from the homograph.

    Method for detecting misaligned phonetic units for a concatenative text-to-speech voice
    10.
    Patent application
    Status: in force

    Publication No.: US20050027531A1

    Publication date: 2005-02-03

    Application No.: US10630113

    Filing date: 2003-07-30

    IPC classification: G10L13/06 G10L13/08

    CPC classification: G10L13/06

    Abstract: A method of filtering phonetic units to be used within a concatenative text-to-speech (CTTS) voice. Initially, a normality threshold can be established. At least one phonetic unit that has been automatically extracted from a speech corpus in order to construct the CTTS voice can be received. An abnormality index can be calculated for the phonetic unit. Then, the abnormality index can be compared to the established normality threshold. If the abnormality index exceeds the normality threshold, the phonetic unit can be marked as a suspect phonetic unit. If the abnormality index does not exceed the normality threshold, the phonetic unit can be marked as a verified phonetic unit. The concatenative text-to-speech voice can be built using the verified phonetic units.
