Method and apparatus to use semantic inference with speech recognition systems
    21.
    Type: Patent application
    Status: In force

    Publication number: US20050038650A1

    Publication date: 2005-02-17

    Application number: US10947429

    Filing date: 2004-09-21

    IPC classification: G10L15/18

    CPC classification: G10L15/1822 G10L15/193

    Abstract: A method and apparatus to use semantic inference with speech recognition systems includes recognizing at least one spoken word, processing the spoken word using a context-free grammar, deriving an output from the context-free grammar, and translating the output to a predetermined command.

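The pipeline in this abstract (recognize words, process them with a context-free grammar, derive an output, translate it to a predetermined command) can be illustrated with a minimal sketch. The grammar rules, semantic tags, and command names below are hypothetical and are not taken from the patent; the recognizer is assumed to have already produced a text string.

```python
# Toy context-free grammar. All rules, tags, and commands are illustrative assumptions.
GRAMMAR = {
    "S": [["ACTION", "OBJECT"], ["ACTION", "the", "OBJECT"]],
    "ACTION": [["open"], ["launch"], ["close"], ["quit"]],
    "OBJECT": [["mail"], ["browser"]],
}

# Semantic tags on terminals: different spoken words can carry the same meaning.
TAGS = {
    "open": "OPEN", "launch": "OPEN",
    "close": "CLOSE", "quit": "CLOSE",
    "mail": "MAIL", "browser": "BROWSER",
}

# Predetermined commands keyed by the tag sequence derived from the grammar.
COMMANDS = {
    ("OPEN", "MAIL"): "OPEN_MAIL",
    ("OPEN", "BROWSER"): "OPEN_BROWSER",
    ("CLOSE", "MAIL"): "CLOSE_MAIL",
    ("CLOSE", "BROWSER"): "CLOSE_BROWSER",
}


def parse(symbol, words, pos):
    """Yield (next_pos, tags) for each way `symbol` can match words[pos:]."""
    if symbol not in GRAMMAR:  # terminal symbol: must equal the next word
        if pos < len(words) and words[pos] == symbol:
            yield pos + 1, ([TAGS[symbol]] if symbol in TAGS else [])
        return
    for production in GRAMMAR[symbol]:
        yield from match_sequence(production, words, pos)


def match_sequence(symbols, words, pos):
    """Match a sequence of grammar symbols, concatenating their semantic tags."""
    if not symbols:
        yield pos, []
        return
    for mid, head_tags in parse(symbols[0], words, pos):
        for end, tail_tags in match_sequence(symbols[1:], words, mid):
            yield end, head_tags + tail_tags


def interpret(utterance):
    """Return the command for a recognized utterance, or None if it does not parse."""
    words = utterance.lower().split()
    for end, tags in parse("S", words, 0):
        if end == len(words):  # accept only parses that consume every word
            return COMMANDS.get(tuple(tags))
    return None
```

Because "launch" and "open" share one tag, `interpret("launch the browser")` and `interpret("open browser")` resolve to the same command, which is the point of separating surface wording from meaning.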

    Graduated visual and manipulative translucency for windows
    22.
    Type: Granted patent
    Status: In force

    Publication number: US06670970B1

    Publication date: 2003-12-30

    Application number: US09467316

    Filing date: 1999-12-20

    IPC classification: G09G5/00

    Abstract: Methods and systems for providing graphical user interfaces are described. Overlaid, information-bearing windows whose contents remain unchanged for a predetermined period of time become translucent. The translucency can be graduated so that, over time, if the window's contents remain unchanged, the window becomes more translucent. In addition to visual translucency, windows according to the present invention also have a manipulative translucent quality. Upon reaching a certain level of visual translucency, user input in the region of the window is interpreted as an operation on the underlying objects rather than the contents of the overlaying window.

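The two behaviors in this abstract, graduated visual translucency and input passing through once a translucency level is reached, can be sketched as follows. The class name, the linear fade, and every threshold value are assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class OverlayWindow:
    """Hypothetical overlay window; all default values are illustrative."""
    idle_threshold: float = 5.0        # seconds unchanged before fading starts
    fade_duration: float = 10.0        # seconds to go from opaque to max translucency
    max_translucency: float = 0.8      # 0.0 = opaque, 1.0 = invisible
    click_through_level: float = 0.5   # translucency at which input passes through
    last_change: float = 0.0           # time of the last content change

    def content_changed(self, now: float) -> None:
        """Any change to the contents resets the idle clock."""
        self.last_change = now

    def translucency(self, now: float) -> float:
        """Translucency grows linearly with idle time, then holds at the maximum."""
        idle = now - self.last_change
        if idle <= self.idle_threshold:
            return 0.0
        progress = min((idle - self.idle_threshold) / self.fade_duration, 1.0)
        return progress * self.max_translucency

    def route_input(self, now: float) -> str:
        """Decide whether input in the window's region targets it or what lies beneath."""
        if self.translucency(now) >= self.click_through_level:
            return "underlying"
        return "overlay"
```

With these defaults, a window idle for 10 seconds is partly faded but still receives input, while one idle for 15 seconds lets clicks fall through to the objects beneath it until its contents change again.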

    Graduated Visual and Manipulative Translucency for Windows
    23.
    Type: Patent application
    Status: In force

    Publication number: US20080155438A1

    Publication date: 2008-06-26

    Application number: US12046171

    Filing date: 2008-03-11

    IPC classification: G06F3/048

    Abstract: Methods and systems for providing graphical user interfaces are described. Overlaid, information-bearing windows whose contents remain unchanged for a predetermined period of time become translucent. The translucency can be graduated so that, over time, if the window's contents remain unchanged, the window becomes more translucent. In addition to visual translucency, windows also have a manipulative translucent quality. Upon reaching a certain level of visual translucency, user input in the region of the window is interpreted as an operation on the underlying objects rather than the contents of the overlaying window.


    Speech synthesis method for operator assistance telecommunications calls comprising a plurality of text-to-speech (TTS) devices
    24.
    Type: Granted patent
    Status: Lapsed

    Publication number: US5832433A

    Publication date: 1998-11-03

    Application number: US669145

    Filing date: 1996-06-24

    IPC classification: H04M3/493 G10L5/00

    Abstract: Methods and apparatus are described for providing automated operator services and, in particular, a reverse directory assistance service. A calling customer is connected to an automated system that prompts the caller for a listing identifier, which the system uses to retrieve the corresponding textual listing from a database of textual listings. The textual listing contains a TTS ID which identifies a particular TTS device from a plurality of TTS devices, and the listing is optionally preprocessed and parsed into a plurality of fields which define the listing. The listing text is then sent to that particular TTS device for text-to-speech synthesis of the text contained within the listing. The method further includes teaching the system which TTS device of the plurality of TTS devices best synthesizes the text contained within the listing and then identifying that TTS device within the listing so that subsequent synthesis will utilize that TTS device.

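The routing idea in this abstract (each listing records which TTS device renders it best, and later requests reuse that device) can be sketched as below. The device table, the `|` field separator, the sample listing, and the scoring callback are all hypothetical stand-ins; real TTS devices and the patent's teaching procedure are not modeled.

```python
from typing import Callable, Dict, List

# Stand-ins for real TTS devices: each "device" just labels the text it would speak.
TTS_DEVICES: Dict[str, Callable[[str], str]] = {
    "tts_a": lambda text: f"[A] {text}",
    "tts_b": lambda text: f"[B] {text}",
}
DEFAULT_TTS_ID = "tts_a"  # assumed fallback when a listing has no TTS ID yet

# Hypothetical listing database keyed by listing identifier (e.g. a phone number).
LISTINGS: Dict[str, dict] = {
    "5551234": {"text": "John Smith | 12 Elm St", "tts_id": None},
}


def parse_fields(text: str) -> List[str]:
    """Split listing text into fields; the '|' separator is an assumption."""
    return [field.strip() for field in text.split("|")]


def speak_listing(listing_id: str) -> str:
    """Retrieve a listing and send its text to the device named by its TTS ID."""
    listing = LISTINGS[listing_id]
    device = TTS_DEVICES[listing["tts_id"] or DEFAULT_TTS_ID]
    return device(", ".join(parse_fields(listing["text"])))


def teach(listing_id: str, score: Callable[[str, str], float]) -> str:
    """Store the ID of the best-scoring device in the listing for later synthesis.

    `score(tts_id, text)` is a hypothetical quality measure, such as an
    operator's rating of how well that device pronounces the text.
    """
    listing = LISTINGS[listing_id]
    best = max(TTS_DEVICES, key=lambda tts_id: score(tts_id, listing["text"]))
    listing["tts_id"] = best
    return best
```

Storing the choice in the listing itself means the comparison between devices happens once, and every later call for that listing goes straight to the preferred device.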

    Graduated visual and manipulative translucency for windows
    25.
    Type: Granted patent
    Status: In force

    Publication number: US07343562B2

    Publication date: 2008-03-11

    Application number: US10702969

    Filing date: 2003-11-05

    IPC classification: G06F15/00 G09G5/00

    Abstract: Methods and systems for providing graphical user interfaces are described. Overlaid, information-bearing windows whose contents remain unchanged for a predetermined period of time become translucent. The translucency can be graduated so that, over time, if the window's contents remain unchanged, the window becomes more translucent. In addition to visual translucency, windows according to the present invention also have a manipulative translucent quality. Upon reaching a certain level of visual translucency, user input in the region of the window is interpreted as an operation on the underlying objects rather than the contents of the overlaying window.


    Method and apparatus for improved duration modeling of phonemes
    26.
    Type: Granted patent
    Status: In force

    Publication number: US06366884B1

    Publication date: 2002-04-02

    Application number: US09436048

    Filing date: 1999-11-08

    IPC classification: G10L13/00

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model. An inverse of the non-exponential functional transformation is applied to duration observations, or training data. Coefficients are generated for use with the generalized additive model. The generalized additive model comprising the coefficients is applied to at least one phoneme of the received text resulting in the generation of at least one phoneme having a duration. An acoustic sequence is generated comprising speech signals that are representative of the received text.

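A rough sketch of how a bounded transformation and an additive model of contextual factors could fit together is given below. The abstract does not state the formula, so the sine-of-a-power form, the exponents `alpha` and `beta`, the factor names, and all coefficient values are assumptions; coefficient fitting from training data is omitted.

```python
import math


def to_model_space(duration, d_min, d_max, alpha=1.0, beta=1.0):
    """Map a duration in [d_min, d_max] to [0, 1] using an assumed sine-root form."""
    u = (duration - d_min) / (d_max - d_min)
    return math.sin(math.pi / 2 * u ** alpha) ** beta


def to_duration(value, d_min, d_max, alpha=1.0, beta=1.0):
    """Invert to_model_space; clamping keeps the result inside [d_min, d_max]."""
    v = min(max(value, 0.0), 1.0)
    u = (2 / math.pi * math.asin(v ** (1 / beta))) ** (1 / alpha)
    return d_min + u * (d_max - d_min)


def predict_duration(factors, coefficients, intercept, d_min, d_max):
    """Additive model: intercept plus one coefficient per active contextual factor."""
    score = intercept + sum(
        coefficients[(name, level)] for name, level in factors.items()
    )
    return to_duration(score, d_min, d_max)


# Hypothetical coefficients; in practice they would be estimated from
# training durations after mapping them with to_model_space.
COEFFICIENTS = {
    ("stress", "stressed"): 0.15,
    ("stress", "unstressed"): -0.05,
    ("position", "final"): 0.20,
    ("position", "medial"): 0.0,
}

predicted_ms = predict_duration(
    {"stress": "stressed", "position": "final"},
    COEFFICIENTS, intercept=0.4, d_min=30.0, d_max=200.0,
)
```

The design point this illustrates is that the minimum and maximum durations observed in training bound the mapping, so no combination of factor coefficients can yield a duration outside that range.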

    Method and apparatus for improved duration modeling of phonemes

    Publication number: US6064960A

    Publication date: 2000-05-16

    Application number: US993940

    Filing date: 1997-12-18

    IPC classification: G10L13/08

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model. An inverse of the non-exponential functional transformation is applied to duration observations, or training data. Coefficients are generated for use with the generalized additive model. The generalized additive model comprising the coefficients is applied to at least one phoneme of the received text resulting in the generation of at least one phoneme having a duration. An acoustic sequence is generated comprising speech signals that are representative of the received text.

    Method and apparatus for speech synthesis using paralinguistic variation
    28.
    Type: Granted patent
    Status: In force

    Publication number: US08103505B1

    Publication date: 2012-01-24

    Application number: US10718140

    Filing date: 2003-11-19

    IPC classification: G10L13/08

    CPC classification: G10L13/033 G10L13/10

    Abstract: A method and apparatus for speech synthesis in a computer-user interface using random paralinguistic variation is described herein. According to one aspect of the present invention, a method for synthesizing speech comprises generating synthesized speech having certain prosodic features. The synthesized speech is further processed by applying a random paralinguistic variation to the acoustic sequence representing the synthesized speech without altering the linguistic prosodic features. According to one aspect of the present invention, the application of the paralinguistic variation is correlated with a previously applied paralinguistic variation to reflect a gradual change in the computer voice, while still maintaining a random quality.

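One way to obtain a variation that is random yet correlated with the previously applied one is a first-order autoregressive step, sketched below. This is an assumed technique chosen for illustration, not the method claimed in the patent. The sketch also assumes the variation is a pitch offset in semitones applied uniformly to each utterance's pitch contour, so the contour's relative shape is unchanged.

```python
import random


def next_variation(previous, correlation=0.8, spread=1.0, rng=random):
    """AR(1) step: stay close to the previous offset while adding fresh randomness."""
    noise = rng.gauss(0.0, spread)
    return correlation * previous + (1 - correlation ** 2) ** 0.5 * noise


def vary_utterances(pitch_contours_hz, seed=None):
    """Shift each utterance's pitch contour by a slowly drifting random offset.

    Every value in a contour is scaled by the same factor, so the ratios
    between its pitch values (its relative shape) are preserved.
    """
    rng = random.Random(seed)
    offset_semitones = 0.0
    varied = []
    for contour in pitch_contours_hz:
        offset_semitones = next_variation(offset_semitones, rng=rng)
        factor = 2 ** (offset_semitones / 12)  # semitones to frequency ratio
        varied.append([f0 * factor for f0 in contour])
    return varied
```

Raising `correlation` toward 1 makes the voice drift more gradually between utterances; setting it to 0 makes each utterance's offset independent of the last.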

    Graduated visual and manipulative translucency for windows

    Publication number: US08775959B2

    Publication date: 2014-07-08

    Application number: US12046171

    Filing date: 2008-03-11

    IPC classification: G06F15/00 G06F13/00

    Abstract: Methods and systems for providing graphical user interfaces are described. Overlaid, information-bearing windows whose contents remain unchanged for a predetermined period of time become translucent. The translucency can be graduated so that, over time, if the window's contents remain unchanged, the window becomes more translucent. In addition to visual translucency, windows also have a manipulative translucent quality. Upon reaching a certain level of visual translucency, user input in the region of the window is interpreted as an operation on the underlying objects rather than the contents of the overlaying window.