Method and apparatus for providing an interactive language tutor
    1.
    发明授权
    Method and apparatus for providing an interactive language tutor 有权
    用于提供交互式语言导师的方法和装置

    公开(公告)号:US07299188B2

    公开(公告)日:2007-11-20

    申请号:US10361256

    申请日:2003-02-10

    IPC分类号: G10L11/00 G10L21/06

    CPC分类号: G06F17/289 G10L15/02

    摘要: A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intonation-scoring engine to derive thereby the pronunciation score. The scores provided by the various scoring engines are adapted to provide a visual and/or numerical feedback that provides information pertaining to correctness or incorrectness in one or more speech-features such as intonation, articulation, voicing, phoneme error and relative word duration. Such useful interactive feedback will allow a user to quickly identify the problem area and take remedial action in reciting “tutor” sentences or phrases.

    摘要翻译: 一种用于通过接收旨在符合参考短语的用户短语并根据关节计分引擎,持续时间评分引擎和语调评分引擎中的至少一个来生成发音分数的方法和装置, 从而得出发音得分。 各种评分引擎提供的分数适于提供视觉和/或数值反馈,其提供关于一个或多个语音特征(例如语调,发音,发声,音素错误和相对词长度)中的正确性或不正确性的信息。 这种有用的交互式反馈将允许用户快速识别问题区域,并采取补救措施来背诵“辅导”句子或短语。

    Automatic assessment of phonological processes
    2.
    发明授权
    Automatic assessment of phonological processes 有权
    自动评估语音过程

    公开(公告)号:US07302389B2

    公开(公告)日:2007-11-27

    申请号:US10637235

    申请日:2003-08-08

    IPC分类号: G10L15/26

    CPC分类号: G09B19/06 G10L15/02

    摘要: A computer-based system generates alternative phonetic transcriptions for a target word or phrase corresponding to specific phonological processes that replace individual phonemes or clusters of two or more phonemes with replacement phonemes. The system compares a user's speech with a list of possible transcriptions that includes the base (i.e., correct) transcription of the test target as well as the different alternative transcriptions, to identify the transcription that best matches the user's. In a speech therapy application, the system identifies the phonological process(es), if any, associated with the user's speech and generates statistics over multiple test targets that can be used to diagnose the user's specific phonological disorders. The system can also be implemented in other contexts such as foreign language instruction and automated attendant applications to cover a wide variety and range of accents and/or phonological disorders.

    摘要翻译: 基于计算机的系统产生用于替换具有替换音素的两个或多个音素的单个音素或簇的特定语音过程的目标词或短语的替代语音转录。 该系统将用户的语音与包括测试目标的基础(即,正确)转录以及不同的替代转录的可能转录的列表进行比较,以识别与用户最匹配的转录。 在语音治疗应用中,系统识别与用户语音相关联的语音过程(如果有的话),并产生可用于诊断用户的特定语音障碍的多个测试目标的统计。 该系统还可以在诸如外语指令和自动应答之类的其他情况下实现,以覆盖广泛的各种各样的口音和/或语音障碍。

    Method and system to compensate for the effects of packet delays on speech quality in a Voice-over IP system
    3.
    发明授权
    Method and system to compensate for the effects of packet delays on speech quality in a Voice-over IP system 有权
    用于补偿分组延迟对语音IP系统中的语音质量的影响的方法和系统

    公开(公告)号:US07266127B2

    公开(公告)日:2007-09-04

    申请号:US10068023

    申请日:2002-02-08

    IPC分类号: H04L12/56

    摘要: The system includes a jitter buffer for receiving speech packets in a Voice over Internet Protocol (VoIP) system, a playback device for adjusting the playback speed of the received speed packets, and a jitter buffer manager for detecting out of sequence packets in the jitter buffer and for sending commands to the playback device to adjust playback speed based on the detection. The speech signal is played back at the nominal speed when there are no out of sequence packets. The playback speed is decreased when an out of sequence packet is detected, thereby tending to increase the jitter buffer length. When an out of sequence packet arrives, the playback speed is increased in order to restore jitter buffer length to its nominal length.

    摘要翻译: 该系统包括用于在因特网协议语音(VoIP)系统中接收语音分组的抖动缓冲器,用于调整接收到的速度分组的回放速度的回放装置,以及用于检测抖动缓冲器中的顺序分组的抖动缓冲器管理器 并且用于根据检测向播放装置发送命令以调整播放速度。 当没有不合格的数据包时,语音信号以标称速度播放。 当检测到异步分组时,播放速度降低,从而趋于增加抖动缓冲器长度。 当序列分组到达时,播放速度增加,以将抖动缓冲区长度恢复到其标称长度。

    Flow Control in Real-Time Transmission of Non-Uniform Data Rate Encoded Video Over a Universal Serial Bus
    4.
    发明申请
    Flow Control in Real-Time Transmission of Non-Uniform Data Rate Encoded Video Over a Universal Serial Bus 审中-公开
    通过通用串行总线实时传输非均匀数据速率编码视频的流量控制

    公开(公告)号:US20110302334A1

    公开(公告)日:2011-12-08

    申请号:US13118229

    申请日:2011-05-27

    IPC分类号: G06F3/00

    CPC分类号: H04L12/40136 G09G2340/02

    摘要: A method is provided that includes coding pictures by a video encoder in a digital camera to form a compressed video bit stream for real-time transmission to a host digital system coupled to the digital camera by a universal serial bus (USB), wherein an output data rate of the video encoder is at least sometimes higher than an operating data rate of the host digital system, and applying flow control in the digital camera to maintain an output data rate over the USB to the host digital system of the compressed video bit stream below the operating data rate of the host digital system.

    摘要翻译: 提供一种方法,其包括通过数字照相机中的视频编码器对图像进行编码以形成压缩视频比特流,用于通过通用串行总线(USB)实时传输到耦合到数字照相机的主机数字系统,其中输出 视频编码器的数据速率至少有时高于主机数字系统的操作数据速率,并且在数字照相机中应用流量控制以将USB上的输出数据速率保持为压缩视频比特流的主机数字系统 低于主机数字系统的运行数据速率。

    Intonation transformation for speech therapy and the like
    5.
    发明授权
    Intonation transformation for speech therapy and the like 有权
    语音治疗的语调转换等

    公开(公告)号:US07373294B2

    公开(公告)日:2008-05-13

    申请号:US10438642

    申请日:2003-05-15

    IPC分类号: G10L11/04 G10L21/00

    摘要: The intonation of speech is modified by an appropriate combination of resampling and time-domain harmonic scaling. Resampling increases (upsampling) or decreases (downsampling) the number of data points in a signal. Harmonic scaling adds or removes pitch cycles to or from a signal. The pitch of a speech signal can be increased by combining downsampling with harmonic scaling that adds an appropriate number of pitch cycles. Alternatively, pitch can be decreased by combining upsampling with harmonic scaling that removes an appropriate number of pitch cycles. The present invention can be implemented in an automated speech-therapy tool that is able to modify the intonation of prerecorded reference speech signals for playback to a user to emphasize the correct pronunciation by increasing the pitch of selected portions of words or phrases that the user had previously mispronounced.

    摘要翻译: 通过重采样和时域谐波缩放的适当组合来修改语音的语调。 重采样增加(上采样)或降低(下采样)信号中数据点的数量。 谐波缩放可以增加或去除信号的音调周期。 语音信号的音调可以通过将下采样与谐波缩放相结合来增加,该谐波缩放增加适当数量的音调周期。 或者,可以通过组合上采样与谐波缩放来去除适当数量的音调周期来减小音调。 本发明可以在自动言语治疗工具中实现,该自动化语音治疗工具能够通过增加用户具有的单词或短语的选定部分的音调来修改预先记录的参考语音信号的音调以便播放给用户以强调正确的发音 以前是错误的。