Method and apparatus for obtaining complete speech signals for speech recognition applications
    2.
    Granted invention patent
    Method and apparatus for obtaining complete speech signals for speech recognition applications (in force)

    Publication No.: US07610199B2

    Publication date: 2009-10-27

    Application No.: US11217912

    Filing date: 2005-09-01

    IPC classification: G10L15/14

    CPC classification: G10L25/87

    Abstract: The present invention relates to a method and apparatus for obtaining complete speech signals for speech recognition applications. In one embodiment, the method continuously records an audio stream comprising a sequence of frames to a circular buffer. When a user command to commence or terminate speech recognition is received, the method obtains a number of frames of the audio stream occurring before or after the user command in order to identify an augmented audio signal for speech recognition processing. In further embodiments, the method analyzes the augmented audio signal in order to locate starting and ending speech endpoints that bound at least a portion of speech to be processed for recognition. At least one of the speech endpoints is located using a Hidden Markov Model.
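
The recording scheme in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `FrameRecorder`, the parameters `capacity`, `pre_frames` and `post_frames`, and the use of `collections.deque` as the circular buffer are all assumptions made for illustration. Endpoint detection with a Hidden Markov Model is not shown.

```python
from collections import deque


class FrameRecorder:
    """Hypothetical sketch: keep recent frames so that audio just before
    a start command and just after a stop command can be recovered."""

    def __init__(self, capacity, pre_frames, post_frames):
        # deque with maxlen acts as a circular buffer: once full,
        # appending a new frame silently drops the oldest one.
        self.buffer = deque(maxlen=capacity)
        self.pre_frames = pre_frames      # frames to keep before "start"
        self.post_frames = post_frames    # frames to keep after "stop"
        self.next_index = 0               # running index of the next frame
        self.start_index = None
        self.stop_index = None

    def record(self, frame):
        """Called continuously for every incoming frame."""
        self.buffer.append((self.next_index, frame))
        self.next_index += 1

    def start_command(self):
        """User asks to commence recognition: reach back pre_frames frames."""
        self.start_index = max(0, self.next_index - self.pre_frames)
        self.stop_index = None

    def stop_command(self):
        """User asks to terminate recognition: allow post_frames more frames."""
        self.stop_index = self.next_index + self.post_frames

    def augmented_signal(self):
        """Return the frames between the extended start and stop points
        that are still held in the buffer."""
        if self.start_index is None or self.stop_index is None:
            raise ValueError("both start and stop commands are required")
        return [frame for index, frame in self.buffer
                if self.start_index <= index < self.stop_index]
```

In this sketch, frames that were overwritten before the start command arrived cannot be recovered, so `capacity` has to be at least as large as the longest expected utterance plus the pre- and post-command margins.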


    Method and apparatus for adapting original musical tracks for karaoke use
    4.
    Invention patent application
    Method and apparatus for adapting original musical tracks for karaoke use (under examination, published)

    Publication No.: US20060112812A1

    Publication date: 2006-06-01

    Application No.: US11000271

    Filing date: 2004-11-30

    IPC classification: G10H7/00

    Abstract: In one embodiment, the present invention is a method and apparatus for adapting original musical tracks for karaoke use. In one embodiment, an original musical track is separated into vocal elements and non-vocal elements. The vocal elements are aligned with corresponding text transcriptions (e.g., text-based lyrics), and the aligned text-based lyrics are then displayed to a user while the non-vocal elements are simultaneously played in a manner that is synchronous with the display of the lyrics.
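
The synchronized display step in this abstract can be sketched as a lookup from playback time to lyric line. The sketch below assumes that source separation and lyric alignment have already produced a sorted list of `(start_seconds, text)` pairs; that data layout and the function name `lyric_at` are hypothetical and do not come from the application.

```python
from bisect import bisect_right


def lyric_at(aligned_lyrics, playback_time):
    """Return the lyric line to display at playback_time (seconds).

    aligned_lyrics is an assumed list of (start_seconds, text) pairs,
    sorted by start time, as an alignment step might produce.
    """
    starts = [start for start, _ in aligned_lyrics]
    # Index of the last line whose start time is <= playback_time.
    position = bisect_right(starts, playback_time) - 1
    if position < 0:
        return ""  # playback has not reached the first lyric yet
    return aligned_lyrics[position][1]
```

A player would call `lyric_at` with the current position of the non-vocal track on every display refresh, so the text shown follows the accompaniment.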


    Method and apparatus for reading education
    5.
    Granted invention patent
    Method and apparatus for reading education (in force)

    Publication No.: US08226416B2

    Publication date: 2012-07-24

    Application No.: US11952713

    Filing date: 2007-12-07

    CPC classification: G09B19/06 G09B5/04

    Abstract: The present invention is a method and apparatus for reading education. In one embodiment, a method for recognizing an utterance spoken by a reader, includes receiving text to be read by the reader, generating a grammar for speech recognition, in accordance with the text, receiving the utterance, interpreting the utterance in accordance with the grammar, and outputting feedback indicative of reader performance.
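
The steps listed in this abstract (text in, grammar out, utterance interpreted, feedback returned) can be sketched in simplified form. The sketch below is an illustration under strong assumptions, not the patented method: the "grammar" is reduced to the expected word sequence, the utterance is assumed to arrive as a list of already recognized words, and the names `build_grammar` and `reading_feedback` are hypothetical. The comparison uses `difflib.SequenceMatcher` from the Python standard library.

```python
import re
from difflib import SequenceMatcher


def build_grammar(text):
    """Reduce the passage to its expected word sequence (assumed grammar)."""
    return re.findall(r"[a-z']+", text.lower())


def reading_feedback(grammar, recognized_words):
    """Compare recognized words against the expected sequence and
    summarize what the reader got right, skipped, misread or added."""
    matcher = SequenceMatcher(a=grammar, b=recognized_words, autojunk=False)
    feedback = {"correct": 0, "skipped": [], "misread": [], "inserted": []}
    for tag, a0, a1, b0, b1 in matcher.get_opcodes():
        if tag == "equal":
            feedback["correct"] += a1 - a0
        elif tag == "delete":      # expected words the reader left out
            feedback["skipped"].extend(grammar[a0:a1])
        elif tag == "insert":      # extra words the reader added
            feedback["inserted"].extend(recognized_words[b0:b1])
        else:                      # "replace": expected vs. spoken words
            feedback["misread"].append(
                (grammar[a0:a1], recognized_words[b0:b1]))
    return feedback
```

A real system would more likely use the grammar to constrain the recognizer itself; comparing after recognition, as done here, only keeps the example self-contained.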
