USING A LOUDNESS-LEVEL-REFERENCE SEGMENT OF AUDIO TO NORMALIZE RELATIVE AUDIO LEVELS AMONG DIFFERENT AUDIO FILES WHEN COMBINING CONTENT OF THE AUDIO FILES
    1.
    发明申请
    USING A LOUDNESS-LEVEL-REFERENCE SEGMENT OF AUDIO TO NORMALIZE RELATIVE AUDIO LEVELS AMONG DIFFERENT AUDIO FILES WHEN COMBINING CONTENT OF THE AUDIO FILES 有权
    在组合音频文件的内容时,使用音频级别的音频级别来标准化不同音频文件的相对音频级别

    公开(公告)号:US20080039964A1

    公开(公告)日:2008-02-14

    申请号:US11463683

    申请日:2006-08-10

    IPC分类号: G06F17/00 H03G3/00

    摘要: The present invention records a loudness-level-reference segment of audio when creating speech audio files and audio files including background sounds. The speech audio files can then be combined with the background sound containing audio files in any desirable combination. When combining the files, the relative audio level of the files is matched, by matching the loudness-level-reference segments with each other. Any of a variety of known digital signal processing techniques can be used to normalize the component audio files. The combined audio files containing speech and background sounds (e.g. ambient noise) having matching relative audio levels can be used to test and/or train a speech recognition engine or a speech processing system.

    摘要翻译: 本发明在创建包括背景声音的语音音频文件和音频文件时记录音频的音量级参考片段。 语音音频文件然后可以与包含音频文件的背景声音以任何期望的组合组合。 当组合文件时,通过将响度级别参考分段彼此匹配来匹配文件的相对音频级别。 可以使用各种已知的数字信号处理技术中的任何一种来标准化组件音频文件。 可以使用包含具有匹配的相对音频电平的语音和背景声音(例如环境噪声)的组合音频文件来测试和/或训练语音识别引擎或语音处理系统。