-
公开(公告)号:US20040243397A1
公开(公告)日:2004-12-02
申请号:US10795962
申请日:2004-03-08
Applicant: STMicroelectronics Asia Pacific Pte Ltd
Inventor: Charles Averty , Xue Yao , Ranjot Singh
IPC: G10L019/00 , G10L019/10
CPC classification number: G10L19/032
Abstract: A mask generation process for use in encoding audio data, including generating linear masking components from the audio data, generating logarithmic masking components from the linear masking components, and generating a global masking threshold from the logarithmic masking components. The process is a psychoacoustic masking process for use in an MPEG-1-L2 encoder, and includes generating energy values from a Fourier transform of the audio data, determining sound pressure level values from the energy values, selecting tonal and non-tonal masking components on the basis of the energy values, generating power values from the energy values, generating masking thresholds on the basis of the masking components and the power values, and generating signal to mask ratios for a quantizier on the basis of the sound pressure level values and the masking thresholds.
Abstract translation: 一种用于对音频数据进行编码的掩码产生过程,包括从音频数据生成线性屏蔽分量,从线性屏蔽分量产生对数掩蔽分量,以及从对数掩蔽分量生成全局掩蔽阈值。 该过程是用于MPEG-1-L2编码器的心理声学屏蔽过程,并且包括从音频数据的傅里叶变换产生能量值,从能量值确定声压级值,选择色调和非色调掩蔽分量 基于能量值,从能量值产生功率值,基于掩蔽分量和功率值产生掩蔽阈值,并且基于声压级值生成量子信号与掩模比,以及 掩蔽阈值。