Subband normalization, transformation, and voiceness to recognize
phonemes for text messaging in a radio communication system
    Granted invention patent
    Legal status: no longer in force

    Publication No.: US6163765A

    Publication date: 2000-12-19

    Application No.: US50184

    Filing date: 1998-03-30

    IPC classification: G10L15/02 G10L25/93 H04W88/18

    Abstract: A radio communication system includes a voice recognition system (221) for converting (400) a caller's voice message to a textual speech message. The textual speech message is then transmitted to an intended selective call radio (122). To perform these functions, the radio communication system includes a caller interface circuit (218), a transmitter (116), and a processor (222). To perform voice-to-text conversion, the processor is adapted to cause the caller interface circuit to sample a voice signal generated by the caller during a plurality of frame intervals, and to apply a Fourier transform thereto, thereby generating spectral data. The spectral data is subdivided into a plurality of bands. The spectral envelope of the spectral data is then filtered out to generate filtered spectral data. A Fourier transform is applied thereto to generate an autocorrelation function for each band. From the autocorrelation function of each band, a magnitude is determined, which is representative of the degree of voiceness of each band. The degree of voiceness for each band is then applied to a corresponding plurality of phoneme models, which are used to derive a textual equivalent of speech from the voice signal. The textual equivalent of speech is then transmitted to the selective call radio by way of the transmitter.
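
The per-band voiceness measurement described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the four equal-width bands, the moving-average envelope flattening, and the normalization by the zero-lag value are all assumptions chosen for illustration. The sketch follows the stated order of operations only: transform a frame to spectral data, split it into bands, suppress the spectral envelope, transform each band again to obtain an autocorrelation-like function, and take the strongest non-zero-lag magnitude as the voiceness of that band.

```python
import cmath
import math


def dft(values):
    """Naive O(n^2) discrete Fourier transform (stdlib only, fine for short frames)."""
    n = len(values)
    return [
        sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(values))
        for k in range(n)
    ]


def flatten_envelope(power, half_width=8):
    """Divide each bin by a local moving average (assumed envelope estimate)."""
    floor = 1e-9 * (sum(power) / len(power)) + 1e-30  # guards against division by ~0
    flat = []
    for k in range(len(power)):
        window = power[max(0, k - half_width):k + half_width + 1]
        flat.append(power[k] / (sum(window) / len(window) + floor))
    return flat


def band_voiceness(frame, num_bands=4):
    """Return one voiceness score (roughly 0 to 1) per band for a single frame."""
    half = len(frame) // 2
    power = [abs(c) ** 2 for c in dft(frame)[:half]]  # spectral data
    flat = flatten_envelope(power)                     # envelope suppressed
    width = half // num_bands                          # hypothetical equal-width bands
    scores = []
    for b in range(num_bands):
        band = flat[b * width:(b + 1) * width]
        acf = [abs(c) for c in dft(band)]              # autocorrelation magnitude
        peak = max(acf[1:width // 2 + 1])              # strongest non-zero lag
        scores.append(peak / acf[0] if acf[0] > 0 else 0.0)
    return scores
```

With a 256-sample frame, a pulse train that repeats every 16 samples (a crude stand-in for voiced speech) scores close to 1 in every band, because its harmonics are evenly spaced within each band. A single click, whose spectrum is flat, scores close to 0. In a full recognizer these per-band scores would form part of the feature vector passed to the phoneme models; that stage is outside this sketch.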
