NEUROEVOLUTION-BASED ARTIFICIAL BANDWIDTH EXPANSION OF TELEPHONE BAND SPEECH
    1.
    Invention publication
    Status: under examination (published)

    Publication number: EP1766614A2

    Publication date: 2007-03-28

    Application number: EP05739447.0

    Filing date: 2005-05-09

    Applicant: Nokia Corporation

    IPC classification: G10L21/02

    Abstract: Artificial bandwidth expansion devices, systems, methods and computer code products are disclosed for expanding a narrowband speech signal into an artificially expanded wideband speech signal. Embodiments of the invention can operate by forming an unshaped wideband signal based on the narrowband speech signal, such as through aliasing, and shaping the wideband signal into the artificially expanded wideband speech signal by amplifying/attenuating the unshaped wideband signal using a function generated by a neural network. Weights of the neural network can be set by a training/learning subsystem which generates genomes containing the neural network weights based on simulated environments in which a device employing the artificial bandwidth expansion is expected to operate.
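
The shaping and training idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the layer sizes, the two-band layout, the feature values, the target levels, and the simple mutate-and-select loop are all assumptions chosen for illustration, and the made-up examples stand in for the "simulated environments" the abstract mentions. A genome is a flat list of network weights; the network maps frame features to positive gains that amplify or attenuate the unshaped upper-band levels.

```python
import math
import random

N_IN, N_HID, N_OUT = 2, 3, 2                 # hypothetical layer sizes
GENOME_LEN = N_IN * N_HID + N_HID * N_OUT    # one genome = all network weights

def gains_from_genome(genome, features):
    """Tiny feedforward net: frame features -> positive per-band gains."""
    w1 = genome[:N_IN * N_HID]
    w2 = genome[N_IN * N_HID:]
    hidden = [math.tanh(sum(features[i] * w1[i * N_HID + h] for i in range(N_IN)))
              for h in range(N_HID)]
    # exp keeps every gain positive: above 1 amplifies, below 1 attenuates
    return [math.exp(sum(hidden[h] * w2[h * N_OUT + o] for h in range(N_HID)))
            for o in range(N_OUT)]

def shape(unshaped_bands, gains):
    """Scale each unshaped upper-band level by its gain."""
    return [b * g for b, g in zip(unshaped_bands, gains)]

# Made-up examples: (features, unshaped band levels, target band levels)
EXAMPLES = [
    ([0.2, 0.9], [1.0, 1.0], [0.5, 0.1]),
    ([0.8, 0.1], [1.0, 1.0], [1.5, 0.7]),
]

def fitness(genome):
    """Negative squared error between shaped and target levels."""
    err = 0.0
    for features, unshaped, target in EXAMPLES:
        shaped = shape(unshaped, gains_from_genome(genome, features))
        err += sum((s - t) ** 2 for s, t in zip(shaped, target))
    return -err

def evolve(generations=200, pop_size=20, sigma=0.3, seed=0):
    """Keep the best quarter each generation and refill with mutated copies."""
    rng = random.Random(seed)
    population = [[rng.gauss(0, 1) for _ in range(GENOME_LEN)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[:pop_size // 4]
        children = [[w + rng.gauss(0, sigma) for w in rng.choice(parents)]
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    best = max(population, key=fitness)
    return best, history + [fitness(best)]

best_genome, fitness_history = evolve()
```

Because the parents survive unchanged into the next generation, the best fitness in `fitness_history` never decreases. A real system would replace `EXAMPLES` with fitness measured on simulated operating conditions.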

    METHOD AND APPARATUS FOR ARTIFICIAL BANDWIDTH EXPANSION IN SPEECH PROCESSING
    3.
    Invention publication
    Status: under examination (published)

    Publication number: EP1581929A2

    Publication date: 2005-10-05

    Application number: EP04701060.8

    Filing date: 2004-01-09

    Applicant: Nokia Corporation

    IPC classification: G10L19/06

    CPC classification: G10L21/038 G10L25/93

    Abstract: A method and device for improving the quality of speech signals transmitted using an audio bandwidth between 300 Hz and 3.4 kHz. After the received speech signal is divided into frames, zeros are inserted between samples to double the sampling frequency, which creates aliased frequency components above the original band. The level of these aliased frequency components is adjusted using an adaptive algorithm based on the classification of the speech frame. Sound can be classified into sibilants and non-sibilants, and a non-sibilant sound can be further classified into a voiced sound and a stop consonant. The adjustment is based on parameters, such as the number of zero-crossings and energy distribution, computed from the spectrum of the up-sampled speech signal between 300 Hz and 3.4 kHz. A new sound with a bandwidth between 300 Hz and 7.7 kHz is obtained by inverse Fourier transforming the spectrum of the adjusted, up-sampled sound.
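
The zero-insertion step in this abstract can be checked numerically. The sketch below is only an illustration under assumed values (an 8 kHz narrowband sampling rate, a 64-sample frame, and a 1 kHz test tone, none of which come from the abstract). It inserts a zero after every sample and uses a naive DFT to show that the tone reappears mirrored about 4 kHz. A zero-crossing counter, one of the classification parameters the abstract names, is included as a helper.

```python
import cmath
import math

def insert_zeros(frame):
    """Double the sampling rate by placing a zero after every sample."""
    out = []
    for s in frame:
        out.extend((s, 0.0))
    return out

def dft_magnitudes(x):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short frames)."""
    n = len(x)
    return [abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
            for k in range(n // 2 + 1)]

def zero_crossings(frame):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))

FS_NARROW = 8000   # assumed narrowband sampling rate in Hz
N = 64             # assumed frame length in samples
TONE_HZ = 1000     # test tone inside the 300 Hz to 3.4 kHz band

narrow = [math.sin(2 * math.pi * TONE_HZ * t / FS_NARROW) for t in range(N)]
wide = insert_zeros(narrow)            # 128 samples at 16 kHz
mags = dft_magnitudes(wide)
bin_hz = 2 * FS_NARROW / len(wide)     # 125 Hz per bin

# The two strongest bins: the original tone and its mirror image
strongest = sorted(range(len(mags)), key=lambda k: mags[k], reverse=True)[:2]
peak_hz = [k * bin_hz for k in sorted(strongest)]
print(peak_hz)
```

The mirrored component at 7 kHz is the aliased content whose level the adaptive algorithm would then adjust per frame class before the inverse transform.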