Sound analysis and resynthesis using correlograms
    1.
    发明授权
    Sound analysis and resynthesis using correlograms 失效
    声音分析和重新合成使用相关图

    公开(公告)号:US5473759A

    公开(公告)日:1995-12-05

    申请号:US20785

    申请日:1993-02-22

    IPC分类号: G10L19/02 G10L25/18 G10L9/10

    CPC分类号: G10L19/02 G10L25/18

    摘要: A system for reconstructing a signal waveform from a correlogram is based upon the recognition that the information in each channel of the correlogram is equivalent to the magnitude of the Fourier transform of a signal. By estimating a signal on the basis of its Short-Time Fourier Transform Magnitude, each channel of information from a cochlear model can be reconstructed. Once this information is retrieved, a signal waveform can be resynthesized through inversion of the cochlear model. The process for reconstructing the cochlear model data can be optimized with the use of techniques for improving the initial estimate of the signal from the magnitude of its Fourier Transform, and by employing information that is known apriori about the signal during the estimation process, such as the characteristics of sound signals.

    摘要翻译: 用于从相关图重构信号波形的系统基于这样的认识:相关图的每个通道中的信息等于信号的傅立叶变换的幅度。 通过基于其短时傅里叶变换幅度估计信号,可以重构来自耳蜗模型的每个信道信道。 一旦检索到该信息,就可以通过人工耳蜗模型的反演来重新合成信号波形。 用于重建耳蜗模型数据的过程可以通过使用用于从其傅里叶变换的幅度改进信号的初始估计的技术以及通过在估计过程中使用关于信号的已知信息的技术来优化,例如 声音信号的特点。

    Method and apparatus for speech feature recognition based on models of
auditory signal processing
    2.
    发明授权
    Method and apparatus for speech feature recognition based on models of auditory signal processing 失效
    基于听觉信号处理模型的语音特征识别方法和装置

    公开(公告)号:US5381512A

    公开(公告)日:1995-01-10

    申请号:US903729

    申请日:1992-06-24

    CPC分类号: G10L15/02 G10L17/02

    摘要: A stimulus waveform is processed using a model of the human auditory system to provide a plurality of output waveforms. Each output waveform corresponds to excitation at different locations along the basilar membrane in the cochlea, and matches the narrow frequency bandwidth, short time response, and wave propagation characteristics of the human cochlea. Primary feature detection is achieved by comparing response waveforms and their spatial and time derivatives to predetermined stereotypes. Secondary feature detection is achieved by comparing spatial and temporal patterns of primary features with patterns stereotypical of human speech elements.

    摘要翻译: 使用人类听觉系统的模型来处理刺激波形以提供多个输出波形。 每个输出波形对应于耳蜗基底膜不同位置处的激发,匹配人耳蜗的窄频带宽,短时间响应和波传播特性。 通过将响应波形及其空间和时间导数与预定的刻板印象进行比较来实现主要特征检测。 通过将主要特征的空间和时间模式与人类语音元素的模式刻板比较来实现次要特征检测。

    Speech unit for dolls and other toys
    3.
    发明授权
    Speech unit for dolls and other toys 失效
    演唱单位娃娃等玩具

    公开(公告)号:US4809335A

    公开(公告)日:1989-02-28

    申请号:US790949

    申请日:1985-10-24

    申请人: Daniel S. Rumsey

    发明人: Daniel S. Rumsey

    IPC分类号: A63H3/28 G10L9/10

    CPC分类号: A63H3/28

    摘要: A speech unit for producing preselected words or phrases based on the orientation of a toy doll or figure. A gravity sensing means produces an output corresponding to the orientation of the sensing means with respect to gravity. The output of the sensing means is coupled to a speech synthesizer which produces an output based on transitions from one orientation of the sensing means to a second orientation. A timing circuit coupled to the sensing means establishes a time period during which the sensing means must maintain its orientation for an output to be realized. The timing means also is used to shut off power to the speech synthesizer and speaker means to conserve power of the circuit. In an alternate embodiment, the absolute position of the sensing means is used to select a speech output.

    摘要翻译: 一种用于根据玩具娃娃或人物的方向产生预选词或短语的语音单元。 重力感测装置产生对应于感测装置相对于重力的取向的输出。 感测装置的输出耦合到语音合成器,该语音合成器基于从感测装置的一个取向到第二取向的转换而产生输出。 耦合到感测装置的定时电路建立了一个时间段,在该时间段期间,感测装置必须保持其取向以便实现输出。 定时装置还用于切断语音合成器和扬声器装置的功率以节省电路的功率。 在替代实施例中,感测装置的绝对位置用于选择语音输出。