SOUND SOURCE SEPARATION METHOD AND SOUND SOURCE SEPARATION APPARATUS

    公开(公告)号:US20190115043A1

    公开(公告)日:2019-04-18

    申请号:US16118986

    申请日:2018-08-31

    申请人: HITACHI, LTD.

    IPC分类号: G10L21/0308 G10L21/0388

    CPC分类号: G10L21/0308 G10L21/0388

    摘要: There is provided a sound source separation method of carrying out sound source separation of an audio signal inputted from an input device by using a modeled sound source distribution, by an information processing apparatus provided with a processing device, a storage device, the input device, and an output device. In this method, as a condition followed by the model, sound sources are independent of one another, powers which the sound sources have are modeled for each of frequency bands obtained through band division, a relationship among the powers for the frequency bands different from each other is modeled by nonnegative matrix factorization, and components obtained through the division of the sound source follow a complex normal distribution.

    SOUND SIGNAL PROCESSING DEVICE, SOUND SIGNAL PROCESSING METHOD, AND PROGRAM
    9.
    发明申请
    SOUND SIGNAL PROCESSING DEVICE, SOUND SIGNAL PROCESSING METHOD, AND PROGRAM 有权
    声信号处理装置,声信号处理方法和程序

    公开(公告)号:US20170047079A1

    公开(公告)日:2017-02-16

    申请号:US15118239

    申请日:2015-01-27

    申请人: SONY CORPORATION

    发明人: ATSUO HIROE

    摘要: A device and a method for determining a speech segment with a high degree of accuracy from a sound signal in which different sounds coexist are provided. Directional points indicating the direction of arrival of the sound signal are connected in the temporal direction, and a speech segment is detected. In this configuration, pattern classification is performed in accordance with directional characteristics with respect to the direction of arrival, and a directionality pattern and a null beam pattern are generated from the classification results. Also, an average null beam pattern is also generated by calculating the average of the null beam patterns at a time when a non-speech-like signal is input. Further, a threshold that is set at a slightly lower value than the average null beam pattern is calculated as the threshold to be used in detecting the local minimum point corresponding to the direction of arrival from each null beam pattern, and a local minimum point equal to or lower than the threshold is determined to be the point corresponding to the direction of arrival.

    摘要翻译: 提供了一种从不同声音共存的声音信号确定高精确度的语音段的装置和方法。 指示声音信号的到达方向的方向点在时间方向上连接,并且检测到语音段。 在这种配置中,根据到达方向的方向特性执行模式分类,并且从分类结果生成方向性模式和零波束模式。 此外,通过计算在输入非语音状信号时的零波束图案的平均值也产生平均零波束图案。 此外,将被设置为比平均零波束图案稍低一点的阈值被计算为用于在从每个零波束图案检测与到达方向对应的局部最小点时使用的阈值,并且将局部最小点相等 确定为与到达方向对应的点。

    Microphone signal fusion
    10.
    发明授权
    Microphone signal fusion 有权
    麦克风信号融合

    公开(公告)号:US09401158B1

    公开(公告)日:2016-07-26

    申请号:US14853947

    申请日:2015-09-14

    摘要: Provided are systems and methods for microphone signal fusion. An example method commences with receiving a first and second signal representing sounds captured, respectively, by internal and external microphones. The second signal includes at least a voice component. The first signal and the voice component are modified by at least human tissue. The first and second signals are processed to obtain noise estimates. The first signal is aligned with the second signal. The second signal and the aligned first signal are blended based on the noise estimates to generate an enhanced voice signal. The internal microphone is located inside an ear canal and sealed for isolation from acoustic signals outside the ear canal. The external microphone is located outside the ear canal. All of parts of the processing, blending and aligning of the systems and method may be performed on a subband basis in the frequency domain.

    摘要翻译: 提供了用于麦克风信号融合的系统和方法。 示例性方法开始于分别由内部和外部麦克风接收代表所捕获的声音的第一和第二信号。 第二信号至少包括一个声音分量。 至少人体组织修改了第一个信号和声音部分。 处理第一和第二信号以获得噪声估计。 第一个信号与第二个信号对齐。 基于噪声估计来混合第二信号和对准的第一信号以产生增强的语音信号。 内部麦克风位于耳道内,并密封,以便与耳道外的声学信号隔离。 外部麦克风位于耳道外。 系统和方法的处理,混合和对准的所有部分可以在频域中基于子带进行。