IN-CAR COMMUNICATION SYSTEM FOR MULTIPLE ACOUSTIC ZONES
    2.
    发明申请
    IN-CAR COMMUNICATION SYSTEM FOR MULTIPLE ACOUSTIC ZONES 有权
    多声道区域的车内通信系统

    公开(公告)号:US20130179163A1

    公开(公告)日:2013-07-11

    申请号:US13346854

    申请日:2012-01-10

    IPC分类号: G10L15/20 H04B1/00

    摘要: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.

    摘要翻译: 车载通信(ICC)系统通过接收说话的乘客的语音信号并且为一个或多个听力乘客返回来支持汽车内的通信路径。 信号处理任务分为麦克风相关部分和扬声器相关部分。 适用于具有多个声学区域的车辆的声音处理系统包括耦合的多个麦克风车载通信(Mic-ICC)实例和多个扬声器车载通信(Ls-ICC)实例。 该系统还包括具有控制器并耦合到Mic-ICC实例的动态音频路由矩阵,耦合到多个Mic-ICC实例的混合器和耦合到Ls-ICC实例的分配器。

    Detecting barge-in in a speech dialogue system
    3.
    发明授权
    Detecting barge-in in a speech dialogue system 有权
    在语音对话系统中检测插入

    公开(公告)号:US09026438B2

    公开(公告)日:2015-05-05

    申请号:US12415927

    申请日:2009-03-31

    摘要: A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialog system configured to detect barge-in is also disclosed.

    摘要翻译: 一种用于检测语音对话系统中的插入的方法,包括:确定语音对话系统是否输出语音提示,以及基于语音活动检测器的时变灵敏度阈值检测语音活动是否存在于输入信号中 和/或基于说话者信息,其中如果确定了语音提示的输出,则在不确定语音提示的输出的情况下确定并减小了灵敏度阈值。 如果在输入信号中检测到语音活动,语音提示可能被中断或淡出。 还公开了一种配置成检测插入的语音对话系统。

    Speech signal enhancement using visual information
    4.
    发明授权
    Speech signal enhancement using visual information 有权
    使用视觉信息的语音信号增强

    公开(公告)号:US09293151B2

    公开(公告)日:2016-03-22

    申请号:US14352016

    申请日:2011-10-17

    摘要: Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.

    摘要翻译: 视觉信息用于改变或设置除波束形成器之外的音频信号处理器的操作参数。 数码相机拍摄有关包含人类扬声器和/或听众的场景的视觉信息。 分析视觉信息以确定关于房间声学的信息。 可以估计扬声器和麦克风之间的距离,并且该距离估计可以用于调整系统的整体增益。 可以估计扬声器,收听者,麦克风,扬声器和/或声音反射表面之间的距离和位置。 这些估计可以用于估计房间内的混响,并基于估计达到麦克风的直接到间接(混响)声能的估计比例来调节反混响滤波器的积极性。 此外,还可以估计扬声器或收听者相对于麦克风或扬声器的取向,并且该估计可用于调整频率依赖的滤波器权重以补偿来自口的声信号的不均匀频率传播, 或人的耳朵,关于人的头部。

    DETECTING BARGE-IN IN A SPEECH DIALOGUE SYSTEM
    5.
    发明申请
    DETECTING BARGE-IN IN A SPEECH DIALOGUE SYSTEM 有权
    检测语音对话系统中的边界

    公开(公告)号:US20090254342A1

    公开(公告)日:2009-10-08

    申请号:US12415927

    申请日:2009-03-31

    IPC分类号: G10L17/00 G10L15/20

    摘要: A method for detecting barge-in in a speech dialogue system comprising determining whether a speech prompt is output by the speech dialogue system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialogue system configured to detect barge-in is also disclosed.

    摘要翻译: 一种用于检测语音对话系统中的插入的方法,包括确定语音对话系统是否输出语音提示,以及基于语音活动检测器的时变灵敏度阈值来检测语音活动是否存在于输入信号中 和/或基于说话者信息,其中如果确定了语音提示的输出,则在不确定语音提示的输出的情况下确定并减小了灵敏度阈值。 如果在输入信号中检测到语音活动,语音提示可能被中断或淡出。 还公开了一种配置成检测插入的语音对话系统。

    Adjusting or setting vehicle elements through speech control
    7.
    发明授权
    Adjusting or setting vehicle elements through speech control 有权
    通过语音控制调整或设置车辆元件

    公开(公告)号:US09580028B2

    公开(公告)日:2017-02-28

    申请号:US12241837

    申请日:2008-09-30

    摘要: A speech processing device includes an automotive device that filters data that is sent and received across an in-vehicle bus. The device selectively acquires vehicle data related to a user settings or adjustments. An interface acquires the selected vehicle data from in-vehicle sensors in response to a user's articulation of a first code phrase. A memory stores the selected vehicle data with unique identifying data associated with the user and establishes a connection between the selected vehicle data and the user when a second code phrase is articulated. A data interface provides access to the selected vehicle data and stored relationship data and enables the processing of the data to customize the in-vehicle system. The data interface is responsive to the user's articulation of a third code phrase to process the selected vehicle data that enables the setting or adjustment of the in-vehicle system.

    摘要翻译: 语音处理装置包括:汽车装置,其对通过车载总线发送和接收的数据进行滤波。 该设备选择性地获取与用户设置或调整相关的车辆数据。 响应于用户对第一代码短语的表达,接口从车载传感器中获取所选择的车辆数据。 存储器使用与用户相关联的唯一识别数据存储所选择的车辆数据,并且当第二代码短语被铰接时,建立所选择的车辆数据与用户之间的连接。 数据接口提供对所选择的车辆数据和存储的关系数据的访问,并使数据的处理能够定制车载系统。 数据接口响应于用户对第三代码短语的表达,以处理能够设置或调整车载系统的所选车辆数据。

    DYNAMIC MICROPHONE SIGNAL MIXER
    9.
    发明申请
    DYNAMIC MICROPHONE SIGNAL MIXER 审中-公开
    动态麦克风信号混频器

    公开(公告)号:US20130325458A1

    公开(公告)日:2013-12-05

    申请号:US13990176

    申请日:2010-11-29

    IPC分类号: G10L21/0208

    摘要: A system and method of signal combining that supports different speakers in a noisy environment is provided. Particularly for deviations in the noise characteristics among the channels, various embodiments ensure a smooth transition of the background noise at speaker changes. A modified noise reduction (NR) may achieve equivalent background noise characteristics for all channels by applying a dynamic, channel specific, and frequency dependent maximum attenuation. The reference characteristics for adjusting the background noise may be specified by the dominant speaker channel. In various embodiments, an automatic gain control (AGC) with a dynamic target level may ensure similar speech signal levels in all channels.

    摘要翻译: 提供了在嘈杂环境中支持不同扬声器的信号组合系统和方法。 特别是对于通道之间的噪声特性的偏差,各种实施例确保扬声器变化时背景噪声的平滑过渡。 修改的噪声降低(NR)可以通过应用动态,信道特定和频率相关的最大衰减来实现所有信道的等效背景噪声特性。 用于调整背景噪声的参考特性可以由主扬声器通道指定。 在各种实施例中,具有动态目标电平的自动增益控制(AGC)可以确保所有信道中的类似语音信号电平。

    Method for determining a time delay for time delay compensation
    10.
    发明授权
    Method for determining a time delay for time delay compensation 有权
    用于确定时间延迟补偿的时间延迟的方法

    公开(公告)号:US08238574B2

    公开(公告)日:2012-08-07

    申请号:US12636160

    申请日:2009-12-11

    IPC分类号: H04R3/00

    摘要: The invention provides a computer-implemented method for determining a time delay for time delay compensation of a microphone signal from a microphone array in a beamformer arrangement. For a given time, an instantaneous estimate of a position of a wanted sound source and/or of a direction of arrival of a signal originating from the wanted sound source is determined. The computer system then determines whether the instantaneous estimate deviates from a preset estimate of a position of the wanted sound source and/or of a direction of arrival of a signal originating from the wanted sound source according to a predetermined criterion. The predetermined criterion comprises a check whether the instantaneous estimate deviates from the preset estimate by at least a predetermined deviation threshold. If the predetermined criterion is fulfilled, the instantaneous estimate for the given time is set by the computer system as the preset estimate, and the computer system determines the time delay for time delay compensation of the microphone signal based on the instantaneous estimate.

    摘要翻译: 本发明提供了一种计算机实现的方法,用于确定来自波束形成器布置中的麦克风阵列的麦克风信号的时间延迟的时间延迟。 对于给定时间,确定所需声源的位置和/或源自所需声源的信号的到达方向的瞬时估计。 计算机系统然后根据预定标准确定瞬时估计是否偏离预期的所需声源的位置的预设估计和/或源自所需声源的信号的到达方向。 预定标准包括检查瞬时估计是否偏离预设估计至少预定的偏差阈值。 如果满足预定标准,则由计算机系统将给定时间的瞬时估计值设置为预设估计,并且计算机系统基于瞬时估计确定麦克风信号的时间延迟补偿的时间延迟。