IN-CAR COMMUNICATION SYSTEM FOR MULTIPLE ACOUSTIC ZONES
    2.
    发明申请
    IN-CAR COMMUNICATION SYSTEM FOR MULTIPLE ACOUSTIC ZONES 有权
    多声道区域的车内通信系统

    公开(公告)号:US20130179163A1

    公开(公告)日:2013-07-11

    申请号:US13346854

    申请日:2012-01-10

    IPC分类号: G10L15/20 H04B1/00

    摘要: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.

    摘要翻译: 车载通信(ICC)系统通过接收说话的乘客的语音信号并且为一个或多个听力乘客返回来支持汽车内的通信路径。 信号处理任务分为麦克风相关部分和扬声器相关部分。 适用于具有多个声学区域的车辆的声音处理系统包括耦合的多个麦克风车载通信(Mic-ICC)实例和多个扬声器车载通信(Ls-ICC)实例。 该系统还包括具有控制器并耦合到Mic-ICC实例的动态音频路由矩阵,耦合到多个Mic-ICC实例的混合器和耦合到Ls-ICC实例的分配器。

    Detecting barge-in in a speech dialogue system
    3.
    发明授权
    Detecting barge-in in a speech dialogue system 有权
    在语音对话系统中检测插入

    公开(公告)号:US09026438B2

    公开(公告)日:2015-05-05

    申请号:US12415927

    申请日:2009-03-31

    摘要: A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialog system configured to detect barge-in is also disclosed.

    摘要翻译: 一种用于检测语音对话系统中的插入的方法,包括:确定语音对话系统是否输出语音提示,以及基于语音活动检测器的时变灵敏度阈值检测语音活动是否存在于输入信号中 和/或基于说话者信息,其中如果确定了语音提示的输出,则在不确定语音提示的输出的情况下确定并减小了灵敏度阈值。 如果在输入信号中检测到语音活动,语音提示可能被中断或淡出。 还公开了一种配置成检测插入的语音对话系统。

    Speech signal enhancement using visual information
    4.
    发明授权
    Speech signal enhancement using visual information 有权
    使用视觉信息的语音信号增强

    公开(公告)号:US09293151B2

    公开(公告)日:2016-03-22

    申请号:US14352016

    申请日:2011-10-17

    摘要: Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.

    摘要翻译: 视觉信息用于改变或设置除波束形成器之外的音频信号处理器的操作参数。 数码相机拍摄有关包含人类扬声器和/或听众的场景的视觉信息。 分析视觉信息以确定关于房间声学的信息。 可以估计扬声器和麦克风之间的距离,并且该距离估计可以用于调整系统的整体增益。 可以估计扬声器,收听者,麦克风,扬声器和/或声音反射表面之间的距离和位置。 这些估计可以用于估计房间内的混响,并基于估计达到麦克风的直接到间接(混响)声能的估计比例来调节反混响滤波器的积极性。 此外,还可以估计扬声器或收听者相对于麦克风或扬声器的取向,并且该估计可用于调整频率依赖的滤波器权重以补偿来自口的声信号的不均匀频率传播, 或人的耳朵,关于人的头部。

    DETECTING BARGE-IN IN A SPEECH DIALOGUE SYSTEM
    5.
    发明申请
    DETECTING BARGE-IN IN A SPEECH DIALOGUE SYSTEM 有权
    检测语音对话系统中的边界

    公开(公告)号:US20090254342A1

    公开(公告)日:2009-10-08

    申请号:US12415927

    申请日:2009-03-31

    IPC分类号: G10L17/00 G10L15/20

    摘要: A method for detecting barge-in in a speech dialogue system comprising determining whether a speech prompt is output by the speech dialogue system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialogue system configured to detect barge-in is also disclosed.

    摘要翻译: 一种用于检测语音对话系统中的插入的方法,包括确定语音对话系统是否输出语音提示,以及基于语音活动检测器的时变灵敏度阈值来检测语音活动是否存在于输入信号中 和/或基于说话者信息,其中如果确定了语音提示的输出,则在不确定语音提示的输出的情况下确定并减小了灵敏度阈值。 如果在输入信号中检测到语音活动,语音提示可能被中断或淡出。 还公开了一种配置成检测插入的语音对话系统。

    Speech Recognition
    7.
    发明申请
    Speech Recognition 有权
    语音识别

    公开(公告)号:US20100057462A1

    公开(公告)日:2010-03-04

    申请号:US12552517

    申请日:2009-09-02

    IPC分类号: G10L15/14 G10L15/06 G10L15/00

    摘要: The present invention relates to a method for speech recognition of a speech signal comprising the steps of providing at least one codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted such that higher weights are assigned to entries corresponding to frequencies below a predetermined level than to entries corresponding to frequencies above the predetermined level and processing the speech signal for speech recognition comprising extracting at least one feature vector from the speech signal and matching the feature vector with the entries of the codebook.

    摘要翻译: 本发明涉及一种用于语音信号的语音识别的方法,包括以下步骤:提供至少一个码本,其中包括码本条目,特别是特征向量的多元高斯,频率加权,使得较高权重被分配给对应于 频率低于对应于高于预定电平的频率的条目,并且处理用于语音识别的语音信号,包括从语音信号中提取至少一个特征向量并使特征向量与码本的条目相匹配。

    Method for Adapting a Codebook for Speech Recognition
    8.
    发明申请
    Method for Adapting a Codebook for Speech Recognition 有权
    适应语音识别码本的方法

    公开(公告)号:US20100138222A1

    公开(公告)日:2010-06-03

    申请号:US12622717

    申请日:2009-11-20

    IPC分类号: G10L15/06

    CPC分类号: G10L15/065

    摘要: A method for adapting a codebook for speech recognition, wherein the codebook is from a set of codebooks comprising a speaker-independent codebook and at least one speaker-dependent codebook is disclosed. A speech input is received and a feature vector based on the received speech input is determined. For each of the Gaussian densities, a first mean vector is estimated using an expectation process and taking into account the determined feature vector. For each of the Gaussian densities, a second mean vector using an Eigenvoice adaptation is determined taking into account the determined feature vector. For each of the Gaussian densities, the mean vector is set to a convex combination of the first and the second mean vector. Thus, this process allows for adaptation during operation and does not require a lengthy training phase.

    摘要翻译: 一种用于适应用于语音识别的码本的方法,其中所述码本来自包括与扬声器无关的码本和至少一个与扬声器相关的码本的码本集合。 接收到语音输入,并且确定基于所接收的语音输入的特征向量。 对于每个高斯密度,使用期望过程并且考虑确定的特征向量来估计第一平均向量。 对于每个高斯密度,使用特征语音适配的第二平均向量被确定,其考虑所确定的特征向量。 对于每个高斯密度,将平均矢量设置为第一和第二平均矢量的凸组合。 因此,该过程允许在操作期间的适应并且不需要冗长的训练阶段。

    Method for determining the presence of a wanted signal component
    9.
    发明授权
    Method for determining the presence of a wanted signal component 有权
    用于确定有用信号分量的存在的方法

    公开(公告)号:US09530432B2

    公开(公告)日:2016-12-27

    申请号:US12507444

    申请日:2009-07-22

    IPC分类号: G10L25/78 G10L15/22

    摘要: This invention provides a method for determining, in a speech dialog system issuing speech prompts, a score value as an indicator for the presence of a wanted signal component in an input signal stemming from a microphone, comprising the steps of: using a first likelihood function to determine a first likelihood value for the presence of the wanted signal component in the input signal, using a second likelihood function to determine a second likelihood value for the presence of a noise signal component in the input signal, and determining a score value based on the first and the second likelihood values, wherein the first likelihood function is based on a predetermined reference wanted signal, and the second likelihood function is based on a predetermined reference noise signal.

    摘要翻译: 本发明提供了一种在发出语音提示的语音对话系统中确定得分值作为来自麦克风的输入信号中有用信号分量的存在的指标的方法,包括以下步骤:使用第一似然函数 使用第二似然函数来确定用于在输入信号中存在噪声信号分量的第二似然值,并且基于该输入信号确定分数值来确定用于输入信号中有用信号分量的存在的第一似然值 第一似然值和第二似然值,其中第一似然函数基于预定的参考有用信号,第二似然函数基于预定的参考噪声信号。

    Method for adapting a codebook for speech recognition
    10.
    发明授权
    Method for adapting a codebook for speech recognition 有权
    适用于语音识别的码本的方法

    公开(公告)号:US08346551B2

    公开(公告)日:2013-01-01

    申请号:US12622717

    申请日:2009-11-20

    IPC分类号: G10L15/06

    CPC分类号: G10L15/065

    摘要: A method for adapting a codebook for speech recognition, wherein the codebook is from a set of codebooks comprising a speaker-independent codebook and at least one speaker dependent codebook. A speech input is received and a feature vector based on the received speech input is determined. For each of the Gaussian densities, a first mean vector is estimated using an expectation process and taking into account the determined feature vector. For each of the Gaussian densities, a second mean vector using an Eigenvoice adaptation is determined taking into account the determined feature vector. For each of the Gaussian densities, the mean vector is set to a convex combination of the first and the second mean vector. Thus, this process allows for adaptation during operation and does not require a lengthy training phase.

    摘要翻译: 一种用于调整用于语音识别的码本的方法,其中码本来自包括与扬声器无关的码本和至少一个扬声器相关码本的一组码本。 接收到语音输入,并且确定基于所接收的语音输入的特征向量。 对于每个高斯密度,使用期望过程并且考虑确定的特征向量来估计第一平均向量。 对于每个高斯密度,使用特征语音适配的第二平均向量被确定,其考虑所确定的特征向量。 对于每个高斯密度,将平均矢量设置为第一和第二平均矢量的凸组合。 因此,该过程允许在操作期间的适应并且不需要冗长的训练阶段。