System and Method for an Endpoint Detection of Speech for Improved Speech Recognition in Noisy Environments
    2.
    Invention application
    Status: under examination (published)

    Publication No.: US20120191455A1

    Publication date: 2012-07-26

    Application No.: US13438715

    Filing date: 2012-04-03

    IPC classification: G10L15/00

    CPC classification: G10L25/87

    Abstract: According to a disclosed embodiment, an endpointer determines the background energy of a first portion of a speech signal, and a cepstral computing module extracts one or more features of the first portion. The endpointer calculates an average distance of the first portion based on the features. Subsequently, an energy computing module measures the energy of a second portion of the speech signal, and the cepstral computing module extracts one or more features of the second portion. Based on the features of the second portion, the endpointer calculates a distance of the second portion. Thereafter, the endpointer contrasts the energy of the second portion with the background energy of the first portion, and compares the distance of the second portion with the distance of the first portion. The second portion of the speech signal is classified by the endpointer as speech or non-speech based on the contrast and the comparison.
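
The two-stage decision described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `Frame`, `BackgroundModel`, `estimate_background` and `is_speech` are hypothetical, as are the thresholds `energy_margin` and `distance_ratio`. The sketch also assumes that "distance" means Euclidean distance to the mean feature vector of the first portion, and that the energy contrast and the distance comparison must both indicate speech. The abstract specifies none of these details.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Frame:
    energy: float              # assumed to come from an energy computing module
    features: Sequence[float]  # assumed to be cepstral coefficients


@dataclass
class BackgroundModel:
    energy: float         # background energy of the first portion
    centroid: List[float]  # mean feature vector of the first portion
    avg_distance: float    # average distance of first-portion frames to the centroid


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def estimate_background(first_portion: List[Frame]) -> BackgroundModel:
    """Summarise the first portion (assumed to contain background only)."""
    n = len(first_portion)
    centroid = [sum(col) / n for col in zip(*(f.features for f in first_portion))]
    energy = sum(f.energy for f in first_portion) / n
    avg_distance = sum(euclidean(f.features, centroid) for f in first_portion) / n
    return BackgroundModel(energy, centroid, avg_distance)


def is_speech(frame: Frame, bg: BackgroundModel,
              energy_margin: float = 2.0, distance_ratio: float = 2.0) -> bool:
    """Classify a second-portion frame as speech (True) or non-speech (False).

    Assumption: both tests must pass; the thresholds are illustrative only.
    """
    energy_exceeds = (frame.energy - bg.energy) > energy_margin
    distance = euclidean(frame.features, bg.centroid)
    distance_exceeds = distance > distance_ratio * bg.avg_distance
    return energy_exceeds and distance_exceeds
```

Requiring both tests to pass is one plausible way to reject loud noise whose features still resemble the background, which energy-only endpointing would misclassify.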

    Methods and apparatus for performing speaker independent recognition of commands in parallel with speaker dependent recognition of names, words or phrases
    3.
    Reissue patent
    Status: in force

    Publication No.: USRE38101E1

    Publication date: 2003-04-29

    Application No.: US09505103

    Filing date: 2000-02-16

    IPC classification: G10L9/08

    Abstract: Methods and apparatus for activating telephone services in response to speech are described. A directory including names is maintained for each customer. A speaker dependent speech template and a telephone number for each name, is maintained as part of each customer's directory. Speaker independent speech templates are used for recognizing commands. The present invention has the advantage of permitting a customer to place a call by speaking a person's name which serves as a destination identifier without having to speak an additional command or steering word to place the call. This is achieved by treating the receipt of a spoken name in the absence of a command as an implicit command to place a call. Explicit speaker independent commands are used to invoke features or services other than call placement. Speaker independent and speaker dependent speech recognition are performed on a customer's speech in parallel. An arbiter is used to decide which function or service should be performed when an apparent conflict arises as a result of both the speaker dependent and speaker independent speech recognition step outputs. Stochastic grammars, word spotting and/or out-of-vocabulary rejection are used as part of the speech recognition process to provide a user friendly interface which permits the use of spontaneous speech. Voice verification is performed on a selective basis where security is of concern.
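
The arbitration step in this abstract can be sketched as follows. The sketch is illustrative only. The types `Hypothesis` and `Action`, the function `arbitrate`, the `min_score` threshold and the rule "the higher-scoring recognizer wins a conflict" are all assumptions, since the abstract does not say how the arbiter resolves conflicts. What the sketch does take from the abstract is that a recognized name with no accompanying command is treated as an implicit command to place a call.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Hypothesis:
    label: str    # recognized command word or directory name
    score: float  # hypothetical confidence, higher is better


@dataclass
class Action:
    kind: str                     # "place_call", "invoke_service" or "reject"
    target: Optional[str] = None  # telephone number or service name


def arbitrate(command: Optional[Hypothesis],
              name: Optional[Hypothesis],
              directory: Dict[str, str],
              min_score: float = 0.5) -> Action:
    """Combine the outputs of the two recognizers run in parallel.

    command: result of speaker-independent command recognition (or None)
    name:    result of speaker-dependent name recognition (or None)
    directory: the customer's name -> telephone number directory
    """
    cmd_ok = command is not None and command.score >= min_score
    name_ok = (name is not None and name.score >= min_score
               and name.label in directory)

    if cmd_ok and name_ok:
        # Apparent conflict: assumed rule, keep the higher-scoring result.
        if command.score >= name.score:
            return Action("invoke_service", command.label)
        return Action("place_call", directory[name.label])
    if name_ok:
        # A name without a command is an implicit command to place a call.
        return Action("place_call", directory[name.label])
    if cmd_ok:
        return Action("invoke_service", command.label)
    return Action("reject")
```

The `"reject"` branch stands in for the out-of-vocabulary rejection the abstract mentions.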

    System and method for an endpoint detection of speech for improved speech recognition in noisy environments
    4.
    Granted invention patent
    Status: lapsed

    Publication No.: US08175876B2

    Publication date: 2012-05-08

    Application No.: US12459168

    Filing date: 2009-06-25

    IPC classification: G10L17/00

    CPC classification: G10L25/87

    Abstract: According to a disclosed embodiment, an endpointer determines the background energy of a first portion of a speech signal, and a cepstral computing module extracts one or more features of the first portion. The endpointer calculates an average distance of the first portion based on the features. Subsequently, an energy computing module measures the energy of a second portion of the speech signal, and the cepstral computing module extracts one or more features of the second portion. Based on the features of the second portion, the endpointer calculates a distance of the second portion. Thereafter, the endpointer contrasts the energy of the second portion with the background energy of the first portion, and compares the distance of the second portion with the distance of the first portion. The second portion of the speech signal is classified by the endpointer as speech or non-speech based on the contrast and the comparison.

    System and method for an endpoint detection of speech for improved speech recognition in noisy environments
    5.
    Invention application
    Status: lapsed

    Publication No.: US20100030559A1

    Publication date: 2010-02-04

    Application No.: US12459168

    Filing date: 2009-06-25

    IPC classification: G10L17/00 G10L15/00

    CPC classification: G10L25/87

    Abstract: According to a disclosed embodiment, an endpointer determines the background energy of a first portion of a speech signal, and a cepstral computing module extracts one or more features of the first portion. The endpointer calculates an average distance of the first portion based on the features. Subsequently, an energy computing module measures the energy of a second portion of the speech signal, and the cepstral computing module extracts one or more features of the second portion. Based on the features of the second portion, the endpointer calculates a distance of the second portion. Thereafter, the endpointer contrasts the energy of the second portion with the background energy of the first portion, and compares the distance of the second portion with the distance of the first portion. The second portion of the speech signal is classified by the endpointer as speech or non-speech based on the contrast and the comparison.

    System and method for a endpoint detection of speech for improved speech recognition in noisy environments
    6.
    Granted invention patent
    Status: lapsed

    Publication No.: US07277853B1

    Publication date: 2007-10-02

    Application No.: US09948331

    Filing date: 2001-09-05

    IPC classification: G10L17/00

    CPC classification: G10L25/87

    Abstract: According to a disclosed embodiment, an endpointer determines the background energy of a first portion of a speech signal, and a cepstral computing module extracts one or more features of the first portion. The endpointer calculates an average distance of the first portion based on the features. Subsequently, an energy computing module measures the energy of a second portion of the speech signal, and the cepstral computing module extracts one or more features of the second portion. Based on the features of the second portion, the endpointer calculates a distance of the second portion. Thereafter, the endpointer contrasts the energy of the second portion with the background energy of the first portion, and compares the distance of the second portion with the distance of the first portion. The second portion of the speech signal is classified by the endpointer as speech or non-speech based on the contrast and the comparison.

    Methods and apparatus for activating telephone services in response to speech
    7.
    Granted invention patent
    Status: lapsed

    Publication No.: US5719921A

    Publication date: 1998-02-17

    Application No.: US609029

    Filing date: 1996-02-29

    Abstract: Methods and apparatus for activating telephone services in response to speech are described. A directory including names is maintained for each customer. A speaker dependent speech template and a telephone number for each name, is maintained as part of each customer's directory. Speaker independent speech templates are used for recognizing commands. The present invention has the advantage of permitting a customer to place a call by speaking a person's name which serves as a destination identifier without having to speak an additional command or steering word to place the call. This is achieved by treating the receipt of a spoken name in the absence of a command as an implicit command to place a call. Explicit speaker independent commands are used to invoke features or services other than call placement. Speaker independent and speaker dependent speech recognition are performed on a customer's speech in parallel. An arbiter is used to decide which function or service should be performed when an apparent conflict arises as a result of both the speaker dependent and speaker independent speech recognition step outputs. Stochastic grammars, word spotting and/or out-of-vocabulary rejection are used as part of the speech recognition process to provide a user friendly interface which permits the use of spontaneous speech. Voice verification is performed on a selective basis where security is of concern.