System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    1.
    发明授权
    System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies 有权
    用于具有大词汇的自动语音识别的声学和语言建模的系统和方法

    公开(公告)号:US06928404B1

    公开(公告)日:2005-08-09

    申请号:US09271469

    申请日:1999-03-17

    摘要: Systems and methods are provided for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms. One method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms includes partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms, in at least one the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components and generating a language component vocabulary VC including word forms and word form components. The resulting language component vocabulary, which includes word forms and word components, is used to generate a language model that can be efficiently implemented for real-time automatic speech recognition applications for languages with large vocabularies.

    摘要翻译: 提供了用于为具有多个单词形式的语言词汇V的语音识别系统生成语言组件词汇VC的系统和方法。 用于生成具有多个单词形式的语言词汇V的语音识别系统的语言组件词汇VC的一种方法包括至少基于各个单词形式的出现频率将语言词汇V划分成单词形式的子集 一个子集,分裂词形式具有小于阈值的频率,从而生成单词形式分量并生成包括单词形式和单词形式分量的语言组成词汇VC。 所产生的包括单词形式和单词组成的语言组件词汇用于生成语言模型,该语言模型可以有效地实现用于具有大词汇的语言的实时自动语音识别应用。

    System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    2.
    发明授权
    System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies 有权
    用于具有大词汇的自动语音识别的声学和语言建模的系统和方法

    公开(公告)号:US07801727B2

    公开(公告)日:2010-09-21

    申请号:US11064643

    申请日:2005-02-24

    IPC分类号: G10L15/04

    摘要: A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.

    摘要翻译: 公开了一种用于生成具有多个单词形式的语言词汇V的语音识别系统的语言组件词汇VC的方法。 该方法包括:基于各个词形式的出现频率将语言词汇V划分成单词形式的子集; 并且在至少一个子集中,分割具有小于阈值的频率的字形式,从而生成词形分量。 还公开了一种用于语音识别的方法,包括:将包含基本形式的声学词汇分解成基本形式组件并存储基本形式组件; 并且对基本形式组件执行声音拼写映射,以便生成用于语音后续解码中的字部分表的基本形式分量。 一种使用语言模型分量和声学分量对语音发音进行解码的方法,包括以下步骤:从发音中产生一叠基础分量路径; 当级联的基本形式组件对应于在声学词汇中发现的基础形式时,将路径中的基本形式组件连接以生成级联的基本形式; 将连接的基本形式映射为单词; 与使用语言模型的单词相关联的计算语言模型(LM)得分,并且基于此进行对话语的进一步解码。

    System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    3.
    发明申请
    System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies 有权
    用于具有大词汇的自动语音识别的声学和语言建模的系统和方法

    公开(公告)号:US20050143972A1

    公开(公告)日:2005-06-30

    申请号:US11064643

    申请日:2005-02-24

    IPC分类号: G06F17/27 G10L15/18 G06F17/21

    摘要: A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.

    摘要翻译: 公开了一种用于生成具有多个单词形式的语言词汇V的语音识别系统的语言组件词汇VC的方法。 该方法包括:基于各个词形式的出现频率将语言词汇V划分成单词形式的子集; 并且在至少一个子集中,分割具有小于阈值的频率的字形式,从而生成词形分量。 还公开了一种用于语音识别的方法,包括:将包含基本形式的声学词汇分解成基本形式组件并存储基本形式组件; 并且对基本形式组件执行声音拼写映射,以便生成用于语音后续解码中的字部分表的基本形式分量。 一种使用语言模型分量和声学分量对语音发音进行解码的方法,包括以下步骤:从发音中产生一叠基础分量路径; 当级联的基本形式组件对应于在声学词汇中发现的基础形式时,将路径中的基本形式组件连接以生成级联的基本形式; 将连接的基本形式映射为单词; 与使用语言模型的单词相关联的计算语言模型(LM)得分,并且基于此进行对话语的进一步解码。

    Seatbelt microphone mounting
    4.
    发明授权
    Seatbelt microphone mounting 有权
    安全带麦克风安装

    公开(公告)号:US06438247B1

    公开(公告)日:2002-08-20

    申请号:US09239328

    申请日:1999-01-28

    IPC分类号: H04R2500

    CPC分类号: H04R1/08

    摘要: A microphone bearing slider on a diagonal seatbelt member, together with a tethering tape that is positioned along the diagonal seatbelt, from a seatbelt hanger member to the buckle with attachment to the slider, in combination, operate to position the microphone at the same precise location for vocal transmission at each deployment, and to return the assembly to a storage position with no addition attention being required on the part of the communicating person.

    摘要翻译: 一对角安全带构件上的麦克风支承滑块以及沿对角座椅安全带定位的系带带组合起来,从安全带挂钩构件到带扣,并联在滑块上,组合起作用,将麦克风定位在相同的精确位置 在每个部署时进行声带传输,并且将组件返回到存储位置,而不需要通信人的部分。

    Conversational computing via conversational virtual machine
    5.
    发明授权
    Conversational computing via conversational virtual machine 有权
    通过对话虚拟机进行会话计算

    公开(公告)号:US07729916B2

    公开(公告)日:2010-06-01

    申请号:US11551901

    申请日:2006-10-23

    IPC分类号: G10L15/22 G10L15/28

    摘要: A conversational computing system that provides a universal coordinated multi-modal conversational user interface (CUI) 10 across a plurality of conversationally aware applications (11) (i.e., applications that “speak” conversational protocols) and conventional applications (12). The conversationally aware applications (11) communicate with a conversational kernel (14) via conversational application APIs (13). The conversational kernel 14 controls the dialog across applications and devices (local and networked) on the basis of their registered conversational capabilities and requirements and provides a unified conversational user interface and conversational services and behaviors. The conversational computing system may be built on top of a conventional operating system and APIs (15) and conventional device hardware (16). The conversational kernel (14) handles all I/O processing and controls conversational engines (18). The conversational kernel (14) converts voice requests into queries and converts outputs and results into spoken messages using conversational engines (18) and conversational arguments (17). The conversational application API (13) conveys all the information for the conversational kernel (14) to transform queries into application calls and conversely convert output into speech, appropriately sorted before being provided to the user.

    摘要翻译: 一种对话计算系统,其跨越多个会话感知应用(11)(即,“说”对话协议的应用)和常规应用(12)提供通用协调多模态对话用户界面(CUI)10。 对话感知应用(11)通过对话应用API(13)与对话内核(14)通信。 会话核心14基于其注册的对话能力和需求来控制应用和设备(本地和网络)之间的对话,并提供统一的对话用户界面和对话服务和行为。 对话计算系统可以构建在常规操作系统和API(15)和常规设备硬件(16)之上。 对话内核(14)处理所有I / O处理和控制对话引擎(18)。 会话内核(14)将语音请求转换为查询,并将会话引擎(18)和会话参数(17)将输出和结果转换为口语消息。 对话应用程序API(13)传达对话内核(14)的所有信息,以将查询转换成应用程序调用,并相反地将输出转换为语音,在提供给用户之前进行适当排序。

    RESOURCE CONFIGURATION IN MULTI-MODAL DISTRIBUTED COMPUTING SYSTEMS
    6.
    发明申请
    RESOURCE CONFIGURATION IN MULTI-MODAL DISTRIBUTED COMPUTING SYSTEMS 有权
    多模式分布式计算系统中的资源配置

    公开(公告)号:US20090094451A1

    公开(公告)日:2009-04-09

    申请号:US12272597

    申请日:2008-11-17

    IPC分类号: G06F1/24

    摘要: A method and system for configuring available resources in real-time to automatically accommodate the needs of the system user in multi-modal distributed computing system is disclosed. Information about the location or environment of a wireless device is used, preferably in combination with user personal preferences and past history to modify the behavior of the wireless device, including the selection of the most appropriate mode of interaction with the device and the activation of applications thereon as appropriate.

    摘要翻译: 公开了一种实时配置可用资源以自动适应多模态分布式计算系统中系统用户需求的方法和系统。 使用关于无线设备的位置或环境的信息,优选地结合用户个人偏好和过去历史来修改无线设备的行为,包括选择与设备的最合适的交互模式以及激活应用 在适当的情况下。

    Resource configuration in multi-modal distributed computing systems
    7.
    发明授权
    Resource configuration in multi-modal distributed computing systems 有权
    多模式分布式计算系统中的资源配置

    公开(公告)号:US07454608B2

    公开(公告)日:2008-11-18

    申请号:US10698101

    申请日:2003-10-31

    IPC分类号: G06R15/177 G10L15/00

    摘要: A method and system for configuring available resources in real-time to automatically accommodate the needs of the system user in multi-modal distributed computing system is disclosed. Information about the location or environment of a wireless device is used, preferably in combination with user personal preferences and past history to modify the behavior of the wireless device, including the selection of the most appropriate mode of interaction with the device and the activation of applications thereon as appropriate.

    摘要翻译: 公开了一种实时配置可用资源以自动适应多模态分布式计算系统中系统用户需求的方法和系统。 使用关于无线设备的位置或环境的信息,优选地结合用户个人偏好和过去历史来修改无线设备的行为,包括选择与设备的最合适的交互模式以及激活应用 在适当的情况下。

    Resource configuration in multi-modal distributed computing systems
    8.
    发明授权
    Resource configuration in multi-modal distributed computing systems 有权
    多模式分布式计算系统中的资源配置

    公开(公告)号:US07984287B2

    公开(公告)日:2011-07-19

    申请号:US12272597

    申请日:2008-11-17

    IPC分类号: G06F15/177 G06F9/24 G10L15/00

    摘要: A method and system for configuring available resources in real-time to automatically accommodate the needs of the system user in multi-modal distributed computing system is disclosed. Information about the location or environment of a wireless device is used, preferably in combination with user personal preferences and past history to modify the behavior of the wireless device, including the selection of the most appropriate mode of interaction with the device and the activation of applications thereon as appropriate.

    摘要翻译: 公开了一种实时配置可用资源以自动适应多模态分布式计算系统中系统用户需求的方法和系统。 使用关于无线设备的位置或环境的信息,优选地结合用户个人偏好和过去历史来修改无线设备的行为,包括选择与设备的最合适的交互模式以及激活应用 在适当的情况下。

    CONVERSATIONAL COMPUTING VIA CONVERSATIONAL VIRTUAL MACHINE
    9.
    发明申请
    CONVERSATIONAL COMPUTING VIA CONVERSATIONAL VIRTUAL MACHINE 有权
    通过对话虚拟机对话计算

    公开(公告)号:US20070043574A1

    公开(公告)日:2007-02-22

    申请号:US11551901

    申请日:2006-10-23

    IPC分类号: G10L21/00

    摘要: A conversational computing system that provides a universal coordinated multi-modal conversational user interface (CUI) 10 across a plurality of conversationally aware applications (11) (i.e., applications that “speak” conversational protocols) and conventional applications (12). The conversationally aware applications (11) communicate with a conversational kernel (14) via conversational application APIs (13). The conversational kernel 14 controls the dialog across applications and devices (local and networked) on the basis of their registered conversational capabilities and requirements and provides a unified conversational user interface and conversational services and behaviors. The conversational computing system may be built on top of a conventional operating system and APIs (15) and conventional device hardware (16). The conversational kernel (14) handles all I/O processing and controls conversational engines (18). The conversational kernel (14) converts voice requests into queries and converts outputs and results into spoken messages using conversational engines (18) and conversational arguments (17). The conversational application API (13) conveys all the information for the conversational kernel (14) to transform queries into application calls and conversely convert output into speech, appropriately sorted before being provided to the user.

    摘要翻译: 一种对话计算系统,其跨越多个会话感知应用(11)(即,“说”对话协议的应用)和常规应用(12)提供通用协调多模态对话用户界面(CUI)10。 对话感知应用(11)通过对话应用API(13)与对话内核(14)通信。 会话核心14基于其注册的对话能力和需求来控制应用和设备(本地和网络)之间的对话,并提供统一的对话用户界面和对话服务和行为。 对话计算系统可以构建在常规操作系统和API(15)和常规设备硬件(16)之上。 对话内核(14)处理所有I / O处理和控制对话引擎(18)。 会话内核(14)将语音请求转换为查询,并将会话引擎(18)和会话参数(17)将输出和结果转换为口语消息。 对话应用程序API(13)传达对话内核(14)的所有信息,以将查询转换成应用程序调用,并相反地将输出转换为语音,在提供给用户之前进行适当排序。

    Electronic device connection resource management
    10.
    发明申请
    Electronic device connection resource management 有权
    电子设备连接资源管理

    公开(公告)号:US20050018623A1

    公开(公告)日:2005-01-27

    申请号:US10627823

    申请日:2003-07-25

    IPC分类号: H04L12/28 H04L12/56

    摘要: In a connection arrangement including two or more electronic devices, wherein information can be exchanged among the electronic devices through a plurality of communication links between the electronic devices, at least one of the electronic devices being configurable for communicating with a data source, a method for presenting a multi-channel message originating from the data source, the multi-channel message including a two or more components, includes the steps of allocating each of at least a portion of the components in the multi-channel message to at least one electronic device and, for each allocated component, determining possible communication paths between the data source and the at least one electronic device allocated to the corresponding component. The method further includes the steps of selecting, based at least in part on one or more selection criteria, at least one of the possible communication paths for the allocated components, each of the selected communication paths representing an optimal route between the data source and the at least one electronic device allocated to the corresponding component, and routing each of the allocated components in the multi-channel message according to the selected communication paths for presentation of the allocated components by the corresponding electronic device(s).

    摘要翻译: 在包括两个或更多个电子设备的连接装置中,其中可以通过电子设备之间的多个通信链路在电子设备之间交换信息,电子设备中的至少一个可配置为与数据源进行通信, 呈现源自数据源的多信道消息,包括两个或更多个组件的多信道消息包括以下步骤:将多信道消息中的组件的至少一部分中的每一个分配给至少一个电子设备 并且对于每个分配的组件,确定数据源和分配给相应组件的至少一个电子设备之间的可能的通信路径。 该方法还包括以下步骤:至少部分地基于一个或多个选择标准来选择所分配的组件的可能通信路径中的至少一个,所选择的通信路径中的每一个表示数据源和数据源之间的最佳路由 分配给相应组件的至少一个电子设备,以及根据所选择的通信路径来路由多信道消息中的所分配组件中的每一个,以便由相应的电子设备呈现所分配的组件。