Bubble splitting for compact acoustic modeling
    8.
    发明授权
    Bubble splitting for compact acoustic modeling 有权
    气泡分裂用于紧凑的声学建模

    公开(公告)号:US07328154B2

    公开(公告)日:2008-02-05

    申请号:US10639974

    申请日:2003-08-13

    IPC分类号: G10L15/00

    摘要: An improved method is provided for constructing compact acoustic models for use in a speech recognizer. The method includes: partitioning speech data from a plurality of training speakers according to at least one speech related criteria (i.e., vocal tract length); grouping together the partitioned speech data from training speakers having a similar speech characteristic; and training an acoustic bubble model for each group using the speech data within the group.

    摘要翻译: 提供了一种用于构建用于语音识别器中的紧凑声学模型的改进方法。 该方法包括:根据至少一个语音相关标准(即,声道长度)来分割来自多个训练说话者的语音数据; 将具有类似语音特征的训练说话者的分割语音数据分组在一起; 并使用组内的语音数据为每个组训练声音气泡模型。

    Multi-slot dialog systems and methods
    9.
    发明申请
    Multi-slot dialog systems and methods 有权
    多槽对话系统和方法

    公开(公告)号:US20070255566A1

    公开(公告)日:2007-11-01

    申请号:US11787763

    申请日:2007-04-17

    IPC分类号: G10L15/00

    CPC分类号: H04M3/4936 G10L15/22

    摘要: Systems and methods for constructing a series of interactions with a user to collect multiple pieces of related information for the purpose of accomplishing a specific goal or topic (a multi-slot dialog) using a component-based approach are disclosed. The method generally includes outputting a primary header prompt to elicit values for slots in a segment from the user, receiving a primary user response containing a value for each slot in at least a subset of the slots in the segment, processing the primary user response to determine at least one possible recognition value for each slot contained in the primary user response, filling each slot contained in the primary user response with a matched value selected from the corresponding possible recognition values, and repeating the outputting, receiving, processing and filling for any unfilled slots in the segment until all slots in the segment of slots are filled.

    摘要翻译: 公开了使用基于组件的方法来构建与用户进行一系列交互以收集多条相关信息以用于实现特定目标或主题(多时隙对话)的目的的系统和方法。 该方法通常包括输出主标题提示以从用户中引出段中的时隙的值,接收包含段中的时隙的至少一个子集中的每个时隙的值的主用户响应,处理主用户响应 确定主用户响应中包含的每个时隙的至少一个可能的识别值,用从相应的可能识别值中选择的匹配值填充主用户响应中包含的每个时隙,并重复输出,接收,处理和填充任何 段中的未填充插槽,直到插槽段中的所有插槽都被填充为止。

    Focused language models for improved speech input of structured documents
    10.
    发明授权
    Focused language models for improved speech input of structured documents 有权
    用于改进结构化文档语音输入的专注语言模型

    公开(公告)号:US06901364B2

    公开(公告)日:2005-05-31

    申请号:US09951093

    申请日:2001-09-13

    CPC分类号: G10L15/1815 G10L15/30

    摘要: An e-mail message process is provided for use with a personal digital assistant which allows for the use of input speech messaging which is converted to text using a focused language model which is downloaded by a cellular phone connection to an Internet server which provides the focused language model based upon a topic for the intended e-mail message. The text that is generated from the input speech method can be summarized by the e-mail message processor and can be edited by the user. The generated e-mail message can then be transmitted again via cellular connection to an Internet e-mail server for transmitting the e-mail message to a recipient.

    摘要翻译: 提供电子邮件消息处理以与个人数字助理一起使用,该个人数字助理允许使用输入语音消息传送,其使用由通过蜂窝电话连接下载的聚焦语言模型转换为文本,该互联网服务器提供聚焦 基于预期电子邮件的主题的语言模型。 从输入语音方法生成的文本可以由电子邮件消息处理器来总结,并且可以由用户编辑。 然后可以通过蜂窝连接再次将生成的电子邮件消息发送到Internet电子邮件服务器,以将电子邮件消息发送给接收者。