Speech to Text Conversion
    1.
    发明申请
    Speech to Text Conversion 有权
    演讲文字转换

    公开(公告)号:US20120022867A1

    公开(公告)日:2012-01-26

    申请号:US13249181

    申请日:2011-09-29

    IPC分类号: G10L15/26 G06F17/30

    摘要: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.

    摘要翻译: 描述了用于语音到文本转换的方法,计算机程序产品和系统。 从电子设备的用户接收语音输入,并且接收到在接收到语音输入时描述电子设备的上下文的语境元数据。 识别多个基本语言模型,其中每个基本语言模型对应于不同的文本语料库的内容。 使用上下文元数据,基于来自基本语言模型的贡献生成内插语言模型。 根据每个基本语言模型的加权来加权贡献。 内插语言模型用于将接收的语音输入转换为文本输出。 在远离电子设备的计算机服务器系统处接收语音输入。 文本输出被传送到电子设备。

    Speech to Text Conversion
    7.
    发明申请
    Speech to Text Conversion 审中-公开
    演讲文字转换

    公开(公告)号:US20110161080A1

    公开(公告)日:2011-06-30

    申请号:US12976972

    申请日:2010-12-22

    IPC分类号: G10L15/26 G06F17/30

    摘要: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.

    摘要翻译: 描述了用于语音到文本转换的方法,计算机程序产品和系统。 从电子设备的用户接收语音输入,并且接收到在接收到语音输入时描述电子设备的上下文的语境元数据。 识别多个基本语言模型,其中每个基本语言模型对应于不同的文本语料库的内容。 使用上下文元数据,基于来自基本语言模型的贡献生成内插语言模型。 根据每个基本语言模型的加权来加权贡献。 内插语言模型用于将接收的语音输入转换为文本输出。 在远离电子设备的计算机服务器系统处接收语音输入。 文本输出被传送到电子设备。