Continuous speech transcription performance indication
    12.
    Granted patent
    Continuous speech transcription performance indication (in force)

    Publication No.: US08510109B2

    Publication date: 2013-08-13

    Application No.: US12197213

    Filing date: 2008-08-22

    IPC classification: G10L15/00

    Abstract: A method of providing speech transcription performance indication includes receiving, at a user device, data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and, via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and, via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.
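
The flow in this abstract (determine a metric for the audio, send text and metric to the device, show both) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the noise-floor estimate, the JSON payload fields `text` and `noise_db`, and the thresholds in `indicator_for`.

```python
import json
import math


def estimate_noise_db(samples, frame_size=4):
    """Toy noise-floor estimate (hypothetical metric): RMS of the quietest
    frame, in dB relative to full scale. `samples` must be non-empty."""
    frames = [samples[i:i + frame_size] for i in range(0, len(samples), frame_size)]
    rms_values = [math.sqrt(sum(s * s for s in f) / len(f)) for f in frames]
    quietest = max(min(rms_values), 1e-6)  # clamp to avoid log10(0)
    return 20 * math.log10(quietest)


def build_payload(text, samples):
    """Server side: bundle the transcribed text with the metric."""
    return json.dumps({"text": text, "noise_db": estimate_noise_db(samples)})


def indicator_for(noise_db):
    """Device side: map the metric to a user-perceptible label.
    The thresholds are illustrative assumptions."""
    if noise_db < -40:
        return "low noise"
    if noise_db < -20:
        return "moderate noise"
    return "high noise"


def render_on_device(payload):
    """Device side: display the text together with the indicator."""
    data = json.loads(payload)
    return f"{data['text']}  [{indicator_for(data['noise_db'])}]"


# Loud speech frame followed by a near-silent frame.
samples = [0.5, -0.5, 0.5, -0.5, 0.001, -0.001, 0.001, -0.001]
print(render_on_device(build_payload("hello world", samples)))
```

A real device could render the indicator as a color, icon, or sound rather than a text label; the abstract only requires that it be user-perceptible.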


    CONTINUOUS SPEECH TRANSCRIPTION PERFORMANCE INDICATION
    13.
    Patent application
    CONTINUOUS SPEECH TRANSCRIPTION PERFORMANCE INDICATION (in force)

    Publication No.: US20090055175A1

    Publication date: 2009-02-26

    Application No.: US12197213

    Filing date: 2008-08-22

    IPC classification: G10L15/26

    Abstract: A method of providing speech transcription performance indication includes receiving, at a user device, data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and, via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and, via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.


    Corrective feedback loop for automated speech recognition
    14.
    Granted patent
    Corrective feedback loop for automated speech recognition (in force)

    Publication No.: US08793122B2

    Publication date: 2014-07-29

    Application No.: US13621189

    Filing date: 2012-09-15

    IPC classification: G10L15/00

    Abstract: Audio data that includes speech may be transcribed using a language model. The transcription may be provided to a user. The user may provide feedback on the transcription, and the language model may be updated based at least in part on the feedback. The feedback may include, for example, an affirmation of the transcription; a disapproval of the transcription; a correction to the transcription; a selection of an alternate transcription result; or any other kind of response.
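
As a rough illustration of how user feedback might update a language model, the Python sketch below uses a toy bigram-count model. The class `BigramModel`, the function `apply_feedback`, and the feedback labels are hypothetical names chosen for this example; the abstract does not specify the model type or the update rule.

```python
from collections import Counter


class BigramModel:
    """Toy language model: raw bigram counts (an illustrative stand-in)."""

    def __init__(self):
        self.counts = Counter()

    @staticmethod
    def _bigrams(sentence):
        words = ["<s>"] + sentence.lower().split() + ["</s>"]
        return list(zip(words, words[1:]))

    def observe(self, sentence, weight=1):
        """Add evidence for every bigram in the sentence."""
        for pair in self._bigrams(sentence):
            self.counts[pair] += weight

    def score(self, sentence):
        """Sum of bigram counts; higher means more familiar to the model."""
        return sum(self.counts[pair] for pair in self._bigrams(sentence))


def apply_feedback(model, transcription, feedback, replacement=None):
    """Update the model from one piece of user feedback."""
    if feedback == "affirm":
        model.observe(transcription)
    elif feedback in ("correct", "select_alternate"):
        if replacement is None:
            raise ValueError("a replacement text is required")
        model.observe(replacement)
    elif feedback == "disapprove":
        pass  # no positive evidence to add in this simple sketch
    else:
        raise ValueError(f"unknown feedback type: {feedback}")


model = BigramModel()
apply_feedback(model, "meet me at ate", "correct", "meet me at eight")
print(model.score("meet me at eight"), model.score("meet me at ate"))
```

After the correction, the corrected sentence scores higher than the original misrecognition, so a recognizer that rescored its hypotheses with this model would favor it next time.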


    METHODS AND SYSTEMS FOR DYNAMICALLY UPDATING WEB SERVICE PROFILE INFORMATION BY PARSING TRANSCRIBED MESSAGE STRINGS
    15.
    Patent application
    METHODS AND SYSTEMS FOR DYNAMICALLY UPDATING WEB SERVICE PROFILE INFORMATION BY PARSING TRANSCRIBED MESSAGE STRINGS (in force)

    Publication No.: US20090083032A1

    Publication date: 2009-03-26

    Application No.: US12212644

    Filing date: 2008-09-17

    IPC classification: G10L15/26 H04W4/14

    Abstract: Systems, methods, and software are disclosed for parsing and/or filtering message strings of text messages and/or instant messages in order to identify keywords, phrases, or fragments, as a function of which user preferences of user profiles are dynamically updated. Such systems, methods, and software are utilized in the context of a communication system including text messaging, instant messaging, or both. Furthermore, such a communication system preferably includes an automatic speech recognition (ASR) system. Additionally, ad impressions are selected and delivered to users based, at least in part, on the parsing and/or filtering and/or on data maintained in user profiles as dynamically updated from time to time. The ad impression preferably is delivered within a text message or within an instant message conversation and is generally unobtrusive. Revenues preferably may be generated from the delivery of the ad impressions, whereby a provider of instant messaging or text messaging may further derive monetary benefit from providing such service and whereby users of such service may be provided with contextually relevant information in an unobtrusive manner.
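
The keyword-driven profile update described above could look roughly like the following Python sketch. The keyword table `KEYWORD_TOPICS`, the ad texts in `ADS`, and the choice of a per-topic counter as the "profile" are all assumptions made for illustration; the abstract does not say how profiles or ads are represented.

```python
import re
from collections import Counter

# Hypothetical keyword-to-topic table and ad inventory.
KEYWORD_TOPICS = {
    "pizza": "food",
    "dinner": "food",
    "flight": "travel",
    "hotel": "travel",
}
ADS = {
    "food": "Two-for-one pizza tonight",
    "travel": "Weekend hotel deals",
}


def update_profile(profile, message):
    """Parse one (transcribed) message and bump the matching topic counts."""
    for token in re.findall(r"[a-z']+", message.lower()):
        topic = KEYWORD_TOPICS.get(token)
        if topic is not None:
            profile[topic] += 1
    return profile


def select_ad(profile):
    """Pick the ad for the user's most frequent topic, or None if unknown."""
    if not profile:
        return None
    topic, _count = profile.most_common(1)[0]
    return ADS.get(topic)


profile = Counter()
update_profile(profile, "Want pizza for dinner?")
update_profile(profile, "Booked the flight")
print(profile, select_ad(profile))
```

In this sketch the profile changes with every message, which is one simple reading of "dynamically updated"; a production system would likely also decay old counts and match phrases, not only single words.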


    USE OF METADATA TO POST PROCESS SPEECH RECOGNITION OUTPUT
    16.
    Patent application
    USE OF METADATA TO POST PROCESS SPEECH RECOGNITION OUTPUT (in force)

    Publication No.: US20090248415A1

    Publication date: 2009-10-01

    Application No.: US12415874

    Filing date: 2009-03-31

    CPC classification: G10L15/30 G10L2015/228

    Abstract: A method utilizes metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to correct a possible misrecognition of a spoken proper noun, such as a name or company, by replacing it with its proper textual form, or to convert a spoken phone number into a correctly formatted phone number in Arabic numerals, thereby improving the overall accuracy of the output of the voice recognition system.
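
A minimal Python sketch of the comparison step follows: each word position carries an n-best candidate list, and a candidate that matches an address-book entry overrides the recognizer's top choice. The function name `post_process`, the list-of-lists input shape, and the case-insensitive exact match are assumptions for this example; the patent may use a different matching rule.

```python
def post_process(nbest_per_word, address_book_names):
    """Choose one word per position, preferring address-book matches.

    nbest_per_word: one non-empty candidate list per word position,
                    ordered best-first by the recognizer.
    address_book_names: names from the user's metadata.
    """
    # Map lowercase form to the address book's own spelling.
    known = {name.lower(): name for name in address_book_names}
    output = []
    for candidates in nbest_per_word:
        chosen = candidates[0]  # default: the recognizer's top hypothesis
        for candidate in candidates:
            if candidate.lower() in known:
                chosen = known[candidate.lower()]  # use the proper textual form
                break
        output.append(chosen)
    return " ".join(output)


nbest = [["call"], ["john", "jon", "joan"], ["tomorrow"]]
print(post_process(nbest, ["Jon"]))
```

With "Jon" in the address book, the second-ranked candidate replaces the top-ranked "john" and takes the address book's capitalization; with an empty address book the top hypotheses pass through unchanged.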


    Use of metadata to post process speech recognition output
    17.
    Granted patent
    Use of metadata to post process speech recognition output (in force)

    Publication No.: US08676577B2

    Publication date: 2014-03-18

    Application No.: US12415874

    Filing date: 2009-03-31

    IPC classification: G10L15/00

    CPC classification: G10L15/30 G10L2015/228

    Abstract: A method utilizes metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to correct a possible misrecognition of a spoken proper noun, such as a name or company, by replacing it with its proper textual form, or to convert a spoken phone number into a correctly formatted phone number in Arabic numerals, thereby improving the overall accuracy of the output of the voice recognition system.


    CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION
    18.
    Patent application
    CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION (in force)

    Publication No.: US20130024195A1

    Publication date: 2013-01-24

    Application No.: US13621189

    Filing date: 2012-09-15

    IPC classification: G10L15/26

    Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, at the client device, a result transcribed from the audio message at the first remote server using an automatic speech recognition (“ASR”) system; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
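
The client-side sequence in this abstract can be sketched in Python as below. The class `FeedbackClient` and its two callables are hypothetical: `transcribe` stands in for the round trip to the first remote server and `report` for the upload to the second remote server, and the use of a random UUID as the message identifier is an assumption.

```python
import uuid
from typing import Callable, Dict, Tuple


class FeedbackClient:
    """Illustrative client that records affirmed transcriptions."""

    def __init__(self,
                 transcribe: Callable[[bytes], str],
                 report: Callable[[str, str], None]):
        self.transcribe = transcribe      # stand-in for the first remote server
        self.report = report              # stand-in for the second remote server
        self.pending: Dict[str, str] = {}  # results awaiting user feedback
        self.stored: Dict[str, str] = {}   # affirmed results, keyed by identifier

    def submit_audio(self, audio: bytes) -> Tuple[str, str]:
        """Send audio for transcription and keep the result until feedback arrives."""
        message_id = uuid.uuid4().hex
        result = self.transcribe(audio)
        self.pending[message_id] = result
        return message_id, result

    def affirm(self, message_id: str) -> None:
        """User affirmed the result: store it with its identifier, then report it."""
        result = self.pending.pop(message_id)
        self.stored[message_id] = result
        self.report(message_id, result)


reported = []
client = FeedbackClient(
    transcribe=lambda audio: "hello",
    report=lambda message_id, result: reported.append((message_id, result)),
)
message_id, result = client.submit_audio(b"\x00\x01")
client.affirm(message_id)
print(reported)
```

Keeping the identifier with the stored result lets the second server pair the affirmed text with the original audio message, which is presumably what makes the pair usable for updating the language model.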


    Corrective feedback loop for automated speech recognition
    19.
    Granted patent
    Corrective feedback loop for automated speech recognition (in force)

    Publication No.: US08352264B2

    Publication date: 2013-01-08

    Application No.: US12407502

    Filing date: 2009-03-19

    IPC classification: G10L15/00 G10L15/14

    Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, at the client device, a result transcribed from the audio message at the first remote server using an automatic speech recognition (“ASR”) system; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.


    CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION
    20.
    Patent application
    CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION (in force)

    Publication No.: US20090240488A1

    Publication date: 2009-09-24

    Application No.: US12407502

    Filing date: 2009-03-19

    IPC classification: G06F17/27 G10L15/00 G06F3/041

    Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, at the client device, a result transcribed from the audio message at the first remote server using an automatic speech recognition (“ASR”) system; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
