Timely speech recognition
    1.
    发明授权
    Timely speech recognition 有权
    及时的语音识别

    公开(公告)号:US09099090B2

    公开(公告)日:2015-08-04

    申请号:US13632962

    申请日:2012-10-01

    IPC分类号: G10L15/26 G10L15/22

    CPC分类号: G10L15/26 G10L15/22

    摘要: An automatic speech recognition engine may generate text or tokens that correspond to audio data. For example, the automatic speech recognition engine may generate first text or first speech tokens corresponding to a first portion of audio data. The automatic speech recognition engine may further generate second text or second speech tokens that correspond to a first portion of the audio data and a second portion of the audio data. The text or speech tokens generated by the automatic speech recognition engine may be provided to a device for presentation thereon. In some embodiments, the automatic speech recognition engine generates the second text or second speech tokens substantially while the first text or first speech tokens are presented on the device.

    摘要翻译: 自动语音识别引擎可以产生对应于音频数据的文本或令牌。 例如,自动语音识别引擎可以产生对应于音频数据的第一部分的第一文本或第一语音令牌。 自动语音识别引擎还可以生成对应于音频数据的第一部分和音频数据的第二部分的第二文本或第二语音令牌。 由自动语音识别引擎生成的文本或语音令牌可以被提供给用于在其上呈现的设备。 在一些实施例中,自动语音识别引擎基本上产生第二文本或第二语音令牌,同时第一文本或第一语音令牌被呈现在设备上。

    Speech recognition with hierarchical networks
    2.
    发明授权
    Speech recognition with hierarchical networks 有权
    语音识别与分层网络

    公开(公告)号:US09093061B1

    公开(公告)日:2015-07-28

    申请号:US13434315

    申请日:2012-03-29

    IPC分类号: G10L15/14 G10L15/00 G10L15/06

    摘要: Provided are systems and methods for using hierarchical networks for recognition, such as speech recognition. Conventional automatic recognition systems may not be both efficient and flexible. Recognition systems are disclosed that may achieve efficiency and flexibility by employing hierarchical networks, prefix consolidation of networks, and future consolidation of networks. The disclosed networks may be associated with a network model and the associated network model may be modified during recognition to achieve greater flexibility.

    摘要翻译: 提供了用于使用分层网络进行识别的系统和方法,例如语音识别。 常规的自动识别系统可能不是既有效又灵活。 公开了可以通过采用分层网络,网络前缀整合以及未来网络整合来实现效率和灵活性的识别系统。 所公开的网络可以与网络模型相关联,并且可以在识别期间修改相关联的网络模型以实现更大的灵活性。

    Continuous speech transcription performance indication
    3.
    发明授权
    Continuous speech transcription performance indication 有权
    连续语音转录性能指标

    公开(公告)号:US08868420B1

    公开(公告)日:2014-10-21

    申请号:US14010433

    申请日:2013-08-26

    IPC分类号: G10L21/00

    摘要: A method of providing speech transcription performance indication includes receiving, at a user device data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.

    摘要翻译: 提供语音转录性能指示的方法包括在用户设备处接收表示由ASR系统从音频流转录的文本的数据和表示与音频流相关联的度量的数据; 经由所述用户设备显示所述文本; 并且经由用户设备,以用户可感知的形式提供所述度量的指示符。 另一方法包括由用户设备显示由ASR系统从音频流转录的文本; 并且经由用户设备,以用户可感知的形式提供音频流的背景噪声水平的指示符。 另一种方法包括接收表示音频流的数据; 通过ASR系统将表示音频流的所述数据转换为文本; 确定与所述音频流相关联的度量; 将表示所述文本的数据发送到用户设备; 以及将表示所述度量的数据发送到用户设备。

    Continuous speech transcription performance indication
    5.
    发明授权
    Continuous speech transcription performance indication 有权
    连续语音转录性能指标

    公开(公告)号:US08510109B2

    公开(公告)日:2013-08-13

    申请号:US12197213

    申请日:2008-08-22

    IPC分类号: G10L15/00

    摘要: A method of providing speech transcription performance indication includes receiving, at a user device data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.

    摘要翻译: 提供语音转录性能指示的方法包括在用户设备处接收表示由ASR系统从音频流转录的文本的数据和表示与音频流相关联的度量的数据; 经由所述用户设备显示所述文本; 并且经由用户设备,以用户可感知的形式提供所述度量的指示符。 另一方法包括由用户设备显示由ASR系统从音频流转录的文本; 并且经由用户设备,以用户可感知的形式提供音频流的背景噪声水平的指示符。 另一种方法包括接收表示音频流的数据; 通过ASR系统将表示音频流的所述数据转换为文本; 确定与所述音频流相关联的度量; 将表示所述文本的数据发送到用户设备; 以及将表示所述度量的数据发送到用户设备。

    Filtering transcriptions of utterances
    6.
    发明授权
    Filtering transcriptions of utterances 有权
    过滤话语的转录

    公开(公告)号:US08498872B2

    公开(公告)日:2013-07-30

    申请号:US13621194

    申请日:2012-09-15

    IPC分类号: G10L15/30

    CPC分类号: G10L15/193 G10L15/30

    摘要: Audio data that includes speech may be transcribed by a speech recognition engine to generate speech recognition results, such as a transcription. One or more filters may be selected and applied to the speech recognition results to generate filtered speech recognition results. The one or more filters may be selected based at least in part on a characteristic of the speech recognition results, a characteristic of the audio data, or any other characteristic.

    摘要翻译: 包括语音的音频数据可以由语音识别引擎转录以产生诸如转录的语音识别结果。 可以选择一个或多个滤波器并将其应用于语音识别结果以产生滤波的语音识别结果。 可以至少部分地基于语音识别结果的特性,音频数据的特性或任何其它特性来选择一个或多个滤波器。

    Use of intermediate speech transcription results in editing final speech transcription results
    7.
    发明授权
    Use of intermediate speech transcription results in editing final speech transcription results 有权
    使用中间语音转录可以编辑最终语音转录结果

    公开(公告)号:US08352261B2

    公开(公告)日:2013-01-08

    申请号:US12400723

    申请日:2009-03-09

    IPC分类号: G10L15/26 G10L15/08

    CPC分类号: G10L15/22 G10L2015/221

    摘要: A communication system includes at least one transmitting device and at least one receiving device, one or more network systems for connecting the transmitting device to the receiving device, and an automatic speech recognition (“ASR”) system, including an ASR engine. A user speaks an utterance into the transmitting device, and the recorded speech audio is sent to the ASR engine. The ASR engine returns intermediate transcription results to the transmitting device, which displays the intermediate transcription results in real-time to the user. The intermediate transcription results are also correlated by utterance fragment to final transcription results and displayed to the user. The user may use the information thus presented to make decisions as to whether to edit the final transcription results or to speak the utterance again, thereby repeating the process. The intermediate transcription results may also be used by the user to edit the final transcription results.

    摘要翻译: 通信系统包括至少一个发送设备和至少一个接收设备,用于将发送设备连接到接收设备的一个或多个网络系统以及包括ASR引擎的自动语音识别(ASR)系统。 用户对发送设备说话,并将记录的语音音频发送到ASR引擎。 ASR引擎将中间转录结果返回到发送设备,其向用户实时显示中间转录结果。 中间转录结果也通过说话片段与最终转录结果相关,并显示给用户。 用户可以使用这样呈现的信息来做出关于是否编辑最终转录结果或再次说话的决定,从而重复该过程。 中间转录结果也可以由用户使用来编辑最终的转录结果。

    Methods, apparatuses, and systems for providing timely user cues pertaining to speech recognition
    8.
    发明授权
    Methods, apparatuses, and systems for providing timely user cues pertaining to speech recognition 有权
    用于及时提供与语音识别相关的用户提示的方法,设备和系统

    公开(公告)号:US08301454B2

    公开(公告)日:2012-10-30

    申请号:US12546636

    申请日:2009-08-24

    IPC分类号: G10L21/00

    CPC分类号: G10L15/26 G10L15/22

    摘要: A method is provided of providing cues from am electronic communication device to a user while capturing an utterance. A plurality of cues associated with the user utterance are provided by the device to the user in at least near real-time. For each of a plurality of portions of the utterance, data representative of the respective portion of the user utterance is communicated from the electronic communication device to a remote electronic device. In response to this communication, data, representative of at least one parameter associated with the respective portion of the user utterance, is received at the electronic communication device. The electronic communication device provides one or more cues to the user based on the at least parameter. At least one of the cues is provided by the electronic communication device to the user prior to completion of the step of capturing the user utterance.

    摘要翻译: 提供了一种在捕获话语时从电子通信设备向用户提供线索的方法。 与用户话语相关联的多个提示由至少几乎实时的设备提供给用户。 对于话音的多个部分中的每一个,表示用户话语的相应部分的数据从电子通信设备传送到远程电子设备。 响应于该通信,代表与用户话语的相应部分相关联的至少一个参数的数据在电子通信设备处被接收。 电子通信设备基于至少一个参数向用户提供一个或多个提示。 在完成用户发声的步骤完成之前,电子通信装置中的至少一个提示被提供给用户。

    VALIDATION OF MOBILE ADVERTISING FROM DERIVED INFORMATION
    10.
    发明申请
    VALIDATION OF MOBILE ADVERTISING FROM DERIVED INFORMATION 有权
    从传播信息中确定移动广告

    公开(公告)号:US20140180823A1

    公开(公告)日:2014-06-26

    申请号:US14081983

    申请日:2013-11-15

    IPC分类号: G06Q30/02

    摘要: A system and method of validating an advertisement presented to an advertisement recipient via a mobile communication device includes presenting an advertisement for a product or service to a recipient via a mobile communication device, monitoring the geospatial location of the mobile communication device relative to some predetermined criteria, and inferring information about the reaction of the advertisement recipient to the advertisement on the basis of the monitored geospatial location information.

    摘要翻译: 经由移动通信设备验证向广告接收者呈现的广告的系统和方法包括经由移动通信设备向接收者呈现产品或服务的广告,监视移动通信设备相对于某些预定标准的地理空间位置 并且基于所监视的地理空间位置信息推断关于广告接收者对广告的反应的信息。