Speech recognition using an operating system hooking component for context-aware recognition models
    1.
    发明授权
    Speech recognition using an operating system hooking component for context-aware recognition models 有权
    语音识别使用操作系统挂钩组件进行上下文感知识别模型

    公开(公告)号:US09489375B2

    公开(公告)日:2016-11-08

    申请号:US13526789

    申请日:2012-06-19

    摘要: Inputs provided into user interface elements of an application are observed. Records are made of the inputs and the state(s) the application was in while the inputs were provided. For each state, a corresponding language model is trained based on the input(s) provided to the application while the application was in that state. When the application is next observed to be in a previously-observed state, a language model associated with the application's current state is applied to recognize speech input provided by a user and thereby to generate speech recognition output that is provided to the application. An application's state at a particular time may include the user interface element(s) that are displayed and/or in focus at that time, and is determined by an operating system hooking component embedded in the automatic speech recognition system.

    摘要翻译: 观察到提供给应用程序的用户界面元素的输入。 记录由输入和应用程序在提供输入时所处的状态组成。 对于每个状态,在应用程序处于该状态时,基于提供给应用程序的输入来对相应的语言模型进行训练。 当应用程序接下来观察到处于先前观察到的状态时,应用与应用程序的当前状态相关联的语言模型来识别由用户提供的语音输入,从而生成提供给应用的语音识别输出。 在特定时间的应用程序的状态可以包括当时显示和/或聚焦的用户界面元素,并且由嵌入在自动语音识别系统中的操作系统挂钩组件确定。

    Using Alternative Sources of Evidence in Computer-Assisted Billing Coding
    2.
    发明申请
    Using Alternative Sources of Evidence in Computer-Assisted Billing Coding 审中-公开
    在计算机辅助计费编码中使用替代证据来源

    公开(公告)号:US20120323598A1

    公开(公告)日:2012-12-20

    申请号:US13526684

    申请日:2012-06-19

    IPC分类号: G06Q50/22

    CPC分类号: G06Q50/22 G06Q30/04

    摘要: A computerized billing code generator reviews billing source data containing both admissible data (such as physician's notes) and inadmissible data (such as nurse's notes). The billing code generator determines whether to generate a request to review the first data based on both the first data and the second data. For example, the billing code generator may generate the request in response to determining that the second data contains information that is inconsistent with information contained in the first data. As another example, the billing code generator may generate the request in response to determining that the second data contains information that is not contained within the first data.

    摘要翻译: 计算机化的计费代码生成器检查包含可接受数据(例如医生的笔记)和不允许的数据(例如护士的笔记)的计费来源数据。 计费代码生成器基于第一数据和第二数据来确定是否生成查看第一数据的请求。 例如,计费代码生成器可以响应于确定第二数据包含与包含在第一数据中的信息不一致的信息而生成请求。 作为另一示例,响应于确定第二数据包含不包含在第一数据内的信息,计费代码生成器可以生成请求。

    Content-Based Audio Playback Emphasis
    3.
    发明申请
    Content-Based Audio Playback Emphasis 有权
    基于内容的音频播放强调

    公开(公告)号:US20100318347A1

    公开(公告)日:2010-12-16

    申请号:US12859883

    申请日:2010-08-20

    IPC分类号: G06F17/27

    摘要: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

    摘要翻译: 公开了用于促进校对口头音频流的草稿的过程的技术。 一般来说,通过播放对应的口语音频流,强调音频流中与那些高度相关或可能被错误地转录的那些区域,来校对草稿。 例如,区域可能会被强调为比相关程度低且可能被正确转录的地区的播放速度更慢。 强调音频流中最重要的那些区域是正确转录的,那些最有可能被错误转录的区域增加了校对者准确地纠正这些区域中的任何错误的可能性,从而提高了抄本的整体准确性。

    Verification of extracted data
    4.
    发明授权
    Verification of extracted data 有权
    提取数据的验证

    公开(公告)号:US07716040B2

    公开(公告)日:2010-05-11

    申请号:US11766767

    申请日:2007-06-21

    摘要: Facts are extracted from speech and recorded in a document using codings. Each coding represents an extracted fact and includes a code and a datum. The code may represent a type of the extracted fact and the datum may represent a value of the extracted fact. The datum in a coding is rendered based on a specified feature of the coding. For example, the datum may be rendered as boldface text to indicate that the coding has been designated as an “allergy.” In this way, the specified feature of the coding (e.g., “allergy”-ness) is used to modify the manner in which the datum is rendered. A user inspects the rendering and provides, based on the rendering, an indication of whether the coding was accurately designated as having the specified feature. A record of the user's indication may be stored, such as within the coding itself.

    摘要翻译: 事实是从言语中提取的,并使用编码记录在文档中。 每个编码表示提取的事实,并包括代码和基准。 代码可以表示提取的事实的类型,并且数据可以表示提取的事实的值。 编码中的数据基于编码的指定特征进行渲染。 例如,数据可以呈现为粗体文本,以指示编码已被指定为“过敏”。以这种方式,编码的特定特征(例如,“过敏”)用于修改方式 其中基准被渲染。 用户检查呈现并基于呈现提供编码是否被准确地指定为具有指定特征的指示。 可以存储用户指示的记录,例如在编码本身内。

    Audio signal de-identification
    5.
    发明授权
    Audio signal de-identification 有权
    音频信号去识别

    公开(公告)号:US07502741B2

    公开(公告)日:2009-03-10

    申请号:US11064343

    申请日:2005-02-23

    IPC分类号: G06Q50/00 G10L15/26 G06F17/21

    摘要: Techniques are disclosed for automatically de-identifying spoken audio signals. In particular, techniques are disclosed for automatically removing personally identifying information from spoken audio signals and replacing such information with non-personally identifying information. De-identification of a spoken audio signal may be performed by automatically generating a report based on the spoken audio signal. The report may include concept content (e.g., text) corresponding to one or more concepts represented by the spoken audio signal. The report may also include timestamps indicating temporal positions of speech in the spoken audio signal that corresponds to the concept content. Concept content that represents personally identifying information is identified. Audio corresponding to the personally identifying concept content is removed from the spoken audio signal. The removed audio may be replaced with non-personally identifying audio.

    摘要翻译: 公开了用于自动取消识别口头音频信号的技术。 特别地,公开了用于自动从口头音频信号中移除个人识别信息并用非个人识别信息替换这些信息的技术。 可以通过基于所述口语音频信号自动生成报告来执行语音音频信号的取消识别。 报告可以包括对应于由口头音频信号表示的一个或多个概念的概念内容(例如,文本)。 报告还可以包括指示对应于概念内容的口语音频信号中的语音的时间位置的时间戳。 识别表示个人识别信息的概念内容。 与个人识别概念内容相对应的音频从口头音频信号中去除。 删除的音频可以被非个人识别的音频替换。

    Document extension in dictation-based document generation workflow
    6.
    发明授权
    Document extension in dictation-based document generation workflow 有权
    基于口授的文档生成工作流中的文档扩展

    公开(公告)号:US08781829B2

    公开(公告)日:2014-07-15

    申请号:US13527347

    申请日:2012-06-19

    IPC分类号: G10L15/00

    摘要: An automatic speech recognizer is used to produce a structured document representing the contents of human speech. A best practice is applied to the structured document to produce a conclusion, such as a conclusion that required information is missing from the structured document. Content is inserted into the structured document based on the conclusion, thereby producing a modified document. The inserted content may be obtained by prompting a human user for the content and receiving input representing the content from the human user.

    摘要翻译: 自动语音识别器用于产生表示人类语言内容的结构化文档。 对结构化文档应用最佳实践来得出结论,例如从结构化文档中缺少所需信息的结论。 根据结论将内容插入到结构化文档中,从而生成修改后的文档。 插入的内容可以通过向人类用户提示内容并从人类用户接收表示内容的输入来获得。

    Content-based audio playback emphasis
    7.
    发明授权
    Content-based audio playback emphasis 有权
    基于内容的音频播放强调

    公开(公告)号:US07844464B2

    公开(公告)日:2010-11-30

    申请号:US11187119

    申请日:2005-07-22

    IPC分类号: G10L21/00

    摘要: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

    摘要翻译: 公开了用于促进校对口头音频流的草稿的过程的技术。 一般来说,通过播放对应的口语音频流,强调音频流中与那些高度相关或可能被错误地转录的那些区域,来校对草稿。 例如,区域可能会被强调为比相关程度低且可能被正确转录的地区的播放速度更慢。 强调音频流中最重要的那些区域是正确转录的,那些最有可能被错误转录的区域增加了校对者准确地纠正这些区域中的任何错误的可能性,从而提高了抄本的整体准确性。

    Applying Service Levels to Transcripts
    8.
    发明申请
    Applying Service Levels to Transcripts 有权
    将服务级别应用于成绩单

    公开(公告)号:US20070299652A1

    公开(公告)日:2007-12-27

    申请号:US11766784

    申请日:2007-06-21

    IPC分类号: G06F17/27

    摘要: Speech is transcribed to produce a draft transcript of the speech. Portions of the transcript having a high priority are identified. For example, particular sections of the transcript may be identified as high-priority sections. As another example, portions of the transcript requiring human verification may be identified as high-priority sections. High-priority portions of the transcript are verified at a first time, without verifying other portions of the transcript. Such other portions may or may not be verified at a later time. Limiting verification, either initially or entirely, to high-priority portions of the transcript limits the time required to perform such verification, thereby making it feasible to verify the most important portions of the transcript at an early stage without introducing an undue delay into the transcription process. Verifying the other portions of the transcript later ensures that early verification of the high-priority portions does not sacrifice overall verification accuracy.

    摘要翻译: 演讲转载为演讲稿。 识别具有高优先级的部分转录本。 例如,誊本的特定部分可以被识别为高优先级部分。 作为另一个例子,需要人类验证的部分转录物可以被识别为高优先级部分。 誊本的高优先级部分在第一时间被验证,而不验证抄本的其他部分。 这样的其他部分可以在以后也可能不被验证。 初步或全部将验证限制在转录本的高优先级部分,限制进行此类验证所需的时间,从而使得可以在早期阶段验证转录本的最重要部分,而不会在转录中引入不适当的延迟 处理。 稍后验证转录本的其他部分将确保高优先级部分的早期验证不会牺牲整体验证准确性。

    Document Extension in Dictation-Based Document Generation Workflow
    9.
    发明申请
    Document Extension in Dictation-Based Document Generation Workflow 有权
    基于听写文档生成工作流程的文档扩展

    公开(公告)号:US20120323572A1

    公开(公告)日:2012-12-20

    申请号:US13527347

    申请日:2012-06-19

    IPC分类号: G10L15/26

    摘要: An automatic speech recognizer is used to produce a structured document representing the contents of human speech. A best practice is applied to the structured document to produce a conclusion, such as a conclusion that required information is missing from the structured document. Content is inserted into the structured document based on the conclusion, thereby producing a modified document. The inserted content may be obtained by prompting a human user for the content and receiving input representing the content from the human user.

    摘要翻译: 自动语音识别器用于产生表示人类语言内容的结构化文档。 对结构化文档应用最佳实践来得出结论,例如从结构化文档中缺少所需信息的结论。 根据结论将内容插入到结构化文档中,从而生成修改后的文档。 插入的内容可以通过向人类用户提示内容并从人类用户接收表示内容的输入来获得。

    Audio Signal De-Identification
    10.
    发明申请
    Audio Signal De-Identification 审中-公开
    音频信号去识别

    公开(公告)号:US20120303365A1

    公开(公告)日:2012-11-29

    申请号:US13303362

    申请日:2011-11-23

    IPC分类号: G10L15/00

    摘要: Techniques are disclosed for automatically de-identifying spoken audio signals. In particular, techniques are disclosed for automatically removing personally identifying information from spoken audio signals and replacing such information with non-personally identifying information. De-identification of a spoken audio signal may be performed by automatically generating a report based on the spoken audio signal. The report may include concept content (e.g., text) corresponding to one or more concepts represented by the spoken audio signal. The report may also include timestamps indicating temporal positions of speech in the spoken audio signal that corresponds to the concept content. Concept content that represents personally identifying information is identified. Audio corresponding to the personally identifying concept content is removed from the spoken audio signal. The removed audio may be replaced with non-personally identifying audio.

    摘要翻译: 公开了用于自动取消识别口头音频信号的技术。 特别地,公开了用于自动从口头音频信号中移除个人识别信息并用非个人识别信息替换这些信息的技术。 可以通过基于所述口语音频信号自动生成报告来执行语音音频信号的取消识别。 报告可以包括对应于由口头音频信号表示的一个或多个概念的概念内容(例如,文本)。 报告还可以包括指示对应于概念内容的口语音频信号中的语音的时间位置的时间戳。 识别表示个人识别信息的概念内容。 与个人识别概念内容相对应的音频从口头音频信号中去除。 删除的音频可以被非个人识别的音频替换。