TRANSCRIPT RE-SYNC
    3.
    发明申请
    TRANSCRIPT RE-SYNC 有权
    转录重新同步

    公开(公告)号:US20130060572A1

    公开(公告)日:2013-03-07

    申请号:US13602991

    申请日:2012-09-04

    IPC分类号: G10L15/04

    CPC分类号: G11B27/10 G10L15/04

    摘要: In an aspect, in general, method for aligning an audio recording and a transcript includes receiving a transcript including a plurality of terms, each term of the plurality of terms associated with a time location within a different version of the audio recording, forming a plurality of search terms from the terms of the transcript, determining possible time locations of the search terms in the audio recording, determining a correspondence between time locations within the different version of the audio recording associated with the search terms and the possible time locations of the search terms in the audio recording, and aligning the audio recording and the transcript including updating the time location associated with terms of the transcript based on the determined correspondence.

    摘要翻译: 一方面,通常,用于对准音频记录和抄本的方法包括接收包括多个术语的抄本,所述多个术语的每个术语与音频记录的不同版本内的时间位置相关联,形成多个 根据抄本的条款,确定音频记录中的搜索项的可能的时间位置,确定与搜索项相关联的音频记录的不同版本之间的时间位置与搜索的可能时间位置之间的对应关系 音频记录中的术语,以及对准音频记录和记录,包括基于所确定的对应来更新与抄本的术语相关联的时间位置。

    KEYWORD SPOTTING USING A PHONEME-SEQUENCE INDEX
    4.
    发明申请
    KEYWORD SPOTTING USING A PHONEME-SEQUENCE INDEX 有权
    使用PHONEME-SEQUENCE索引的关键字点选

    公开(公告)号:US20090063151A1

    公开(公告)日:2009-03-05

    申请号:US12199123

    申请日:2008-08-27

    IPC分类号: G10L15/04

    摘要: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are indentified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.

    摘要翻译: 在一些方面,使用一个wordspotter来定位一组预定子词单元的每个音频语料库中的出现次数,这可以是音素序列。 为了在音频语料库中定位查询(例如,关键字或短语),查询中的组成子词单元被识别,然后基于由字检查者较早确定的那些子词单元的位置来确定这些子词的位置, 使用预先构建的倒排索引,将子单位映射到其位置。

    Transcript re-sync
    5.
    发明授权
    Transcript re-sync 有权
    记录重新同步

    公开(公告)号:US09536567B2

    公开(公告)日:2017-01-03

    申请号:US13602991

    申请日:2012-09-04

    CPC分类号: G11B27/10 G10L15/04

    摘要: In an aspect, in general, method for aligning an audio recording and a transcript includes receiving a transcript including a plurality of terms, each term of the plurality of terms associated with a time location within a different version of the audio recording, forming a plurality of search terms from the terms of the transcript, determining possible time locations of the search terms in the audio recording, determining a correspondence between time locations within the different version of the audio recording associated with the search terms and the possible time locations of the search terms in the audio recording, and aligning the audio recording and the transcript including updating the time location associated with terms of the transcript based on the determined correspondence.

    摘要翻译: 一方面,通常,用于对准音频记录和抄本的方法包括接收包括多个术语的抄本,所述多个术语的每个术语与音频记录的不同版本内的时间位置相关联,形成多个 根据抄本的条款,确定音频记录中的搜索项的可能的时间位置,确定与搜索项相关联的音频记录的不同版本之间的时间位置与搜索的可能时间位置之间的对应关系 音频记录中的术语,以及对准音频记录和记录,包括基于所确定的对应来更新与抄本的术语相关联的时间位置。

    Keyword spotting using a phoneme-sequence index
    6.
    发明授权
    Keyword spotting using a phoneme-sequence index 有权
    使用音素序列索引进行关键词检测

    公开(公告)号:US08311828B2

    公开(公告)日:2012-11-13

    申请号:US12199123

    申请日:2008-08-27

    IPC分类号: G10L15/04

    摘要: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are indentified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.

    摘要翻译: 在一些方面,使用一个wordspotter来定位一组预定子词单元的每个音频语料库中的出现次数,这可以是音素序列。 为了在音频语料库中定位查询(例如,关键字或短语),查询中的组成子词单元被识别,然后基于由字检查者较早确定的那些子词单元的位置来确定这些子词的位置, 使用预先构建的倒排索引,将子单位映射到其位置。

    CHANNEL COMPRESSION
    7.
    发明申请
    CHANNEL COMPRESSION 审中-公开
    频道压缩

    公开(公告)号:US20110216905A1

    公开(公告)日:2011-09-08

    申请号:US12718114

    申请日:2010-03-05

    IPC分类号: H04R5/00 G10L19/00

    CPC分类号: G10L19/00 H04R5/00

    摘要: Techniques implemented as systems, methods, and apparatuses, including computer program products, for logging multi-channel audio signals. The techniques include receiving a first audio input signal over a first audio channel and a second audio input signal over a second audio channel, the first audio channel and the second audio channel forming portions of a multi-channel call; generating supplemental information representative of characteristics of the first audio input signal, the second audio input signal, or both; after generating the supplemental information, combining the first audio input signal and the second audio input signal to form an audio output signal of a single-channel format; and storing the generated supplemental information in association with an identifier of the audio output signal, wherein at least a portion of the generated supplemental information is sufficient to enable information associated with the first audio input signal, the second audio input signal, or both to be derived from the audio output signal of the single-channel format.

    摘要翻译: 实现为用于记录多声道音频信号的系统,方法和装置(包括计算机程序产品)的技术。 这些技术包括通过第一音频通道接收第一音频输入信号,通过第二音频频道接收第二音频输入信号,第一音频通道和第二音频通道形成多声道通话的部分; 产生表示第一音频输入信号,第二音频输入信号或两者的特性的补充信息; 在产生补充信息之后,组合第一音频输入信号和第二音频输入信号以形成单声道格式的音频输出信号; 并且将生成的补充信息与音频输出信号的标识符相关联地存储,其中所生成的补充信息的至少一部分足以使与第一音频输入信号,第二音频输入信号或两者相关联的信息成为 衍生自单声道格式的音频输出信号。