-
Publication No.: US07640161B2
Publication date: 2009-12-29
Application No.: US11748319
Filing date: 2007-05-14
Applicant: Robert W. Morris, Jon A. Arrowood, Marsal Gavalda, Peter S. Cardillo, Mark Finlay, Zahi Karam
Inventor: Robert W. Morris, Jon A. Arrowood, Marsal Gavalda, Peter S. Cardillo, Mark Finlay, Zahi Karam
IPC classification: G10L13/00
CPC classification: G10L15/26, G10L2015/025, G10L2015/0638, G10L2015/088
Abstract: An approach to improving the performance of a wordspotting system includes providing an interface for interactive improvement of a phonetic representation of a query based on an operator identifying true detections and false alarms in a data set.
-
Publication No.: US20130060572A1
Publication date: 2013-03-07
Application No.: US13602991
Filing date: 2012-09-04
Applicant: Jacob B. Garland, Drew Lanham, Daryl Kip Watters, Marsal Gavalda, Mark Finlay, Kenneth K. Griggs
Inventor: Jacob B. Garland, Drew Lanham, Daryl Kip Watters, Marsal Gavalda, Mark Finlay, Kenneth K. Griggs
IPC classification: G10L15/04
Abstract: In an aspect, in general, a method for aligning an audio recording and a transcript includes receiving a transcript including a plurality of terms, each term of the plurality of terms associated with a time location within a different version of the audio recording, forming a plurality of search terms from the terms of the transcript, determining possible time locations of the search terms in the audio recording, determining a correspondence between time locations within the different version of the audio recording associated with the search terms and the possible time locations of the search terms in the audio recording, and aligning the audio recording and the transcript including updating the time location associated with terms of the transcript based on the determined correspondence.
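The alignment steps in this abstract can be illustrated with a minimal sketch. The abstract does not say how the correspondence is determined, so the constant-offset voting used here is an assumption made for illustration, not the patented method. The function names `estimate_offset` and `realign`, the `resolution` parameter, and the sample data are all hypothetical.

```python
from collections import Counter


def estimate_offset(transcript, hits, resolution=0.5):
    """Vote for the most common time shift between old and candidate times.

    transcript: list of (term, old_time) pairs, where old_time refers to a
                different version of the audio recording.
    hits:       dict mapping a search term to candidate times (seconds)
                found in the current recording.
    Assumption: the two versions differ by a single constant offset.
    """
    votes = Counter()
    for term, old_time in transcript:
        for candidate in hits.get(term, []):
            # Quantize the shift so that near-identical shifts share a bin.
            votes[round((candidate - old_time) / resolution)] += 1
    if not votes:
        return 0.0
    best_bin, _ = votes.most_common(1)[0]
    return best_bin * resolution


def realign(transcript, hits, resolution=0.5):
    """Update each term's time location using the estimated offset."""
    offset = estimate_offset(transcript, hits, resolution)
    return [(term, old_time + offset) for term, old_time in transcript]


# Hypothetical data: the current recording starts 3 seconds later than
# the version the transcript was timed against; some hits are spurious.
transcript = [("hello", 1.0), ("world", 2.0), ("again", 5.0)]
hits = {"hello": [4.0, 9.5], "world": [5.0], "again": [8.0, 1.0]}
aligned = realign(transcript, hits)
```

Voting tolerates spurious detections because a false hit rarely agrees with the others on the same shift. A real system would likely need a non-constant mapping to handle material that was cut or inserted between versions.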
-
Publication No.: US20070271241A1
Publication date: 2007-11-22
Application No.: US11748319
Filing date: 2007-05-14
Applicant: Robert Morris, Jon Arrowood, Marsal Gavalda, Peter Cardillo, Mark Finlay, Zahi Karam
Inventor: Robert Morris, Jon Arrowood, Marsal Gavalda, Peter Cardillo, Mark Finlay, Zahi Karam
IPC classification: G06F17/30
CPC classification: G10L15/26, G10L2015/025, G10L2015/0638, G10L2015/088
Abstract: An approach to improving the performance of a wordspotting system includes providing an interface for interactive improvement of a phonetic representation of a query based on an operator identifying true detections and false alarms in a data set.
-
Publication No.: US09536567B2
Publication date: 2017-01-03
Application No.: US13602991
Filing date: 2012-09-04
Applicant: Jacob B. Garland, Drew Lanham, Daryl Kip Watters, Marsal Gavalda, Mark Finlay, Kenneth K. Griggs
Inventor: Jacob B. Garland, Drew Lanham, Daryl Kip Watters, Marsal Gavalda, Mark Finlay, Kenneth K. Griggs
Abstract: In an aspect, in general, a method for aligning an audio recording and a transcript includes receiving a transcript including a plurality of terms, each term of the plurality of terms associated with a time location within a different version of the audio recording, forming a plurality of search terms from the terms of the transcript, determining possible time locations of the search terms in the audio recording, determining a correspondence between time locations within the different version of the audio recording associated with the search terms and the possible time locations of the search terms in the audio recording, and aligning the audio recording and the transcript including updating the time location associated with terms of the transcript based on the determined correspondence.
-
Publication No.: US20110216905A1
Publication date: 2011-09-08
Application No.: US12718114
Filing date: 2010-03-05
Applicant: Marsal Gavalda, Mark Finlay
Inventor: Marsal Gavalda, Mark Finlay
Abstract: Techniques implemented as systems, methods, and apparatuses, including computer program products, for logging multi-channel audio signals. The techniques include receiving a first audio input signal over a first audio channel and a second audio input signal over a second audio channel, the first audio channel and the second audio channel forming portions of a multi-channel call; generating supplemental information representative of characteristics of the first audio input signal, the second audio input signal, or both; after generating the supplemental information, combining the first audio input signal and the second audio input signal to form an audio output signal of a single-channel format; and storing the generated supplemental information in association with an identifier of the audio output signal, wherein at least a portion of the generated supplemental information is sufficient to enable information associated with the first audio input signal, the second audio input signal, or both to be derived from the audio output signal of the single-channel format.
-
Publication No.: US20090063151A1
Publication date: 2009-03-05
Application No.: US12199123
Filing date: 2008-08-27
Applicant: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
Inventor: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
IPC classification: G10L15/04
CPC classification: G06F17/30746, G10L15/26, G10L2015/025, G10L2015/088
Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are identified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.
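The inverted index mentioned in this abstract can be sketched as follows. This is an illustrative reading only: the abstract does not specify the index layout or how subword hits are combined, so the chaining rule (each unit must follow the previous one within `max_gap` seconds), the function names `build_index` and `locate_query`, and the sample detections are all assumptions.

```python
def build_index(detections):
    """Build an inverted index: subword unit -> sorted list of start times.

    detections: iterable of (unit, time) pairs, standing in for the
                wordspotter output on the predetermined subword units.
    """
    index = {}
    for unit, time in detections:
        index.setdefault(unit, []).append(time)
    for times in index.values():
        times.sort()
    return index


def locate_query(index, units, max_gap=0.5):
    """Return start times where the query's units occur in order.

    units:   the query's constituent subword units, in order.
    max_gap: assumed maximum delay (seconds) between consecutive units.
    """
    if not units:
        return []
    results = []
    for start in index.get(units[0], []):
        current = start
        for unit in units[1:]:
            following = [t for t in index.get(unit, [])
                         if current < t <= current + max_gap]
            if not following:
                break  # the chain is broken, so this start is rejected
            current = following[0]
        else:
            results.append(start)  # every unit was found in sequence
    return results


# Hypothetical detections of phoneme-pair units, with times in seconds.
detections = [("k ae", 1.0), ("ae t", 1.2), ("k ae", 5.0),
              ("ae t", 7.0), ("d ao", 3.0)]
index = build_index(detections)
matches = locate_query(index, ["k ae", "ae t"])
```

Because the index is built once, a new query costs only dictionary lookups and short list scans rather than another pass over the audio.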
-
Publication No.: US08311828B2
Publication date: 2012-11-13
Application No.: US12199123
Filing date: 2008-08-27
Applicant: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
Inventor: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
IPC classification: G10L15/04
CPC classification: G06F17/30746, G10L15/26, G10L2015/025, G10L2015/088
Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are identified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.