APPARATUS AND METHOD FOR ENHANCED SPEECH RECOGNITION
    1.
    发明申请
    APPARATUS AND METHOD FOR ENHANCED SPEECH RECOGNITION 审中-公开
    用于增强语音识别的装置和方法

    公开(公告)号:US20110004473A1

    公开(公告)日:2011-01-06

    申请号:US12497718

    申请日:2009-07-06

    CPC分类号: G10L15/02 G10L2015/025

    摘要: A method and apparatus for improving speech recognition results for an audio signal captured within an organization, comprising: receiving the audio signal captured by a capturing or logging device; extracting a phonetic feature and an acoustic feature from the audio signal; decoding the phonetic feature into a phonetic searchable structure; storing the phonetic searchable structure and the acoustic feature in an index; performing phonetic search for a word or a phrase in the phonetic searchable structure to obtain a result; activating an audio analysis engine which receives the acoustic feature to validate the result and obtain an enhanced result.

    摘要翻译: 一种用于改善在组织内捕获的音频信号的语音识别结果的方法和装置,包括:接收由捕获或记录装置捕获的音频信号; 从音频信号中提取语音特征和声学特征; 将语音特征解码成语音可搜索结构; 将语音可搜索结构和声学特征存储在索引中; 在语音搜索结构中执行语音搜索单词或短语以获得结果; 激活音频分析引擎,其接收声学特征以验证结果并获得增强的结果。

    Method and apparatus for large vocabulary continuous speech recognition using a hybrid phoneme-word lattice
    4.
    发明授权
    Method and apparatus for large vocabulary continuous speech recognition using a hybrid phoneme-word lattice 有权
    使用混合音素词格的大词汇连续语音识别的方法和装置

    公开(公告)号:US08831947B2

    公开(公告)日:2014-09-09

    申请号:US12941057

    申请日:2010-11-07

    CPC分类号: G10L15/08

    摘要: A method and apparatus combining the advantages of phonetic search such as the rapid implementation and deployment and medium accuracy, comprising steps and components for receiving the audio signal captured in the call center environment, extracting a multiplicity of feature vectors from the audio signal, creating a phoneme lattice from the multiplicity of feature vectors wherein the phoneme lattice comprising one or more allophone and each allophone comprising two or more phonemes, creating a hybrid phoneme-word lattice from the phoneme lattice and extracting the word by analyzing the hybrid phoneme-Word lattice.

    摘要翻译: 一种结合语音搜索的优点的方法和装置,例如快速实现和部署以及中等精度,包括用于接收在呼叫中心环境中捕获的音频信号的步骤和组件,从音频信号中提取多个特征向量,创建一个 从多个特征向量的音素格子中,其中包括一个或多个异音素的音素晶格和包含两个或更多个音素的每个异音素,从音素晶格产生混合的音素单元格,并通过分析混合的音素-Word晶格提取单词。

    Method and apparatus for interaction or discourse analytics
    5.
    发明授权
    Method and apparatus for interaction or discourse analytics 有权
    用于交互或话语分析的方法和装置

    公开(公告)号:US08676586B2

    公开(公告)日:2014-03-18

    申请号:US12211112

    申请日:2008-09-16

    IPC分类号: G10L21/00

    摘要: A method and apparatus for analyzing and segmenting a vocal interaction captured in a test audio source, the test audio source captured within an environment. The method and apparatus first use text and acoustic features extracted from the interaction with tagging information, for constructing a model. Then, at production time, text and acoustic features are extracted from the interactions, and by applying the model, tagging information is retrieved for the interaction, enabling analysis, flow visualization or further processing of the interaction.

    摘要翻译: 一种用于分析和分割在测试音频源中捕获的声音交互的方法和装置,在环境中捕获的测试音频源。 该方法和装置首先使用从与标签信息的交互提取的文本和声学特征,用于构建模型。 然后,在生产时间,从交互中提取文本和声学特征,并通过应用模型,检索标签信息进行交互,实现分析,流程可视化或进一步处理交互。

    Methods and apparatus for language identification
    6.
    发明授权
    Methods and apparatus for language identification 有权
    语言识别的方法和装置

    公开(公告)号:US08311824B2

    公开(公告)日:2012-11-13

    申请号:US12258463

    申请日:2008-10-27

    IPC分类号: G10L15/26

    CPC分类号: G10L15/005

    摘要: In a multi-lingual environment, a method and apparatus for determining a language spoken in a speech utterance. The method and apparatus test acoustic feature vectors extracted from the utterances against acoustic models associated with one or more of the languages. Speech to text is then performed for the language indicated by the acoustic testing, followed by textual verification of the resulting text. During verification, the resulting text is processed by language specific NLP and verified against textual models associated with the language. The system is self-learning, i.e., once a language is verified or rejected, the relevant feature vectors are used for enhancing one or more acoustic models associated with one or more languages, so that acoustic determination may improve.

    摘要翻译: 在多语言环境中,一种用于确定语音话语中所说的语言的方法和装置。 所述方法和装置测试从与一种或多种语言相关联的声学模型的话语中提取的声学特征向量。 然后对由声学测试指示的语言执行到文本的语音,然后对所得到的文本进行文本验证。 在验证期间,生成的文本由语言特定的NLP处理,并针对与该语言相关联的文本模型进行验证。 该系统是自学习的,即,一旦语言被验证或拒绝,相关的特征向量被用于增强与一种或多种语言相关联的一个或多个声学模型,使得声学测定可以改善。

    METHODS AND APPARATUS FOR ENHANCING SPEECH ANALYTICS
    7.
    发明申请
    METHODS AND APPARATUS FOR ENHANCING SPEECH ANALYTICS 有权
    用于增强语音分析的方法和装置

    公开(公告)号:US20090292541A1

    公开(公告)日:2009-11-26

    申请号:US12126884

    申请日:2008-05-25

    IPC分类号: G10L15/04

    摘要: Methods and apparatus for the enhancement of speech to text engines, by providing indications to the correctness of the found words, based on additional sources besides the internal indication provided by the STT engine. The enhanced indications comprise sources of data such as acoustic features, CTI features, phonetic search and others. The apparatus and methods also enable the detection of important or significant keywords found in audio files, thus enabling more efficient usages, such as further processing or transfer of interactions to relevant agents, escalation of issues, or the like. The methods and apparatus employ a training phase in which word model and key phrase model are generated for determining an enhanced correctness indication for a word and an enhanced importance indication for a key phrase, based on the additional features.

    摘要翻译: 除了STT引擎提供的内部指示之外,还可以通过提供对发现的词语的正确性的指示来增强语言到文本引擎的方法和装置。 增强的指示包括诸如声学特征,CTI特征,语音搜索等的数据源。 该装置和方法还能够检测在音频文件中发现的重要或重要的关键词,从而实现更有效的用途,例如进一步处理或转移相关代理人的交互,升级问题等。 所述方法和装置采用训练阶段,其中基于附加特征,生成词模型和密钥短语模型,用于确定词的增强正确性指示和关键短语的增强重要性指示。

    Apparatus and method for fraud prevention
    8.
    发明授权
    Apparatus and method for fraud prevention 有权
    预防欺诈的装置和方法

    公开(公告)号:US08145562B2

    公开(公告)日:2012-03-27

    申请号:US12399999

    申请日:2009-03-09

    IPC分类号: G06Q40/00

    摘要: The disclosed method and apparatus combine interactions and transactions in order to detect fraud acts or fraud attempts. In one embodiment, one or more interactions is correlated with one or more transactions, the interactions is and transactions features are combined, and features are extracted from the combined structure. The features are compared against one or more profiles, and a combined risk score is determined for the interactions or transactions. If the risk score exceeds a predetermined threshold, a preventive/corrective action can be taken.In another embodiment, behavioral characteristics extracted from one or more interactions associated with a transaction, with a risk score obtained by analyzing the transaction. The behavioral characteristic are used to enhance suspicion level related to a transaction being fraudulent, and to enable the taking of measures related to the transaction or to the person handling the transaction. The combination thus enables better assessment whether a particular interaction or transaction is fraudulent, and therefore provides for better detection or prevention of such activities. In addition, making the fraud assessment more reliable enables more efficient resource allocation of personnel for monitoring the transactions and interactions, better usage of communication time by avoiding lengthy identification where not required, and generally higher efficiency.

    摘要翻译: 所公开的方法和装置结合了交互和交易以便检测欺诈行为或欺诈尝试。 在一个实施例中,一个或多个交互与一个或多个交易相关联,交互是并且交易特征被组合,并且从组合结构中提取特征。 将功能与一个或多个配置文件进行比较,并确定交互或交易的组合风险分数。 如果风险分数超过预定阈值,则可采取预防/纠正措施。 在另一个实施例中,从与交易相关联的一个或多个交互中提取的行为特征,具有通过分析交易获得的风险评分。 行为特征用于提高与欺诈交易相关的怀疑水平,并且能够采取与交易相关的措施或处理交易的人。 因此,组合能够更好地评估特定交互或交易是否是欺诈性的,因此提供更好的检测或预防这些活动。 此外,使欺诈评估更可靠,可以更有效地进行人员资源分配,以监控交易和交互,避免通信时间的长时间避免冗长的识别,而且通常效率更高。

    Enhancing analysis of test key phrases from acoustic sources with key phrase training models
    10.
    发明授权
    Enhancing analysis of test key phrases from acoustic sources with key phrase training models 有权
    用关键短语训练模型加强声源测试关键词的分析

    公开(公告)号:US08145482B2

    公开(公告)日:2012-03-27

    申请号:US12126884

    申请日:2008-05-25

    IPC分类号: G10L15/06 G10L15/18

    摘要: Methods and apparatus for the enhancement of speech to text engines, by providing indications to the correctness of the found words, based on additional sources besides the internal indication provided by the STT engine. The enhanced indications comprise sources of data such as acoustic features, CTI features, phonetic search and others. The apparatus and methods also enable the detection of important or significant keywords found in audio files, thus enabling more efficient usages, such as further processing or transfer of interactions to relevant agents, escalation of issues, or the like. The methods and apparatus employ a training phase in which word model and key phrase model are generated for determining an enhanced correctness indication for a word and an enhanced importance indication for a key phrase, based on the additional features.

    摘要翻译: 除了STT引擎提供的内部指示之外,还可以通过提供对发现的词语的正确性的指示来增强语言到文本引擎的方法和装置。 增强的指示包括诸如声学特征,CTI特征,语音搜索等的数据源。 该装置和方法还能够检测在音频文件中发现的重要或重要的关键词,从而实现更有效的用途,例如进一步处理或转移相关代理人的交互,升级问题等。 所述方法和装置采用训练阶段,其中基于附加特征,生成词模型和密钥短语模型,用于确定词的增强正确性指示和关键短语的增强重要性指示。