System and method of automated evaluation of transcription quality

    公开(公告)号:US10147418B2

    公开(公告)日:2018-12-04

    申请号:US15676306

    申请日:2017-08-14

    Inventor: Oana Sidi Ron Wein

    Abstract: Systems and methods automatedly evaluate a transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least conformity ratio is calculated from the at least one confusion network.

    SYSTEM AND METHOD OF AUTOMATED EVALUATION OF TRANSCRIPTION QUALITY
    4.
    发明申请
    SYSTEM AND METHOD OF AUTOMATED EVALUATION OF TRANSCRIPTION QUALITY 有权
    自动评估转录质量的系统与方法

    公开(公告)号:US20160365089A1

    公开(公告)日:2016-12-15

    申请号:US15180325

    申请日:2016-06-13

    Inventor: Oana Sidi Ron Wein

    CPC classification number: G10L15/01 G10L15/04 G10L15/12 G10L15/26

    Abstract: Systems and methods automatedly evaluate a transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least conformity ratio is calculated from the at least one contusion network.

    Abstract translation: 系统和方法自动评估转录质量。 获得音频数据。 音频数据被分割成具有在计算机处理器上操作的语音活动检测器的多个话语。 多个话语被转录成具有在处理器上操作的大词汇连续语音识别系统的至少一个单词格。 最小贝叶斯风险解码器被应用于至少一个单词网格以创建至少一个混淆网络。 至少一个挫伤网络计算出至少一致性比例。

    System and method of automated evaluation of transcription quality
    5.
    发明授权
    System and method of automated evaluation of transcription quality 有权
    自动评估转录质量的系统和方法

    公开(公告)号:US09368106B2

    公开(公告)日:2016-06-14

    申请号:US14319853

    申请日:2014-06-30

    Inventor: Oana Sidi Ron Wein

    CPC classification number: G10L15/01 G10L15/04 G10L15/12 G10L15/26

    Abstract: Systems and methods automatedly evaluate a transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least conformity ratio is calculated from the at least one confusion network.

    Abstract translation: 系统和方法自动评估转录质量。 获得音频数据。 音频数据被分割成具有在计算机处理器上操作的语音活动检测器的多个话语。 多个话语被转录成具有在处理器上操作的大词汇连续语音识别系统的至少一个单词格。 最小贝叶斯风险解码器被应用于至少一个单词网格以创建至少一个混淆网络。 至少一个混淆网络计算出至少一致性比例。

    Ontology expansion using entity-association rules and abstract relations

    公开(公告)号:US11030406B2

    公开(公告)日:2021-06-08

    申请号:US15007703

    申请日:2016-01-27

    Abstract: A method for expanding an initial ontology via processing of communication data, wherein the initial ontology is a structural representation of language elements comprising a set of entities, a set of terms, a set of term-entity associations, a set of entity-association rules, a set of abstract relations, and a set of relation instances. A method for extracting a set of significant phrases and a set of significant phrase co-occurrences from an input set of documents further includes utilizing the terms to identify relations within the training set of communication data, wherein a relation is a pair of terms that appear in proximity to one another.

    Voice activity detection using a soft decision mechanism

    公开(公告)号:US10665253B2

    公开(公告)日:2020-05-26

    申请号:US15959743

    申请日:2018-04-23

    Inventor: Ron Wein

    Abstract: Voice activity detection (VAD) is an enabling technology for a variety of speech based applications. Herein disclosed is a robust VAD algorithm that is also language independent. Rather than classifying short segments of the audio as either “speech” or “silence”, the VAD as disclosed herein employees a soft-decision mechanism. The VAD outputs a speech-presence probability, which is based on a variety of characteristics.

Patent Agency Ranking