SPEAKER SEPARATION IN DIARIZATION
    21.
    发明申请
    SPEAKER SEPARATION IN DIARIZATION 有权
    扬声器分离

    公开(公告)号:US20160343373A1

    公开(公告)日:2016-11-24

    申请号:US15158959

    申请日:2016-05-19

    CPC classification number: G10L15/26 G10L17/06 G10L25/51 G10L25/78 G10L2025/783

    Abstract: The system and method of separating speakers in an audio file including obtaining an audio file. The audio file is transcribed into at least one text file by a transcription server. Homogenous speech segments are identified within the at least one text file. The audio file is segmented into homogenous audio segments that correspond to the identified homogenous speech segments. The homogenous audio segments of the audio file are separated into a first speaker audio file and second speaker audio file the first speaker audio file and the second speaker audio file are transcribed to produce a diarized transcript.

    Abstract translation: 分离音频文件中的扬声器的系统和方法,包括获取音频文件。 音频文件由转录服务器转录成至少一个文本文件。 在至少一个文本文件内识别均匀的语音段。 音频文件被分割成与所识别的同源语音片段对应的同质音频段。 音频文件的同质音频片段被分成第一扬声器音频文件和第二扬声器音频文件,第一扬声器音频文件和第二扬声器音频文件被转录以产生经过缩写的转录。

    IDENTIFICATION OF SIGNIFICANT PHRASES USING MULTIPLE LANGUAGE MODELS
    23.
    发明申请
    IDENTIFICATION OF SIGNIFICANT PHRASES USING MULTIPLE LANGUAGE MODELS 审中-公开
    使用多种语言模型识别重要的语法

    公开(公告)号:US20160217127A1

    公开(公告)日:2016-07-28

    申请号:US15007699

    申请日:2016-01-27

    CPC classification number: G06F17/2775 G06F16/367 G06F17/277

    Abstract: A method for expanding an initial ontology via processing of communication data, wherein the initial ontology is a structural representation of language elements comprising a set of entities, a set of terms, a set of term-entity associations, a set of entity-association rules, a set of abstract relations, and a set of relation instances. A method for extracting a set of significant phrases and a set of significant phrase co-occurrences from an input set of documents further includes utilizing the terms to identify relations within the training set of communication data, wherein a relation is a pair of terms that appear in proximity to one another.

    Abstract translation: 一种用于通过处理通信数据来扩展初始本体的方法,其中初始本体是包括一组实体的语言元素的结构表示,一组术语,一组术语 - 实体关联,一组实体关联规则 ,一组抽象关系,以及一组关系实例。 一种用于从输入文档集中提取一组重要短语和一组重要短语共同出现的方法还包括利用这些术语来识别通信数据训练集内的关系,其中关系是出现的一对术语 彼此接近。

    System and Method of Automated Evaluation of Transcription Quality
    24.
    发明申请
    System and Method of Automated Evaluation of Transcription Quality 有权
    自动评估转录质量的系统和方法

    公开(公告)号:US20150039306A1

    公开(公告)日:2015-02-05

    申请号:US14319853

    申请日:2014-06-30

    Inventor: Oana Sidi Ron Wein

    CPC classification number: G10L15/01 G10L15/04 G10L15/12 G10L15/26

    Abstract: Systems and methods automatedly evaluate a transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least conformity ratio is calculated from the at least one confusion network.

    Abstract translation: 系统和方法自动评估转录质量。 获得音频数据。 音频数据被分割成具有在计算机处理器上操作的语音活动检测器的多个话语。 多个话语被转录成具有在处理器上操作的大词汇连续语音识别系统的至少一个单词格。 最小贝叶斯风险解码器被应用于至少一个单词网格以创建至少一个混淆网络。 至少一个混淆网络计算出至少一致性比例。

    Speaker Separation in Diarization
    25.
    发明申请
    Speaker Separation in Diarization 有权
    演讲者分离在Diarization

    公开(公告)号:US20140074467A1

    公开(公告)日:2014-03-13

    申请号:US14016783

    申请日:2013-09-03

    CPC classification number: G10L15/26 G10L17/06 G10L25/51 G10L25/78 G10L2025/783

    Abstract: The system and method of separating speakers in an audio file including obtaining an audio file. The audio file is transcribed into at least one text file by a transcription server. Homogenous speech segments are identified within the at least one text file. The audio file is segmented into homogenous audio segments that correspond to the identified homogenous speech segments. The homogenous audio segments of the audio file are separated into a first speaker audio file and second speaker audio file the first speaker audio file and the second speaker audio file are transcribed to produce a diarized transcript.

    Abstract translation: 分离音频文件中的扬声器的系统和方法,包括获取音频文件。 音频文件由转录服务器转录成至少一个文本文件。 在至少一个文本文件内识别均匀的语音段。 音频文件被分割成与所识别的同源语音片段对应的同质音频段。 音频文件的同质音频片段被分成第一扬声器音频文件和第二扬声器音频文件,第一扬声器音频文件和第二扬声器音频文件被转录以产生经过缩小的转录。

    Voice activity detection using a soft decision mechanism

    公开(公告)号:US11670325B2

    公开(公告)日:2023-06-06

    申请号:US16880560

    申请日:2020-05-21

    Inventor: Ron Wein

    CPC classification number: G10L25/78

    Abstract: Voice activity detection (VAD) is an enabling technology for a variety of speech based applications. Herein disclosed is a robust VAD algorithm that is also language independent. Rather than classifying short segments of the audio as either “speech” or “silence”, the VAD as disclosed herein employees a soft-decision mechanism. The VAD outputs a speech-presence probability, which is based on a variety of characteristics.

    Ontology expansion using entity-association rules and abstract relations

    公开(公告)号:US11663411B2

    公开(公告)日:2023-05-30

    申请号:US17225589

    申请日:2021-04-08

    CPC classification number: G06F40/289 G06F16/367 G06F40/284

    Abstract: A method for expanding an initial ontology via processing of communication data, wherein the initial ontology is a structural representation of language elements comprising a set of entities, a set of terms, a set of term-entity associations, a set of entity-association rules, a set of abstract relations, and a set of relation instances. A method for extracting a set of significant phrases and a set of significant phrase co-occurrences from an input set of documents further includes utilizing the terms to identify relations within the training set of communication data, wherein a relation is a pair of terms that appear in proximity to one another.

Patent Agency Ranking