Blind Diarization of Recorded Calls with Arbitrary Number of Speakers
    14.
    发明申请
    Blind Diarization of Recorded Calls with Arbitrary Number of Speakers 有权
    用任意数量的演讲者打电话的盲目化

    公开(公告)号:US20150025887A1

    公开(公告)日:2015-01-22

    申请号:US14319860

    申请日:2014-06-30

    Inventor: Oana Sidi Ron Wein

    Abstract: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.

    Abstract translation: 在音频数据的分类方法中,将音频数据分割为多个话语。 每个话语被表示为代表多个特征向量的话语模型。 话语模型是聚类的。 从群集话语模型构建多个说话者模型。 由多个扬声器模型构成隐马尔可夫模型。 已识别的扬声器模型的序列被解码。

    ONTOLOGY EXPANSION USING ENTITY-ASSOCIATION RULES AND ABSTRACT RELATIONS

    公开(公告)号:US20210224483A1

    公开(公告)日:2021-07-22

    申请号:US17225589

    申请日:2021-04-08

    Abstract: A method for expanding an initial ontology via processing of communication data, wherein the initial ontology is a structural representation of language elements comprising a set of entities, a set of terms, a set of term-entity associations, a set of entity-association rules, a set of abstract relations, and a set of relation instances. A method for extracting a set of significant phrases and a set of significant phrase co-occurrences from an input set of documents further includes utilizing the terms to identify relations within the training set of communication data, wherein a relation is a pair of terms that appear in proximity to one another.

Patent Agency Ranking