System and method of diarization and labeling of audio data

    公开(公告)号:US10720164B2

    公开(公告)日:2020-07-21

    申请号:US16702998

    申请日:2019-12-04

    IPC分类号: G10L15/26 G10L17/00 G10L17/02

    摘要: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Diarization using acoustic labeling

    公开(公告)号:US10650826B2

    公开(公告)日:2020-05-12

    申请号:US16594812

    申请日:2019-10-07

    IPC分类号: G10L17/02 G10L17/00

    摘要: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.

    DIARIZATION USING LINGUISTIC LABELING
    7.
    发明申请

    公开(公告)号:US20200105275A1

    公开(公告)日:2020-04-02

    申请号:US16702998

    申请日:2019-12-04

    IPC分类号: G10L17/00 G10L17/02

    摘要: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Diarization using textual and audio speaker labeling

    公开(公告)号:US10593332B2

    公开(公告)日:2020-03-17

    申请号:US16567446

    申请日:2019-09-11

    IPC分类号: G10L15/26 G10L17/00 G10L17/02

    摘要: Systems and methods diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    DIARIZATION USING LINGUISTIC LABELING
    9.
    发明申请

    公开(公告)号:US20190066691A1

    公开(公告)日:2019-02-28

    申请号:US16170289

    申请日:2018-10-25

    IPC分类号: G10L17/00 G10L17/02

    摘要: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcribed audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.