Diarization using linguistic labeling with segmented and clustered diarized textual transcripts

    Publication No.: US10950241B2

    Publication date: 2021-03-16

    Application No.: US16703206

    Filing date: 2019-12-04

    IPC classification: G10L17/02 G10L17/00

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
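
The pipeline in this abstract (heuristic selection, model building, labeling) can be illustrated with a minimal sketch. It is not the patented method: the cue phrases, the add-one smoothed unigram model, and all function names below are assumptions chosen only to make the three steps concrete.

```python
import math
from collections import Counter

# Hypothetical heuristic cues: agents often open with a scripted greeting.
AGENT_CUES = ("thank you for calling", "how may i help")

def select_agent_texts(diarized_transcripts):
    """Heuristic step: keep speaker texts that contain an agent cue phrase."""
    selected = []
    for transcript in diarized_transcripts:
        for text in transcript.values():
            if any(cue in text.lower() for cue in AGENT_CUES):
                selected.append(text)
    return selected

def build_model(texts):
    """Model step: add-one smoothed unigram log-probabilities (an assumed model)."""
    counts = Counter(word for t in texts for word in t.lower().split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot for unseen words

    def log_prob(word):
        return math.log((counts[word] + 1) / (total + vocab))

    return log_prob

def label_agent(transcript, log_prob):
    """Labeling step: pick the speaker whose text best fits the agent model."""
    def score(text):
        words = text.lower().split()
        return sum(log_prob(w) for w in words) / max(len(words), 1)

    return max(transcript, key=lambda spk: score(transcript[spk]))

# Toy data: each transcript maps an anonymous speaker ID to that speaker's text.
training = [
    {"spk0": "Thank you for calling Acme support, how may I help you today",
     "spk1": "my router keeps dropping the connection"},
    {"spk0": "hi I want to cancel my order",
     "spk1": "Thank you for calling Acme support, let me check your account"},
]
model = build_model(select_agent_texts(training))
new_call = {"spk0": "um my internet bill looks wrong",
            "spk1": "let me check your account and help you today"}
print(label_agent(new_call, model))  # prints "spk1"
```

A production system would likely use richer linguistic features than unigram counts; the sketch only shows how a model trained on heuristically selected transcripts can label a new, unlabeled one.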

    System and method of diarization and labeling of audio data

    Publication No.: US10902856B2

    Publication date: 2021-01-26

    Application No.: US16703245

    Filing date: 2019-12-04

    IPC classification: G10L17/02 G10L17/00

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    DIARIZATION USING ACOUSTIC LABELING
    Patent application

    Publication No.: US20200035246A1

    Publication date: 2020-01-30

    Application No.: US16594812

    Filing date: 2019-10-07

    IPC classification: G10L17/00 G10L17/02

    Abstract: Systems and methods of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated with an identified speaker. Metadata associated with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.
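
As a rough illustration of the flow described in this abstract, the sketch below builds one voiceprint per speaker from previously analyzed files, selects a voiceprint from file metadata, and flags matching segments. The toy feature vectors, the `agent_id` metadata key, the averaging step, and the cosine threshold are all assumptions; the abstract does not specify how voiceprints are computed or compared.

```python
import math

def mean_vector(vectors):
    """Average equal-length feature vectors component by component."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def build_voiceprints(analyzed_files):
    """analyzed_files: (speaker_id, feature_vector) pairs from earlier audio files."""
    by_speaker = {}
    for speaker_id, vec in analyzed_files:
        by_speaker.setdefault(speaker_id, []).append(vec)
    return {spk: mean_vector(vecs) for spk, vecs in by_speaker.items()}

def label_segments(segments, metadata, voiceprints, threshold=0.9):
    """Select the voiceprint named in the metadata and flag matching segments."""
    voiceprint = voiceprints[metadata["agent_id"]]  # hypothetical metadata key
    return [cosine(seg, voiceprint) >= threshold for seg in segments]

# Toy 3-dimensional vectors stand in for real acoustic features.
voiceprints = build_voiceprints([
    ("agent_7", [1.0, 0.0, 0.2]),
    ("agent_7", [0.8, 0.1, 0.2]),
    ("agent_9", [0.0, 1.0, 0.5]),
])
segments = [[0.9, 0.0, 0.25], [0.1, 0.9, 0.4]]
print(label_segments(segments, {"agent_id": "agent_7"}, voiceprints))
# prints [True, False]
```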

    DIARIZATION USING LINGUISTIC LABELING
    Patent application

    Publication No.: US20200005796A1

    Publication date: 2020-01-02

    Application No.: US16567446

    Filing date: 2019-09-11

    IPC classification: G10L17/00 G10L17/02

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Diarization using linguistic labeling

    Publication No.: US10134401B2

    Publication date: 2018-11-20

    Application No.: US14084976

    Filing date: 2013-11-20

    IPC classification: G10L15/26 G10L17/00 G10L17/02

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Funnel Analysis
    Patent application

    Publication No.: US20170200167A1

    Publication date: 2017-07-13

    Application No.: US15409921

    Filing date: 2017-01-19

    IPC classification: G06Q30/00 G06N99/00

    Abstract: Systems, methods, and media for the application of funnel analysis using desktop analytics and textual analytics to map and analyze the flow of customer service interactions. In an example implementation, the method includes: defining at least one flow that is representative of a series of events comprising at least one speech event, at least one Data Processing Activity (DPA) event, and at least one Computer Telephone Integration (CTI) event; receiving customer service interaction data comprising communication data, DPA metadata, and CTI metadata; applying the at least one flow to the customer service interaction data; determining if the customer service interaction data meets the at least one flow; and producing an automated indication based upon the determination.
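
One simple way to read "applying a flow" is as an ordered-subsequence check over the interaction's events. The sketch below takes that reading as an assumption; the event names, the `(type, name)` tuple representation, and the textual indication are illustrative and not taken from the patent.

```python
from typing import List, Tuple

# (event_type, event_name); the three types follow the abstract: speech, DPA, CTI.
Event = Tuple[str, str]

def meets_flow(flow: List[Event], interaction: List[Event]) -> bool:
    """True if every flow step occurs in the interaction, in order (gaps allowed)."""
    remaining = iter(interaction)
    # `step in remaining` advances the iterator, so later steps must appear later.
    return all(step in remaining for step in flow)

def indicate(flow: List[Event], interaction: List[Event]) -> str:
    """Produce a simple automated indication from the determination."""
    return "flow met" if meets_flow(flow, interaction) else "flow not met"

flow = [
    ("CTI", "call_start"),
    ("speech", "greeting"),
    ("DPA", "open_account_screen"),
]
interaction = [
    ("CTI", "call_start"),
    ("speech", "greeting"),
    ("speech", "complaint"),
    ("DPA", "open_account_screen"),
    ("CTI", "call_end"),
]
print(indicate(flow, interaction))                   # prints "flow met"
print(indicate(flow, list(reversed(interaction))))   # prints "flow not met"
```

Allowing gaps between steps is a design choice of this sketch; a stricter variant could require the steps to be adjacent or to occur within a time window.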

    System and method of automated language model adaptation
    Granted patent (in force)

    Publication No.: US09508346B2

    Publication date: 2016-11-29

    Application No.: US14291895

    Filing date: 2014-05-30

    Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from the plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription. The language model is modified based on the calculated statistics.
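
The adaptation loop in this abstract (select the best transcriptions, compute statistics, modify the model) can be sketched as follows. This is an assumption-laden illustration: the quality threshold, the use of relative word frequencies as the "statistics", the unigram dictionary standing in for a language model, and the interpolation weight are all hypothetical.

```python
from collections import Counter

def select_best(transcriptions, min_quality=0.8):
    """transcriptions: (text, quality) pairs; keep texts at or above the threshold."""
    return [text for text, quality in transcriptions if quality >= min_quality]

def word_statistics(texts):
    """Relative word frequencies over the selected transcriptions."""
    counts = Counter(w for t in texts for w in t.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def adapt(language_model, stats, weight=0.5):
    """Linearly interpolate the existing unigram probabilities with the new statistics."""
    words = set(language_model) | set(stats)
    return {
        w: (1 - weight) * language_model.get(w, 0.0) + weight * stats.get(w, 0.0)
        for w in words
    }

# Toy unigram "language model" and scored transcriptions.
base_model = {"account": 0.5, "hello": 0.5}
transcriptions = [("refund my account", 0.9), ("blah mumble", 0.3)]

best = select_best(transcriptions)
adapted = adapt(base_model, word_statistics(best))
print(sorted(adapted))  # prints ['account', 'hello', 'my', 'refund']
```

Because both inputs are probability distributions, the interpolated model still sums to one, and words seen only in low-quality transcriptions never enter it.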
