Diarization using linguistic labeling

    Publication number: US11322154B2

    Publication date: 2022-05-03

    Application number: US16703099

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcribed customer service interaction.
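
The flow in this abstract (select transcripts with a heuristic, build a linguistic model from them, then label speakers in a new transcript) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the scripted-phrase heuristic, the add-one-smoothed unigram model, and the names `SCRIPT_PHRASES`, `looks_like_agent`, `build_unigram_model`, and `label_speakers`.

```python
import math
from collections import Counter

# Hypothetical heuristic: agent-side text tends to contain scripted phrases.
SCRIPT_PHRASES = ("thank you for calling", "how may i help")

def looks_like_agent(text):
    """Return True if the transcript contains any assumed script phrase."""
    lowered = text.lower()
    return any(phrase in lowered for phrase in SCRIPT_PHRASES)

def build_unigram_model(texts):
    """Build an add-one-smoothed unigram model; returns a log-probability function."""
    counts = Counter(w for t in texts for w in t.lower().split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot for unseen words

    def log_prob(word):
        return math.log((counts[word] + 1) / (total + vocab))

    return log_prob

def avg_log_likelihood(log_prob, text):
    """Average per-word log-probability of a text under the model."""
    words = text.lower().split()
    return sum(log_prob(w) for w in words) / max(len(words), 1)

def label_speakers(log_prob, sides):
    """Label the best-scoring speaker side 'agent' and every other side 'customer'."""
    agent = max(sides, key=lambda s: avg_log_likelihood(log_prob, sides[s]))
    return {s: ("agent" if s == agent else "customer") for s in sides}

# Toy diarized transcripts, one string per speaker side (made-up data).
diarized = [
    "thank you for calling acme how may i help you today",
    "my internet is down again and i am upset",
    "thank you for calling acme let me check your account",
    "i want a refund for my bill",
]
selected = [t for t in diarized if looks_like_agent(t)]
model = build_unigram_model(selected)
labels = label_speakers(model, {
    "spk0": "i am upset about my bill",
    "spk1": "thank you let me check your account",
})
print(labels)
```

A real system would likely use richer heuristics and language models; the sketch only shows how the three stages feed one another.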

    DIARIZATION USING LINGUISTIC LABELING
    Patent application

    Publication number: US20200035245A1

    Publication date: 2020-01-30

    Application number: US16587518

    Filing date: 2019-09-30

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Speech analytics system and system and method for determining structured speech
    Granted patent; status: in force

    Publication number: US09401145B1

    Publication date: 2016-07-26

    Application number: US14270280

    Filing date: 2014-05-05

    CPC classification number: G10L15/20 G10L15/19 G10L15/197 G10L15/26

    Abstract: A method for converting speech to text in a speech analytics system is provided. The method includes receiving audio data containing speech made up of sounds from an audio source, processing the sounds with a phonetic module resulting in symbols corresponding to the sounds, and processing the symbols with a language module and occurrence table resulting in text. The method also includes determining a probability of correct translation for each word in the text, comparing the probability of correct translation for each word in the text to the occurrence table, and adjusting the occurrence table based on the probability of correct translation for each word in the text.
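
A rough Python sketch of the feedback loop described in this abstract follows: symbols are mapped to words using an occurrence table, each word gets a probability, and the table is then adjusted. The abstract does not specify the data structures or the update rule, so the `LEXICON` mapping, the count-based probability, and the threshold rule in `adjust` are all assumptions chosen for illustration.

```python
# Hypothetical mapping from phonetic symbol groups to candidate words.
# It stands in for the language module, which the abstract does not detail.
LEXICON = {
    "T UW": ["two", "too", "to"],
    "B IY": ["be", "bee"],
}

def decode(symbol_groups, occurrence):
    """Pick the most frequent candidate per symbol group.

    Returns (word, probability) pairs, where probability is the chosen
    word's smoothed share of the counts among its candidates.
    """
    result = []
    for group in symbol_groups:
        candidates = LEXICON[group]
        weights = [occurrence.get(w, 0) + 1 for w in candidates]
        best = max(range(len(candidates)), key=weights.__getitem__)
        result.append((candidates[best], weights[best] / sum(weights)))
    return result

def adjust(occurrence, decoded, threshold=0.5):
    """Assumed update rule: increment counts of words decoded with high probability."""
    for word, prob in decoded:
        if prob >= threshold:
            occurrence[word] = occurrence.get(word, 0) + 1
    return occurrence

# Toy occurrence table (made-up counts).
occurrence = {"to": 8, "two": 1, "be": 3}
decoded = decode(["T UW", "B IY"], occurrence)
adjust(occurrence, decoded)
print(decoded, occurrence)
```

Over repeated passes, frequently confirmed words gain weight in the table, which is one plausible reading of "adjusting the occurrence table based on the probability of correct translation".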

    System and Method of Automated Model Adaptation
    Patent application; status: in force

    Publication number: US20150066502A1

    Publication date: 2015-03-05

    Application number: US14291893

    Filing date: 2014-05-30

    Abstract: Methods, systems, and computer readable media for automated transcription model adaptation include obtaining audio data from a plurality of audio files. The audio data is transcribed to produce at least one audio file transcription which represents a plurality of transcription alternatives for each audio file. Speech analytics are applied to each audio file transcription. A best transcription is selected from the plurality of transcription alternatives for each audio file. Statistics from the selected best transcription are calculated. An adapted model is created from the calculated statistics.
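
The adaptation steps in this abstract (pick a best transcription per file, compute statistics, build an adapted model) might look like the following Python sketch. The scoring of alternatives, the use of unigram relative frequencies as the "statistics", and linear interpolation as the adaptation rule are assumptions for illustration; the abstract does not name any of them.

```python
from collections import Counter

def select_best(alternatives):
    """alternatives: list of (text, score) pairs; a higher score is assumed better."""
    return max(alternatives, key=lambda alt: alt[1])[0]

def word_statistics(transcripts):
    """Relative word frequencies over the selected transcripts."""
    counts = Counter(w for t in transcripts for w in t.split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def adapt(base_model, stats, weight=0.5):
    """Assumed adaptation rule: linear interpolation of base model and new statistics."""
    words = set(base_model) | set(stats)
    return {
        w: (1 - weight) * base_model.get(w, 0.0) + weight * stats.get(w, 0.0)
        for w in words
    }

# Toy transcription alternatives per audio file (made-up data and scores).
files = {
    "a.wav": [("cancel my account", 0.9), ("cancel my a count", 0.4)],
    "b.wav": [("close my account", 0.7), ("clothes my account", 0.2)],
}
best = [select_best(alts) for alts in files.values()]
stats = word_statistics(best)
base = {"my": 0.5, "account": 0.25, "hello": 0.25}
adapted = adapt(base, stats)
print(best, adapted)
```

Because both inputs are probability distributions, the interpolated model still sums to one, and words seen only in the new audio (here "cancel" and "close") receive nonzero probability.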

    Diarization Using Linguistic Labeling
    Patent application; status: under examination (published)

    Publication number: US20140142940A1

    Publication date: 2014-05-22

    Application number: US14084976

    Filing date: 2013-11-20

    CPC classification number: G10L17/005 G10L17/02

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    Diarization using linguistic labeling with segmented and clustered diarized textual transcripts

    Publication number: US10950241B2

    Publication date: 2021-03-16

    Application number: US16703206

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

    System and method of diarization and labeling of audio data

    Publication number: US10902856B2

    Publication date: 2021-01-26

    Application number: US16703245

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
