ACOUSTIC SIGNATURE BUILDING FOR A SPEAKER FROM MULTIPLE SESSIONS
    33.
    Type: invention application
    Legal status: in force

    Publication number: US20160217793A1

    Publication date: 2016-07-28

    Application number: US15006575

    Filing date: 2016-01-26

    CPC classification number: G10L17/04 G10L15/26 G10L17/02 G10L17/16 G10L25/84

    Abstract: Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first-pass blind diarization is on a per-frame basis and the second-pass blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.

    Speaker separation in diarization
    34.
    Type: granted invention patent
    Legal status: in force

    Publication number: US09368116B2

    Publication date: 2016-06-14

    Application number: US14016783

    Filing date: 2013-09-03

    CPC classification number: G10L15/26 G10L17/06 G10L25/51 G10L25/78 G10L2025/783

    Abstract: A system and method of separating speakers in an audio file includes obtaining an audio file. The audio file is transcribed into at least one text file by a transcription server. Homogeneous speech segments are identified within the at least one text file. The audio file is segmented into homogeneous audio segments that correspond to the identified homogeneous speech segments. The homogeneous audio segments of the audio file are separated into a first speaker audio file and a second speaker audio file. The first speaker audio file and the second speaker audio file are transcribed to produce a diarized transcript.

    System and Method of Automated Language Model Adaptation
    35.
    Type: invention application
    Legal status: in force

    Publication number: US20150066503A1

    Publication date: 2015-03-05

    Application number: US14291895

    Filing date: 2014-05-30

    Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from a plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription from the plurality of audio file transcriptions. The language model is modified from the calculated statistics.
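    The select, count, and modify steps in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say what kind of language model or quality measure is used, so the unigram model, the externally supplied quality scores, the linear interpolation, and the names `adapt_language_model`, `top_k`, and `weight` are all hypothetical.

```python
from collections import Counter


def adapt_language_model(base_probs, transcriptions, top_k=1, weight=0.5):
    """Interpolate a unigram model with statistics from the best transcriptions.

    base_probs: dict mapping word -> probability (hypothetical unigram model).
    transcriptions: list of (text, quality_score) pairs; the score is assumed
        to come from a separate evaluation step not shown here.
    """
    # Select the best transcriptions by evaluated quality.
    best = sorted(transcriptions, key=lambda t: t[1], reverse=True)[:top_k]

    # Calculate word statistics from the selected transcriptions only.
    counts = Counter(word for text, _ in best for word in text.lower().split())
    total = sum(counts.values())
    if total == 0:
        return dict(base_probs)

    # Modify the model: linear interpolation of old and new estimates
    # (an assumed update rule, chosen for simplicity).
    vocab = set(base_probs) | set(counts)
    return {
        w: (1 - weight) * base_probs.get(w, 0.0) + weight * counts[w] / total
        for w in vocab
    }


base = {"hello": 0.5, "world": 0.5}
scored = [("hello hello agent", 0.9), ("garbled text", 0.2)]
adapted = adapt_language_model(base, scored, top_k=1, weight=0.5)
```

    Only the highest-scoring transcription contributes here, so words from the low-quality one never enter the adapted model.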

    System and method of diarization and labeling of audio data

    Publication number: US11380333B2

    Publication date: 2022-07-05

    Application number: US16703143

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatically applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcribed audio data to label a portion of the transcribed audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcribed customer service interaction.
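    A minimal sketch of the flow described in this abstract, for the agent versus customer case, might look like the following. The abstract does not specify the heuristic or the form of the linguistic model, so the scripted-greeting cue, the add-one smoothed unigram counts, and every function name below are assumptions made for illustration only.

```python
import math
from collections import Counter


def select_by_heuristic(speaker_texts, cue="thank you for calling"):
    """Hypothetical heuristic: agents tend to use a scripted greeting."""
    return [t for t in speaker_texts if cue in t.lower()]


def build_model(texts):
    """Word counts as a minimal stand-in for a linguistic model."""
    return Counter(w for t in texts for w in t.lower().split())


def log_likelihood(text, model, vocab_size):
    """Add-one smoothed unigram log-likelihood of text under model."""
    total = sum(model.values())
    return sum(
        math.log((model[w] + 1) / (total + vocab_size))
        for w in text.lower().split()
    )


def label_speaker(text, agent_model, other_model):
    """Label text by whichever model assigns it the higher likelihood."""
    vocab_size = len(set(agent_model) | set(other_model))
    agent_score = log_likelihood(text, agent_model, vocab_size)
    other_score = log_likelihood(text, other_model, vocab_size)
    return "agent" if agent_score > other_score else "customer"


speaker_texts = [
    "Thank you for calling support how may I help you today",
    "my internet is broken and I want a refund",
    "Thank you for calling is there anything else I may help you with",
    "I want my bill fixed my internet is slow",
]
agent_texts = select_by_heuristic(speaker_texts)
other_texts = [t for t in speaker_texts if t not in agent_texts]
agent_model = build_model(agent_texts)
other_model = build_model(other_texts)
```

    With these toy transcripts, `label_speaker("how may I help you", agent_model, other_model)` compares the two smoothed likelihoods and picks the agent model, because those words occur only in the heuristically selected texts.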

    System and method of diarization and labeling of audio data

    Publication number: US11367450B2

    Publication date: 2022-06-21

    Application number: US16703030

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatically applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcribed audio data to label a portion of the transcribed audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcribed customer service interaction.

    System and method of diarization and labeling of audio data

    Publication number: US10950242B2

    Publication date: 2021-03-16

    Application number: US16703274

    Filing date: 2019-12-04

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatically applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcribed audio data to label a portion of the transcribed audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcribed customer service interaction.
