Speaker separation in diarization
    21.
    Granted invention patent
    Speaker separation in diarization (in force)

    Publication number: US09368116B2

    Publication date: 2016-06-14

    Application number: US14016783

    Filing date: 2013-09-03

    CPC classification number: G10L15/26 G10L17/06 G10L25/51 G10L25/78 G10L2025/783

    Abstract: A system and method of separating speakers in an audio file include obtaining an audio file. The audio file is transcribed into at least one text file by a transcription server. Homogenous speech segments are identified within the at least one text file. The audio file is segmented into homogenous audio segments that correspond to the identified homogenous speech segments. The homogenous audio segments of the audio file are separated into a first speaker audio file and a second speaker audio file. The first speaker audio file and the second speaker audio file are transcribed to produce a diarized transcript.

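The segmentation step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how homogenous speech segments are identified in the text file, so the pause-based rule here is an assumption, as are the names `Word`, `homogenous_segments`, `to_sample_ranges`, and the `max_gap` parameter. The sketch groups time-stamped transcript words into segments and maps each segment back to a sample range in the audio.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One transcribed word with its timing (hypothetical transcript format)."""
    text: str
    start: float  # seconds from the beginning of the audio
    end: float    # seconds from the beginning of the audio


def homogenous_segments(words, max_gap=0.5):
    """Group words into segments, splitting where the silence between
    consecutive words exceeds max_gap seconds (an assumed heuristic,
    not the rule claimed in the patent)."""
    segments = []
    current = []
    for word in words:
        if current and word.start - current[-1].end > max_gap:
            segments.append(current)
            current = []
        current.append(word)
    if current:
        segments.append(current)
    # Report each segment as a (start_time, end_time) span.
    return [(seg[0].start, seg[-1].end) for seg in segments]


def to_sample_ranges(spans, sample_rate):
    """Convert time spans in seconds to (first_sample, last_sample) index pairs."""
    return [(int(round(s * sample_rate)), int(round(e * sample_rate)))
            for s, e in spans]


words = [Word("hi", 0.0, 0.3), Word("there", 0.35, 0.7), Word("yes", 1.5, 1.8)]
spans = homogenous_segments(words)
ranges = to_sample_ranges(spans, sample_rate=8000)
```

Each sample range could then be cut out of the audio and routed to one of the two speaker files. How segments are assigned to a speaker is not specified in the abstract and is left out of this sketch.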

    System and Method of Automated Language Model Adaptation
    22.
    Invention application
    System and Method of Automated Language Model Adaptation (in force)

    Publication number: US20150066503A1

    Publication date: 2015-03-05

    Application number: US14291895

    Filing date: 2014-05-30

    Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from the plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription. The language model is modified based on the calculated statistics.

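The adaptation loop in the abstract above (evaluate, select, compute statistics, modify the model) can be sketched as follows. Everything specific here is an assumption for illustration: quality is taken to be a per-transcription confidence score, the statistics are unigram relative frequencies, and the modification is a linear interpolation with the base model. The function name `adapt_unigram_model` and its parameters are hypothetical.

```python
from collections import Counter


def adapt_unigram_model(base_probs, scored_transcripts, top_k=1, weight=0.5):
    """Return an adapted unigram model.

    base_probs: dict mapping word -> probability in the existing model.
    scored_transcripts: list of (text, quality_score) pairs.
    top_k: how many of the best-scoring transcriptions to keep (assumed).
    weight: interpolation weight given to the new statistics (assumed).
    """
    # Select the best transcriptions by their evaluated quality.
    best = sorted(scored_transcripts, key=lambda pair: pair[1], reverse=True)[:top_k]

    # Calculate statistics (word counts) from the selected transcriptions.
    counts = Counter(word for text, _ in best for word in text.lower().split())
    total = sum(counts.values())
    if total == 0:
        return dict(base_probs)

    # Modify the model: interpolate old probabilities with new frequencies.
    vocab = set(base_probs) | set(counts)
    return {
        word: (1 - weight) * base_probs.get(word, 0.0) + weight * counts[word] / total
        for word in vocab
    }


base = {"hello": 0.5, "world": 0.5}
transcripts = [("hello hello billing", 0.9), ("world", 0.2)]
adapted = adapt_unigram_model(base, transcripts, top_k=1, weight=0.5)
```

With a base model that sums to one, the interpolated model also sums to one, and a word seen only in the selected transcription (`billing` here) gains non-zero probability.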

    DIARIZATION USING ACOUSTIC LABELING
    24.
    Invention application

    Publication number: US20200312334A1

    Publication date: 2020-10-01

    Application number: US16848385

    Filing date: 2020-04-14

    Abstract: Systems and methods of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated with an identified speaker. Metadata associated with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.
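A minimal sketch of the flow described in the abstract above, under stated assumptions: each audio segment is already represented by a fixed-length feature vector, a voiceprint is simply the mean of a speaker's vectors, the metadata carries an `agent_id` key, and a cosine-similarity threshold decides the label. None of these choices come from the patent; the function names and the `threshold` value are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of equal-length feature vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_voiceprints(training):
    """training: dict mapping speaker id -> feature vectors from many audio files."""
    return {speaker: mean_vector(vectors) for speaker, vectors in training.items()}


def label_segments(segment_vectors, metadata, voiceprints, threshold=0.8):
    """Select the voiceprint named in the file's metadata and label each segment."""
    voiceprint = voiceprints[metadata["agent_id"]]  # assumed metadata key
    return [
        "identified" if cosine(vector, voiceprint) >= threshold else "other"
        for vector in segment_vectors
    ]


voiceprints = build_voiceprints({"agent-7": [[1.0, 0.0], [0.8, 0.2]]})
labels = label_segments([[1.0, 0.1], [0.0, 1.0]], {"agent_id": "agent-7"}, voiceprints)
```

The point of the metadata lookup is that only one stored voiceprint has to be compared against the file, rather than every known speaker.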

    Diarization using linguistic labeling

    Publication number: US10522152B2

    Publication date: 2019-12-31

    Application number: US16170278

    Filing date: 2018-10-25

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
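The three stages in the abstract above (heuristic selection, model creation, labeling) can be sketched roughly as below. The specifics are assumptions: the heuristic is a scripted greeting phrase, the linguistic model is a unigram frequency table, and the side of a call with the higher average word probability is labeled as the agent. The names `select_agent_sides`, `build_model`, `label_call`, and the `cue` phrase are hypothetical.

```python
from collections import Counter


def select_agent_sides(calls, cue="thank you for calling"):
    """Heuristic: keep the text of any speaker side that contains the cue phrase.

    calls: list of dicts mapping an anonymous speaker label to that side's text.
    """
    return [text for call in calls for text in call.values() if cue in text.lower()]


def build_model(texts):
    """Create a unigram relative-frequency model from the selected texts."""
    counts = Counter(word for text in texts for word in text.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def label_call(call, model):
    """Label the side whose words score highest under the model as the agent."""
    scores = {}
    for side, text in call.items():
        tokens = text.lower().split()
        scores[side] = sum(model.get(t, 0.0) for t in tokens) / max(len(tokens), 1)
    agent_side = max(scores, key=scores.get)
    return {side: "agent" if side == agent_side else "customer" for side in call}


training = [{"A": "Thank you for calling how may I help you", "B": "my bill is wrong"}]
model = build_model(select_agent_sides(training))
labels = label_call({"S1": "i want a refund", "S2": "how may i help you today"}, model)
```

In this toy run the second side reuses the agent's scripted vocabulary, so it receives the `agent` label even though it does not contain the greeting itself.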

    DIARIZATION USING LINGUISTIC LABELING
    28.
    Invention application

    Publication number: US20190066690A1

    Publication date: 2019-02-28

    Application number: US16170278

    Filing date: 2018-10-25

    Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
