-
公开(公告)号:US09368116B2
公开(公告)日:2016-06-14
申请号:US14016783
申请日:2013-09-03
Applicant: Verint Systems Ltd.
Inventor: Omer Ziv , Ron Wein , Ido Shapira , Ran Achituv
CPC classification number: G10L15/26 , G10L17/06 , G10L25/51 , G10L25/78 , G10L2025/783
Abstract: The system and method of separating speakers in an audio file including obtaining an audio file. The audio file is transcribed into at least one text file by a transcription server. Homogenous speech segments are identified within the at least one text file. The audio file is segmented into homogenous audio segments that correspond to the identified homogenous speech segments. The homogenous audio segments of the audio file are separated into a first speaker audio file and second speaker audio file the first speaker audio file and the second speaker audio file are transcribed to produce a diarized transcript.
Abstract translation: 分离音频文件中的扬声器的系统和方法,包括获取音频文件。 音频文件由转录服务器转录成至少一个文本文件。 在至少一个文本文件内识别均匀的语音段。 音频文件被分割成与所识别的同源语音片段对应的同质音频段。 音频文件的同质音频片段被分成第一扬声器音频文件和第二扬声器音频文件,第一扬声器音频文件和第二扬声器音频文件被转录以产生经过缩小的转录。
-
22.
公开(公告)号:US20150066503A1
公开(公告)日:2015-03-05
申请号:US14291895
申请日:2014-05-30
Applicant: VERINT SYSTEMS LTD.
Inventor: Ran Achituv , Omer Ziv , Ido Shapira , Daniel Baum
IPC: G10L15/26
CPC classification number: G10L15/197 , G06F17/30746 , G10L15/063 , G10L15/083 , G10L15/26 , G10L2015/0635 , H04M3/51
Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from a plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription from the plurality of audio file transcriptions. The language model is modified from the calculated statistics.
Abstract translation: 用于音频数据转录的语言模型的自动适应的系统和方法包括获得音频数据。 音频数据用语言模型转录以产生多个音频文件转录。 评估多个音频文件转录的质量。 基于评估的质量来选择来自多个音频文件转录的至少一个最佳转录。 根据来自多个音频文件转录的所选择的至少一个最佳转录来计算统计量。 语言模型根据计算的统计信息进行修改。
-
公开(公告)号:US11545137B2
公开(公告)日:2023-01-03
申请号:US16983550
申请日:2020-08-03
Applicant: Verint Systems Ltd.
Inventor: Ran Achituv , Omer Ziv , Roni Romano , Ido Shapira , Daniel Baum
IPC: G10L15/26 , G10L15/065 , G06F16/683 , G10L15/07 , G10L15/01 , G10L15/14 , G10L15/08
Abstract: Methods, systems, and computer readable media for automated transcription model adaptation includes obtaining audio data from a plurality of audio files. The audio data is transcribed to produce at least one audio file transcription which represents a plurality of transcription alternatives for each audio file. Speech analytics are applied to each audio file transcription. A best transcription is selected from the plurality of transcription alternatives for each audio file. Statistics from the selected best transcription are calculated. An adapted model is created from the calculated statistics.
-
公开(公告)号:US20200312334A1
公开(公告)日:2020-10-01
申请号:US16848385
申请日:2020-04-14
Applicant: Verint Systems Ltd.
Inventor: Omer Ziv , Ran Achituv , Ido Shapira , Jeremie Dreyfuss
Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.
-
公开(公告)号:US10733977B2
公开(公告)日:2020-08-04
申请号:US15473231
申请日:2017-03-29
Applicant: Verint Systems Ltd.
Inventor: Ran Achituv , Omer Ziv , Roni Romano , Ido Shapira , Daniel Baum
IPC: G10L15/26 , G10L15/065 , G06F16/683 , G10L15/07 , G10L15/01 , G10L15/14 , G10L15/08
Abstract: Methods, systems, and computer readable media for automated transcription model adaptation includes obtaining audio data from a plurality of audio files. The audio data is transcribed to produce at least one audio file transcription which represents a plurality of transcription alternatives for each audio file. Speech analytics are applied to each audio file transcription. A best transcription is selected from the plurality of transcription alternatives for each audio file. Statistics from the selected best transcription are calculated. An adapted model is created from the calculated statistics.
-
公开(公告)号:US10692501B2
公开(公告)日:2020-06-23
申请号:US16594764
申请日:2019-10-07
Applicant: Verint Systems Ltd.
Inventor: Omer Ziv , Ran Achituv , Ido Shapira , Jeremie Dreyfuss
Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.
-
公开(公告)号:US10522152B2
公开(公告)日:2019-12-31
申请号:US16170278
申请日:2018-10-25
Applicant: Verint Systems Ltd.
Inventor: Omer Ziv , Ran Achituv , Ido Shapira , Jeremie Dreyfuss
Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
-
公开(公告)号:US20190066690A1
公开(公告)日:2019-02-28
申请号:US16170278
申请日:2018-10-25
Applicant: Verint Systems Ltd.
Inventor: Omer Ziv , Ran Achituv , Ido Shapira , Jeremie Dreyfuss
Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
-
公开(公告)号:US09633650B2
公开(公告)日:2017-04-25
申请号:US14291893
申请日:2014-05-30
Applicant: Verint Systems Ltd.
Inventor: Ran Achituv , Omer Ziv , Roni Romano , Ido Shapira , Daniel Baum
IPC: G10L15/26 , G10L15/065 , G06F17/30 , G10L15/07 , G10L15/08
CPC classification number: G10L15/065 , G06F17/30746 , G10L15/01 , G10L15/07 , G10L15/083 , G10L15/14 , G10L15/26
Abstract: Methods, systems, and computer readable media for automated transcription model adaptation includes obtaining audio data from a plurality of audio files. The audio data is transcribed to produce at least one audio file transcription which represents a plurality of transcription alternatives for each audio file. Speech analytics are applied to each audio file transcription. A best transcription is selected from the plurality of transcription alternatives for each audio file. Statistics from the selected best transcription are calculated. An adapted model is created from the calculated statistics.
-
公开(公告)号:US20170098445A1
公开(公告)日:2017-04-06
申请号:US15332411
申请日:2016-10-24
Applicant: Verint Systems Ltd.
Inventor: Ran Achituv , Omer Ziv , Ido Shapira , Daniel Baum
IPC: G10L15/197 , G10L15/08 , G10L15/06
CPC classification number: G10L15/197 , G06F17/30746 , G10L15/063 , G10L15/083 , G10L15/26 , G10L2015/0635 , H04M3/51
Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio tile transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from a plurality of audio tile transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription from the plurality of audio file transcriptions. The language model is modified from the calculated statistics.
-
-
-
-
-
-
-
-
-