-
1.
公开(公告)号:US20240214343A1
公开(公告)日:2024-06-27
申请号:US18401210
申请日:2023-12-29
申请人: Greenfly, Inc.
IPC分类号: H04L51/52 , G06F16/40 , G06F16/435 , G06F16/438 , G06F16/483 , G06F16/9535 , G06F16/9538 , G06Q10/101 , G06Q50/00 , G06Q50/10 , G06V40/16 , G10L17/00 , G10L17/06 , G10L17/16 , G10L17/22 , H04L65/612 , H04L67/06 , H04L67/306 , H04L67/50 , H04N21/475 , H04N21/4784 , H04N21/4788
CPC分类号: H04L51/52 , G06F16/40 , G06F16/435 , G06F16/437 , G06F16/438 , G06F16/483 , G06F16/9535 , G06F16/9538 , G06Q50/01 , G06V40/166 , G06V40/172 , G10L17/00 , G10L17/06 , G10L17/16 , G10L17/22 , H04L65/612 , H04L67/306 , H04L67/535 , H04N21/4756 , H04N21/4758 , H04N21/4784 , H04N21/4788 , G06Q10/101 , G06Q50/10 , H04L67/06
摘要: A content distribution facilitation system is described comprising configured servers and a network interface configured to interface with a plurality of terminals in a client server relationship and optionally with a cloud-based storage system. A request from a first source for content comprising content criteria is received, the content criteria comprising content subject matter. At least a portion of the content request content criteria is transmitted to a selected content contributor. If recorded content is received from the first content contributor, the first source is provided with access to the received recorded content. The recorded content may be transmitted via one or more networks to one or more destination devices. Optionally, a voice analysis and/or facial recognition engine are utilized to determine if the recorded content is from the first content contributor.
-
2.
公开(公告)号:US20230368798A1
公开(公告)日:2023-11-16
申请号:US18044247
申请日:2020-11-16
发明人: Matthieu LIM , Ibtissam BRAHMI
摘要: The communication system (1) manages the communications of a plurality of user groups (GF) and authorizes secure communications between members (USER) of the same group (GF). The system comprises a server (SRC) and a plurality of user devices (UD) connected to an Internet-type network (IP) allowing voice communications. Speaker recognition and access authorization means (RL) are included and comprise artificial intelligence means (AI). According to the invention, the system comprises voice signal analysis means producing a scalogram of a speaker's voice signal by means of a discrete wavelet transform followed by a continuous wavelet transform, the scalogram being provided as input to the artificial intelligence means for speaker recognition.
-
公开(公告)号:US20210006531A1
公开(公告)日:2021-01-07
申请号:US16789345
申请日:2020-02-12
申请人: Greenfly, Inc.
IPC分类号: H04L12/58 , H04L29/08 , H04N21/475 , G06Q50/00 , H04L29/06 , H04N21/4788 , G06F16/435 , G06F16/438 , G06F16/9535 , H04N21/4784 , G06F16/40 , G10L17/00 , G06K9/00 , G10L17/06 , G10L17/16 , G10L17/22
摘要: A content distribution facilitation system is described comprising configured servers and a network interface configured to interface with a plurality of terminals in a client server relationship and optionally with a cloud-based storage system. A request from a first source for content comprising content criteria is received, the content criteria comprising content subject matter. At least a portion of the content request content criteria is transmitted to a selected content contributor. If recorded content is received from the first content contributor, the first source is provided with access to the received recorded content. The recorded content may be transmitted via one or more networks to one or more destination devices. Optionally, a voice analysis and/or facial recognition engine are utilized to determine if the recorded content is from the first content contributor.
-
公开(公告)号:US10789960B2
公开(公告)日:2020-09-29
申请号:US15803024
申请日:2017-11-03
申请人: PW GROUP
摘要: Disclosed is a method including a prior phase for referencing an authorized user, during which this user pronounces a reference phrase at least once, the phrase being converted into a series of reference symbols by a statistical conversion mutual to all of the users to be referenced, and an authentication test phase, including a first step during which a candidate user pronounces the reference phrase at least once, and this pronounced phrase is converted in the same manner as the reference phrase during the prior phase, by using the same conversion, into a sequence of candidate symbols, and a second step during which the series of candidate symbols is compared to the series of reference symbols to determine a comparison result, which is compared to at least one predetermined threshold, determining whether the candidate user who pronounced the phrase during the test phase is indeed the authorized user, providing authentication.
-
公开(公告)号:US10424317B2
公开(公告)日:2019-09-24
申请号:US15403481
申请日:2017-01-11
IPC分类号: G10L21/0232 , G10L15/04 , G10L17/16 , G10L21/028 , H04R1/40 , G10L17/06 , G10L25/03 , G10L21/0216 , G10L15/00 , G10L25/84
摘要: Disclosed methods and systems are directed to determining a best microphone pair and segmenting sound signals. The methods and systems may include receiving a collection of sound signals comprising speech from one or more audio sources (e.g., meeting participants) and/or background noise. The methods and systems may include calculating a TDOA and determining, based on the TDOA and via robust statistics, the best pair of microphones. The methods and systems may also include segmenting sound signals from multiple sources.
-
公开(公告)号:US20190214020A1
公开(公告)日:2019-07-11
申请号:US16353756
申请日:2019-03-14
发明人: Jie CHEN , Dan SU , Tianxiao FU , Na HU
CPC分类号: G10L17/005 , G10L15/02 , G10L17/02 , G10L17/06 , G10L17/16 , G10L17/18 , G10L25/18 , G10L2025/783
摘要: The present disclosure relates to a method, apparatus, and system for speaker verification. The method includes: acquiring an audio recording; extracting speech signals from the audio recording; extracting features of the extracted speech signals; and determining whether the extracted speech signals represent speech by a predetermined speaker based on the extracted features and a speaker model trained with reference voice data of the predetermined speaker.
-
公开(公告)号:US10109280B2
公开(公告)日:2018-10-23
申请号:US15839190
申请日:2017-12-12
申请人: Verint Systems Ltd.
IPC分类号: G10L15/26 , G10L17/06 , G10L17/16 , G10L25/78 , G10L15/02 , G10L17/04 , H04M3/51 , G10L17/02 , G10L15/00
摘要: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
-
公开(公告)号:US09881617B2
公开(公告)日:2018-01-30
申请号:US15254326
申请日:2016-09-01
申请人: Verint Systems Ltd.
IPC分类号: G10L15/26 , G10L17/06 , G10L17/02 , G10L17/16 , G10L15/02 , G10L17/04 , G10L25/78 , H04M3/51 , G10L15/00
CPC分类号: G10L17/06 , G10L15/02 , G10L15/26 , G10L17/02 , G10L17/04 , G10L17/16 , G10L25/78 , G10L2015/025 , H04M3/5175 , H04M2201/41 , H04M2203/303
摘要: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
-
公开(公告)号:US09875742B2
公开(公告)日:2018-01-23
申请号:US15006572
申请日:2016-01-26
申请人: Verint Systems Ltd.
发明人: Alex Gorodetski , Oana Sidi , Ron Wein , Ido Shapira
IPC分类号: G10L15/00 , G10L17/00 , G10L15/06 , G10L17/04 , G10L17/16 , G10L17/02 , G10L25/84 , G10L15/26
摘要: Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first pass-blind diarization is on a per-frame basis and the second pass-blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.
-
10.
公开(公告)号:US20170053653A1
公开(公告)日:2017-02-23
申请号:US15254326
申请日:2016-09-01
申请人: Verint Systems Ltd.
IPC分类号: G10L17/06 , G10L17/02 , H04M3/51 , G10L15/02 , G10L17/16 , G10L17/04 , G10L15/26 , G10L25/78
CPC分类号: G10L17/06 , G10L15/02 , G10L15/26 , G10L17/02 , G10L17/04 , G10L17/16 , G10L25/78 , G10L2015/025 , H04M3/5175 , H04M2201/41 , H04M2203/303
摘要: In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
摘要翻译: 在音频数据的分类方法中,将音频数据分割为多个话语。 每个话语被表示为代表多个特征向量的话语模型。 话语模型是聚类的。 从群集话语模型构建多个说话者模型。 由多个扬声器模型构成隐马尔可夫模型。 已识别的扬声器模型的序列被解码。
-
-
-
-
-
-
-
-
-