专利检索 ipc:"G10L17/16" 第 1 页

1.

发明公开
METHODS AND SYSTEM FOR DISTRIBUTING INFORMATION VIA MULTIPLE FORMS OF DELIVERY SERVICES 审中-公开

公开(公告)号：US20240214343A1

公开(公告)日：2024-06-27

申请号：US18401210

申请日：2023-12-29

申请人： Greenfly, Inc.

发明人： Shawn David Green , Daniel Brian Kirschner

IPC分类号： H04L51/52 , G06F16/40 , G06F16/435 , G06F16/438 , G06F16/483 , G06F16/9535 , G06F16/9538 , G06Q10/101 , G06Q50/00 , G06Q50/10 , G06V40/16 , G10L17/00 , G10L17/06 , G10L17/16 , G10L17/22 , H04L65/612 , H04L67/06 , H04L67/306 , H04L67/50 , H04N21/475 , H04N21/4784 , H04N21/4788

CPC分类号： H04L51/52 , G06F16/40 , G06F16/435 , G06F16/437 , G06F16/438 , G06F16/483 , G06F16/9535 , G06F16/9538 , G06Q50/01 , G06V40/166 , G06V40/172 , G10L17/00 , G10L17/06 , G10L17/16 , G10L17/22 , H04L65/612 , H04L67/306 , H04L67/535 , H04N21/4756 , H04N21/4758 , H04N21/4784 , H04N21/4788 , G06Q10/101 , G06Q50/10 , H04L67/06

摘要： A content distribution facilitation system is described comprising configured servers and a network interface configured to interface with a plurality of terminals in a client server relationship and optionally with a cloud-based storage system. A request from a first source for content comprising content criteria is received, the content criteria comprising content subject matter. At least a portion of the content request content criteria is transmitted to a selected content contributor. If recorded content is received from the first content contributor, the first source is provided with access to the received recorded content. The recorded content may be transmitted via one or more networks to one or more destination devices. Optionally, a voice analysis and/or facial recognition engine are utilized to determine if the recorded content is from the first content contributor.

2.

发明公开
SECURE COMMUNICATION SYSTEM WITH SPEAKER RECOGNITION BY VOICE BIOMETRICS FOR USER GROUPS SUCH AS FAMILY GROUPS 审中-公开

公开(公告)号：US20230368798A1

公开(公告)日：2023-11-16

申请号：US18044247

申请日：2020-11-16

申请人： KIWIP TECHNOLOGIES SAS

发明人： Matthieu LIM , Ibtissam BRAHMI

IPC分类号： G10L17/18 , G10L17/16 , G10L17/22 , G10L25/27 , G10L17/02

CPC分类号： G10L17/18 , G10L17/16 , G10L17/22 , G10L25/27 , G10L17/02

摘要： The communication system (1) manages the communications of a plurality of user groups (GF) and authorizes secure communications between members (USER) of the same group (GF). The system comprises a server (SRC) and a plurality of user devices (UD) connected to an Internet-type network (IP) allowing voice communications. Speaker recognition and access authorization means (RL) are included and comprise artificial intelligence means (AI). According to the invention, the system comprises voice signal analysis means producing a scalogram of a speaker's voice signal by means of a discrete wavelet transform followed by a continuous wavelet transform, the scalogram being provided as input to the artificial intelligence means for speaker recognition.

3.

发明申请
METHODS AND SYSTEM FOR DISTRIBUTING INFORMATION VIA MULTIPLE FORMS OF DELIVERY SERVICES 有权

公开(公告)号：US20210006531A1

公开(公告)日：2021-01-07

申请号：US16789345

申请日：2020-02-12

申请人： Greenfly, Inc.

发明人： Shawn David Green , Daniel Brian Kirschner

IPC分类号： H04L12/58 , H04L29/08 , H04N21/475 , G06Q50/00 , H04L29/06 , H04N21/4788 , G06F16/435 , G06F16/438 , G06F16/9535 , H04N21/4784 , G06F16/40 , G10L17/00 , G06K9/00 , G10L17/06 , G10L17/16 , G10L17/22

摘要： A content distribution facilitation system is described comprising configured servers and a network interface configured to interface with a plurality of terminals in a client server relationship and optionally with a cloud-based storage system. A request from a first source for content comprising content criteria is received, the content criteria comprising content subject matter. At least a portion of the content request content criteria is transmitted to a selected content contributor. If recorded content is received from the first content contributor, the first source is provided with access to the received recorded content. The recorded content may be transmitted via one or more networks to one or more destination devices. Optionally, a voice analysis and/or facial recognition engine are utilized to determine if the recorded content is from the first content contributor.

4.

发明授权
Method and system for user authentication by voice biometrics 有权

公开(公告)号：US10789960B2

公开(公告)日：2020-09-29

申请号：US15803024

申请日：2017-11-03

申请人： PW GROUP

发明人： Gregory Libert , Dijana Petrovski Chollet , Houssemeddine Khemiri

IPC分类号： G10L17/24 , G10L17/16 , G10L17/20

摘要： Disclosed is a method including a prior phase for referencing an authorized user, during which this user pronounces a reference phrase at least once, the phrase being converted into a series of reference symbols by a statistical conversion mutual to all of the users to be referenced, and an authentication test phase, including a first step during which a candidate user pronounces the reference phrase at least once, and this pronounced phrase is converted in the same manner as the reference phrase during the prior phase, by using the same conversion, into a sequence of candidate symbols, and a second step during which the series of candidate symbols is compared to the series of reference symbols to determine a comparison result, which is compared to at least one predetermined threshold, determining whether the candidate user who pronounced the phrase during the test phase is indeed the authorized user, providing authentication.

5.

发明授权
Method for microphone selection and multi-talker segmentation with ambient automated speech recognition (ASR) 有权

公开(公告)号：US10424317B2

公开(公告)日：2019-09-24

申请号：US15403481

申请日：2017-01-11

申请人： Nuance Communications, Inc.

发明人： Pablo Peso Parada , Dushyant Sharma , Patrick Naylor

IPC分类号： G10L21/0232 , G10L15/04 , G10L17/16 , G10L21/028 , H04R1/40 , G10L17/06 , G10L25/03 , G10L21/0216 , G10L15/00 , G10L25/84

摘要： Disclosed methods and systems are directed to determining a best microphone pair and segmenting sound signals. The methods and systems may include receiving a collection of sound signals comprising speech from one or more audio sources (e.g., meeting participants) and/or background noise. The methods and systems may include calculating a TDOA and determining, based on the TDOA and via robust statistics, the best pair of microphones. The methods and systems may also include segmenting sound signals from multiple sources.

6.

发明申请
METHOD, APPARATUS AND SYSTEM FOR SPEAKER VERIFICATION 审中-公开

公开(公告)号：US20190214020A1

公开(公告)日：2019-07-11

申请号：US16353756

申请日：2019-03-14

申请人： BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.

发明人： Jie CHEN , Dan SU , Tianxiao FU , Na HU

IPC分类号： G10L17/00 , G10L17/06 , G10L17/18 , G10L15/02 , G10L17/16

CPC分类号： G10L17/005 , G10L15/02 , G10L17/02 , G10L17/06 , G10L17/16 , G10L17/18 , G10L25/18 , G10L2025/783

摘要： The present disclosure relates to a method, apparatus, and system for speaker verification. The method includes: acquiring an audio recording; extracting speech signals from the audio recording; extracting features of the extracted speech signals; and determining whether the extracted speech signals represent speech by a predetermined speaker based on the extracted features and a speaker model trained with reference voice data of the predetermined speaker.

7.

发明授权
Blind diarization of recorded calls with arbitrary number of speakers 有权

公开(公告)号：US10109280B2

公开(公告)日：2018-10-23

申请号：US15839190

申请日：2017-12-12

申请人： Verint Systems Ltd.

发明人： Oana Sidi , Ron Wein

IPC分类号： G10L15/26 , G10L17/06 , G10L17/16 , G10L25/78 , G10L15/02 , G10L17/04 , H04M3/51 , G10L17/02 , G10L15/00

摘要： In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.

8.

发明授权
Blind diarization of recorded calls with arbitrary number of speakers 有权

公开(公告)号：US09881617B2

公开(公告)日：2018-01-30

申请号：US15254326

申请日：2016-09-01

申请人： Verint Systems Ltd.

发明人： Oana Sidi , Ron Wein

IPC分类号： G10L15/26 , G10L17/06 , G10L17/02 , G10L17/16 , G10L15/02 , G10L17/04 , G10L25/78 , H04M3/51 , G10L15/00

CPC分类号： G10L17/06 , G10L15/02 , G10L15/26 , G10L17/02 , G10L17/04 , G10L17/16 , G10L25/78 , G10L2015/025 , H04M3/5175 , H04M2201/41 , H04M2203/303

摘要： In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.

9.

发明授权
Word-level blind diarization of recorded calls with arbitrary number of speakers 有权

公开(公告)号：US09875742B2

公开(公告)日：2018-01-23

申请号：US15006572

申请日：2016-01-26

申请人： Verint Systems Ltd.

发明人： Alex Gorodetski , Oana Sidi , Ron Wein , Ido Shapira

IPC分类号： G10L15/00 , G10L17/00 , G10L15/06 , G10L17/04 , G10L17/16 , G10L17/02 , G10L25/84 , G10L15/26

CPC分类号： G10L17/04 , G10L15/26 , G10L17/02 , G10L17/16 , G10L25/84

摘要： Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first pass-blind diarization is on a per-frame basis and the second pass-blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.

10.

发明申请
Blind Diarization of Recorded Calls With Arbitrary Number of Speakers 有权
标题翻译：用任意数量的演讲者进行录音电话的黑暗化

公开(公告)号：US20170053653A1

公开(公告)日：2017-02-23

申请号：US15254326

申请日：2016-09-01

申请人： Verint Systems Ltd.

发明人： Oana Sidi , Ron Wein

IPC分类号： G10L17/06 , G10L17/02 , H04M3/51 , G10L15/02 , G10L17/16 , G10L17/04 , G10L15/26 , G10L25/78

CPC分类号： G10L17/06 , G10L15/02 , G10L15/26 , G10L17/02 , G10L17/04 , G10L17/16 , G10L25/78 , G10L2015/025 , H04M3/5175 , H04M2201/41 , H04M2203/303

摘要： In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.

摘要翻译： 在音频数据的分类方法中，将音频数据分割为多个话语。每个话语被表示为代表多个特征向量的话语模型。话语模型是聚类的。从群集话语模型构建多个说话者模型。由多个扬声器模型构成隐马尔可夫模型。已识别的扬声器模型的序列被解码。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类