专利检索 ipc:G10L17/18 第 1 页

1.

发明授权
JOINT NEURAL NETWORK FOR SPEAKER RECOGNITION 有权转让

公开(公告)号：EP3791392B1

公开(公告)日：2024-08-07

申请号：EP19723267.1

申请日：2019-04-27

IPC分类号： G10L17/18 , G10L17/04 , G10L17/10 , G06N3/08 , G06V20/52 , G06V40/16 , G10L17/02 , G06N3/044 , G06N3/045

CPC分类号： G06N3/08 , G10L17/04 , G10L17/10 , G10L17/18 , G10L17/02 , G10L15/16 , G10L15/07 , G06V40/16 , G06V40/172 , G06V20/52 , G06N3/044 , G06N3/045

2.

发明公开
SPEAKER DIARIZATION SUPPORTING EPOSODICAL CONTENT 审中-公开

公开(公告)号：EP4330965A1

公开(公告)日：2024-03-06

申请号：EP22724184.1

申请日：2022-04-27

申请人： Dolby Laboratories Licensing Corporation

发明人： FANELLI, Andrea , YUN, Mingqing , PANKEY, Satej Suresh , ENGEL, Nicholas Laurence , CRUM, Poppy Anne Carrie

IPC分类号： G10L25/84 , G10L17/02 , G10L17/06 , G10L17/18

3.

发明公开
END-TO-END SPEECH DIARIZATION VIA ITERATIVE SPEAKER EMBEDDING 审中-公开

公开(公告)号：EP4323988A1

公开(公告)日：2024-02-21

申请号：EP21742664.2

申请日：2021-06-22

申请人： GOOGLE LLC

发明人： GRANGIER, David , ZEGHIDOUR, Neil , TEBOUL, Oliver

IPC分类号： G10L25/87 , G10L25/78 , G10L17/18

4.

发明授权
TEXT INDEPENDENT SPEAKER-VERIFICATION ON A MEDIA OPERATING SYSTEM USING DEEP LEARNING ON RAW WAVEFORMS 有权

公开(公告)号：EP4082008B1

公开(公告)日：2024-01-31

申请号：EP20842838.3

申请日：2020-12-21

发明人： MUHAMED, Aashiq , GHOSE, Susmita

IPC分类号： G10L17/18 , G10L17/04 , G06N3/04 , G06N3/045 , G06N3/048 , G06N3/08

5.

发明授权
SPEAKER RECOGNITION/LOCATION USING NEURAL NETWORK 有权

公开(公告)号：EP3791393B1

公开(公告)日：2023-01-25

申请号：EP19723576.5

申请日：2019-04-30

发明人： ZHANG, Shixiong , XIAO, Xiong

IPC分类号： G10L17/18 , G01N21/898

6.

发明公开
SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S) 审中-公开

公开(公告)号：EP4086904A1

公开(公告)日：2022-11-09

申请号：EP22181074.0

申请日：2019-12-04

申请人： Google LLC

发明人： MORENO, Ignacio Lopez , WANG, Quan , PELECANOS, Jason , WAN, Li , GRUENSTEIN, Alexander , ERDOGAN, Hakan

IPC分类号： G10L25/78 , G10L25/30 , G10L17/18 , G10L15/07 , G10L15/20 , G10L17/04 , G10L17/20

摘要： Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.

7.

发明公开
NEURAL NETWORKS FOR SPEAKER VERIFICATION 审中-公开

公开(公告)号：EP4084000A3

公开(公告)日：2022-11-09

申请号：EP22179382.1

申请日：2016-07-27

申请人： Google LLC

发明人： Heigold, Georg , Bengio, Samy , Lopez Moreno, Ignacio

IPC分类号： G10L17/18 , G10L17/02 , G07C9/37 , G10L17/04

摘要： This document generally describes systems, methods, devices, and other techniques related to speaker verification, including (i) training a neural network for a speaker verification model, (ii) enrolling users at a client device, and (iii) verifying identities of users based on characteristics of the users' voices. Some implementations include a computer-implemented method. The method can include receiving, at a computing device, data that characterizes an utterance of a user of the computing device. A speaker representation can be generated, at the computing device, for the utterance using a neural network on the computing device. The neural network can be trained based on a plurality of training samples that each: (i) include data that characterizes a first utterance and data that characterizes one or more second utterances, and (ii) are labeled as a matching speakers sample or a non-matching speakers sample.

8.

发明公开
METHOD AND SYSTEM FOR TELECONFERENCE ACTUAL PARTICIPANT RECOGNITION 审中-公开

公开(公告)号：EP4040435A1

公开(公告)日：2022-08-10

申请号：EP21305164.2

申请日：2021-02-05

申请人： ALE International

发明人： HELBERT, Emmanuel , WARICHET, Sebastien

IPC分类号： G10L17/18

摘要： A method and system for detecting and recognizing an actual participant (607) as an active speaker in a teleconference (606) by training an encoder before the teleconference takes place, and creating a database (602) of reference vectors representing the voice of candidate participants, then, while the teleconference takes place, comparing reference vectors with vectors representing the voice stream of actual participants. The encoder may for example be a Convolutional Neural Network (CNN).

9.

发明授权
METHOD WITH SPEAKER RECOGNITION REGISTRATION AND CORRESPONDING NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM 有权

公开(公告)号：EP3706117B1

公开(公告)日：2022-05-11

申请号：EP20161160.5

申请日：2020-03-05

发明人： PARK, Sung-Un , KIM, Kyuhong

IPC分类号： G10L17/04 , G10L17/20 , G10L17/02 , G10L17/18 , G10L21/0216

10.

发明公开
SIGNAL EXTRACTION SYSTEM, SIGNAL EXTRACTION LEARNING METHOD, AND SIGNAL EXTRACTION LEARNING PROGRAM 审中-公开

公开(公告)号：EP3979240A1

公开(公告)日：2022-04-06

申请号：EP19930251.4

申请日：2019-05-28

申请人： NEC Corporation

发明人： KOSHINAKA, Takafumi , YAMAMOTO, Hitoshi , KOIDA, Kaoru , SUZUKI, Takayuki

IPC分类号： G10L17/18 , G10L17/00 , G10L17/04 , G10L17/10 , G10L25/30

摘要： A neural network input unit 81 inputs a neural network in which a first network having a layer for inputting an anchor signal belonging to a predetermined class and a mixed signal including a target signal belonging to the class and a layer for outputting, as an estimation result, a reconstruction mask indicating a time-frequency domain in which the target signal is present in the mixed signal, and a second network having a layer for inputting the target signal extracted by applying the mixed signal to the reconstruction mask and a layer for outputting a result obtained by classifying the input target signal into a predetermined class are combined. A reconstruction mask estimation unit 82 applies the anchor signal and mixed signal to the first network to estimate the reconstruction mask of the class to which the anchor signal belongs. A signal classification unit 83 applies the mixed signal to the estimated reconstruction mask to extract the target signal, and applies the extracted target signal to the second network to classify the target signal into the class.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类