Patent search ap:("Google LLC") AND inv:"Oliver Teboul" Page 1

1.

Granted invention patent
End-to-end speech diarization via iterative speaker embedding (Active)

Publication number: US11887623B2

Publication date: 2024-01-30

Application number: US17304514

Application date: 2021-06-22

Applicant: Google LLC

Inventor: David Grangier, Neil Zeghidour, Oliver Teboul

IPC: G10L25/78, G06N3/04, G10L15/06, G10L15/07, G10L17/18, G10L19/008

CPC classification number: G10L25/78, G06N3/04, G10L15/063, G10L15/07, G10L17/18, G10L19/008

Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.
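
The iterative selection described in the abstract above can be sketched roughly as follows. This is an illustrative toy, not the patented implementation: the abstract does not say how the probability is computed or how a temporal embedding is chosen from it, so the scorer `new_speaker_probability`, the arg-max selection rule, the cosine-threshold voice activity rule, and all function names are assumptions made for this example. The toy scorer also does not model overlapping speech.

```python
import numpy as np


def _cosine(a, b):
    """Cosine similarity between rows of a (T, D) and rows of b (N, D) -> (T, N)."""
    a_n = a / (np.linalg.norm(a, axis=1, keepdims=True) + 1e-9)
    b_n = b / (np.linalg.norm(b, axis=1, keepdims=True) + 1e-9)
    return a_n @ b_n.T


def new_speaker_probability(h, selected):
    """Hypothetical stand-in for a learned scorer.

    Returns, for each of the T temporal embeddings, a pseudo-probability that
    it holds voice activity from a speaker that has not been selected yet.
    """
    energy = np.linalg.norm(h, axis=1)  # silent frames get energy 0
    if selected:
        # Frames similar to an already selected speaker are not "new".
        novelty = 1.0 - _cosine(h, np.stack(selected)).max(axis=1)
    else:
        novelty = np.ones(len(h))
    return 1.0 / (1.0 + np.exp(-4.0 * (energy * novelty - 0.5)))


def diarize(h, num_speakers):
    """h: (T, D) temporal embeddings. Returns (speakers, activity)."""
    selected = []
    for _ in range(num_speakers):  # one iteration per speaker
        p_new = new_speaker_probability(h, selected)
        # Assumption: take the temporal embedding with the highest probability.
        selected.append(h[int(np.argmax(p_new))])
    speakers = np.stack(selected)  # (N, D) speaker embeddings
    # Assumption: a frame is active for a speaker if their cosine similarity > 0.5.
    activity = _cosine(h, speakers) > 0.5  # (T, N) voice activity indicators
    return speakers, activity


if __name__ == "__main__":
    # Five frames: speaker A, speaker A, silence, speaker B, speaker B.
    h = np.array([[1, 0], [1, 0], [0, 0], [0, 1], [0, 1]], dtype=float)
    speakers, activity = diarize(h, num_speakers=2)
    print(speakers)
    print(activity.astype(int))
```

In a trained system the scorer and the voice activity predictor would be learned networks conditioned on the previously selected speaker embeddings; the hand-written rules here only make the control flow concrete.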

2.

Published invention application
END-TO-END SPEECH DIARIZATION VIA ITERATIVE SPEAKER EMBEDDING (Under examination, published)

Publication number: US20240144957A1

Publication date: 2024-05-02

Application number: US18544647

Application date: 2023-12-19

Applicant: Google LLC

Inventor: David Grangier, Neil Zeghidour, Oliver Teboul

IPC: G10L25/78, G06N3/04, G10L15/06, G10L15/07, G10L17/18, G10L19/008

CPC classification number: G10L25/78, G06N3/04, G10L15/063, G10L15/07, G10L17/18, G10L19/008

Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.

3.

Invention application
End-To-End Speech Diarization Via Iterative Speaker Embedding (Active)

Publication number: US20220375492A1

Publication date: 2022-11-24

Application number: US17304514

Application date: 2021-06-22

Applicant: Google LLC

Inventor: David Grangier, Neil Zeghidour, Oliver Teboul

IPC: G10L25/78, G10L19/008, G06N3/04, G10L15/07, G10L15/06, G10L17/18

Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.
