TARGET SPEAKER KEYWORD SPOTTING

Invention Application

US20250078840A1 TARGET SPEAKER KEYWORD SPOTTING (In force)

Patent Title: TARGET SPEAKER KEYWORD SPOTTING
Application No.: US18812338

Application Date: 2024-08-22
Publication No.: US20250078840A1

Publication Date: 2025-03-06
Inventor: Pai Zhu, Beltrán Labrador Serrano, Guanlong Zhao, Angelo Alfredo Scorza Scarpati, Quan Wang, Alex Seungryong Park, Ignacio Lopez Moreno
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G10L17/02
IPC: G10L17/02; G10L17/04; G10L17/22

Abstract:

A method includes receiving audio data corresponding to an utterance spoken by a particular user and captured in streaming audio by a user device. The method also includes performing speaker identification on the audio data to identify an identity of the particular user that spoke the utterance. The method also includes obtaining a keyword detection model personalized for the particular user based on the identity of the particular user that spoke the utterance. The keyword detection model is conditioned on speaker characteristic information associated with the particular user to adapt the keyword detection model to detect a presence of a keyword in audio for the particular user. The method also includes determining that the utterance includes the keyword using the keyword detection model personalized for the particular user.
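
The flow described in the abstract (identify the speaker, obtain a keyword detector personalized for that speaker, then test for the keyword) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the application: the function and class names, the use of cosine similarity against enrolled profiles for speaker identification, the sigmoid gating used to condition the detector on the speaker profile, and all thresholds and weights are hypothetical.

```python
import math
from dataclasses import dataclass


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def identify_speaker(utterance_embedding, enrolled, threshold=0.7):
    """Return the enrolled user id with the most similar profile, or None.

    Assumption: speaker identification is done by comparing an utterance
    embedding with stored per-user profiles; the threshold is hypothetical.
    """
    best_id, best_score = None, threshold
    for user_id, profile in enrolled.items():
        score = cosine(utterance_embedding, profile)
        if score >= best_score:
            best_id, best_score = user_id, score
    return best_id


@dataclass
class PersonalizedKeywordDetector:
    """Toy keyword detector conditioned on one user's speaker profile."""
    speaker_profile: list
    keyword_weights: list
    threshold: float = 0.5

    def score(self, features):
        # Assumed conditioning: gate each acoustic feature by the sigmoid
        # of the matching speaker-profile dimension, then apply a linear
        # keyword classifier followed by a sigmoid.
        gated = [f / (1.0 + math.exp(-s))
                 for f, s in zip(features, self.speaker_profile)]
        logit = sum(w * g for w, g in zip(self.keyword_weights, gated))
        return 1.0 / (1.0 + math.exp(-logit))

    def detect(self, features):
        return self.score(features) >= self.threshold


def spot_keyword(utterance_embedding, features, enrolled, keyword_weights):
    """Identify the speaker, then run the detector personalized for them."""
    user_id = identify_speaker(utterance_embedding, enrolled)
    if user_id is None:
        return None, False  # unknown speaker: no personalized model to use
    detector = PersonalizedKeywordDetector(enrolled[user_id], keyword_weights)
    return user_id, detector.detect(features)


# Hypothetical enrolled profiles and classifier weights.
enrolled = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}
keyword_weights = [3.0, -1.0, 0.0]
result = spot_keyword([0.9, 0.1, 0.0], [2.0, 1.0, 0.5], enrolled, keyword_weights)
```

In this sketch an utterance from an unrecognized speaker yields no detection at all; a real system might instead fall back to a speaker-independent model, which the abstract does not specify.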

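IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L17/00	Speaker identification or verification
G10L17/02	. Preprocessing operations, e.g. segment selection; pattern representation or modelling, e.g. based on linear discriminant analysis (LDA) or principal components; feature selection or extraction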