Multi channel voice activity detection

Invention Grant

US11790888B2 Multi channel voice activity detection 有权

Please log in to see more content

Patent Title: Multi channel voice activity detection
Application No.: US17806198

Application Date: 2022-06-09
Publication No.: US11790888B2

Publication Date: 2023-10-17
Inventor: Nolan Andrew Miller , Ramin Mehran
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Honigman LLP
Agent Brett A. Krueger; Grant Griffith
Main IPC: G10L15/02
IPC: G10L15/02 ; H04R3/00

Abstract:

A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

Public/Granted literature

US20220310060A1 Multi Channel Voice Activity Detection Public/Granted day:2022-09-29

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/02	.语音识别的特征提取；识别单位的选择