Invention Grant
- Patent Title: Multi channel voice activity detection
- Application No.: US17806198
- Application Date: 2022-06-09
- Publication No.: US11790888B2
- Publication Date: 2023-10-17
- Inventor: Nolan Andrew Miller, Ramin Mehran
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent: Brett A. Krueger; Grant Griffith
- Main IPC: G10L15/02
- IPC: G10L15/02; H04R3/00

Abstract:
A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to a user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating, as output from an application-specific classifier, a first score indicating a likelihood that the multi-channel audio corresponds to a particular audio type that a particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.
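
The flow the abstract describes (per-frame location fingerprint, classifier score, accept/reject decision) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how the location fingerprint model or the classifier work, so the cross-correlation delay estimate, the feeding of fingerprints into the classifier, the `steady_source_classifier` heuristic, the 0.5 threshold, and all function names here are hypothetical stand-ins.

```python
from typing import Callable, List, Sequence

Channel = Sequence[float]
Frame = Sequence[Channel]  # one input frame: channels x samples


def best_lag(ref: Channel, other: Channel, max_lag: int) -> int:
    """Lag (in samples) at which `other` best lines up with `ref`,
    found by brute-force cross-correlation over [-max_lag, max_lag]."""
    n = len(ref)
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = sum(ref[i] * other[i + lag] for i in range(n) if 0 <= i + lag < n)
        if score > best_score:
            best, best_score = lag, score
    return best


def location_fingerprint(frame: Frame, max_lag: int = 4) -> List[int]:
    """ASSUMPTION: stand-in for the location fingerprint model.
    Uses each channel's delay relative to channel 0 as a crude source-location cue."""
    ref = frame[0]
    return [best_lag(ref, channel, max_lag) for channel in frame[1:]]


def steady_source_classifier(fingerprints: List[List[int]]) -> float:
    """ASSUMPTION: hypothetical application-specific classifier.
    Scores the fraction of frames whose fingerprint matches the first frame's,
    i.e. how consistently the audio comes from a single location."""
    if not fingerprints:
        return 0.0
    first = fingerprints[0]
    return sum(fp == first for fp in fingerprints) / len(fingerprints)


def accept_audio(
    frames: Sequence[Frame],
    classifier: Callable[[List[List[int]]], float],
    threshold: float = 0.5,  # ASSUMPTION: illustrative value only
) -> bool:
    """Accept the audio when the classifier's score meets the threshold."""
    fingerprints = [location_fingerprint(frame) for frame in frames]
    first_score = classifier(fingerprints)
    return first_score >= threshold
```

A caller would pass a sequence of frames, each holding one list of samples per microphone, together with whichever classifier suits the application; the classifier is kept as a parameter because the abstract ties it to the particular application rather than to the detector itself.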
Public/Granted literature
- US20220310060A1 Multi Channel Voice Activity Detection (Public/Granted day: 2022-09-29)