-
公开(公告)号:US11798533B2
公开(公告)日:2023-10-24
申请号:US17221220
申请日:2021-04-02
Applicant: Google LLC
Inventor: Joseph Caroselli, Jr. , Yiteng Huang , Arun Narayanan
IPC: G10L15/08 , G10L21/0216 , G06N20/00 , G10L15/05
CPC classification number: G10L15/083 , G06N20/00 , G10L15/05 , G10L21/0216 , G10L2015/088 , G10L2021/02166
Abstract: Implementations disclosed herein are directed to initializing and utilizing a beamformer in processing of audio data received at a computing device. The computing device can: receive audio data that captures a spoken utterance of a user, determine that a first audio data segment of the audio data includes one or more particular words or phrases; obtain a preceding audio data segment that precedes the first audio data segment; estimate a spatial correlation matrix based on the first audio data segment and based on the preceding audio data segment; initialize the beamformer based on the estimated spatial correlation matrix; and cause the initialized beamformer to be utilized in processing of at least a second audio data segment of the audio data. Additionally, or alternatively, the computing device can transmit the spatial correlation matrix to server(s), and the server(s) can transmit the initialized beamformer back to the computing device.