Patent search ap:("Google LLC") AND inv:"Shao-Fu Shih" Page 1

1.

Invention publication
Guided Speech Enhancement Network Under examination (published)

Publication No.: US20240249741A1

Publication date: 2024-07-25

Application No.: US18159679

Filing date: 2023-01-25

Applicant: Google LLC

Inventor: George Chiachi Sung, Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin

IPC: G10L21/0232, G10L15/06, G10L15/16, G10L15/22, G10L21/0308, G10L25/18

CPC classification number: G10L21/0232, G10L15/063, G10L15/16, G10L15/22, G10L21/0308, G10L25/18, G10L2021/02082

Abstract: A method includes receiving, as input, reference audio data representing a reference audio signal captured by an audio input device. The method also includes receiving, as input, from a beamformer, spatially-filtered audio data representing an output of the beamformer, the beamformer configured to spatially filter, based on additional audio data captured by one or more additional audio input devices, the reference audio data to attenuate one or more interfering signals in the spatially-filtered audio data. The method processes, using a trained guided speech-enhancement network, the reference audio data and the spatially-filtered audio data to generate, as output, enhanced audio data, the guided speech-enhancement network processing the reference audio data and the spatially-filtered audio data to further attenuate, in the enhanced audio data, the one or more interfering signals attenuated by the beamformer.
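
The data flow in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: every name below is hypothetical, the beamformer is replaced by a plain sample-wise average across microphones (delay-and-sum with zero delays), and the trained guided speech-enhancement network is replaced by a placeholder that passes the beamformer output through. The only point shown is that the enhancement stage receives both the reference audio and the spatially-filtered audio.

```python
from typing import Callable, List, Sequence

Signal = List[float]


def average_beamformer(reference: Sequence[float],
                       additional: Sequence[Sequence[float]]) -> Signal:
    """Stand-in beamformer (assumption): sample-wise mean over all microphones."""
    channels = [reference, *additional]
    n = len(reference)
    if any(len(ch) != n for ch in channels):
        raise ValueError("all channels must have the same length")
    return [sum(ch[i] for ch in channels) / len(channels) for i in range(n)]


def placeholder_network(reference: Sequence[float],
                        spatially_filtered: Sequence[float]) -> Signal:
    """Hypothetical stand-in for the trained network: returns the beamformer output."""
    return list(spatially_filtered)


def enhance(reference: Sequence[float],
            additional: Sequence[Sequence[float]],
            network: Callable[[Sequence[float], Sequence[float]], Signal]) -> Signal:
    """Feed both the reference audio and the beamformer output to the network."""
    spatially_filtered = average_beamformer(reference, additional)
    return network(reference, spatially_filtered)


# Two microphones, three samples each (made-up values for illustration).
enhanced = enhance([1.0, 2.0, 3.0], [[3.0, 2.0, 1.0]], placeholder_network)
```

In a real system the `network` argument would be a trained model; it is passed in as a callable here so the sketch stays independent of any particular framework.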
