-
Publication No.: US20240380865A1
Publication Date: 2024-11-14
Application No.: US18316136
Filing Date: 2023-05-11
Applicant: Google LLC
Inventor: Jamie Menjay Lin, Yu-Hui Chen
IPC: H04N7/15, G06F3/04842, G06V10/56, G06V10/74, H04N7/14
Abstract: Methods and systems for user-selected viewpoint rendering of a virtual meeting are provided herein. First image data generated by a first client device during a virtual meeting and second image data generated by a second client device during the virtual meeting are obtained. The first image data depicts object(s) captured from a first vantage point and the second image data depicts the object(s) captured from a second vantage point. A request is received from a third client device for third image data depicting the object(s) captured from a third vantage point. The third image data depicting the object(s) corresponding to the third vantage point is generated based on the first image data and the second image data. A rendering of the third image data is provided for presentation via a graphical user interface (GUI) of the third client device during the virtual meeting in accordance with the request.
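Illustrative sketch (not taken from the application): the data flow described above can be mimicked with a deliberately naive stand-in for view generation. The abstract does not say how the third image data is generated; the inverse-distance blend below is an assumption chosen only to show two source frames and a requested vantage point producing a third frame. The names `Image`, `Vantage`, and `synthesize_view`, and the 2-D position model, are hypothetical.

```python
from typing import List, Tuple

Image = List[List[float]]       # grayscale rows, values in [0, 1] (assumed format)
Vantage = Tuple[float, float]   # hypothetical 2-D camera position


def _distance(a: Vantage, b: Vantage) -> float:
    return ((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2) ** 0.5


def synthesize_view(img1: Image, v1: Vantage,
                    img2: Image, v2: Vantage,
                    requested: Vantage) -> Image:
    """Blend two same-sized frames, weighting the closer vantage point more.

    This is a placeholder for real novel-view synthesis, which would use
    geometry or a learned model rather than a pixel-wise blend.
    """
    d1 = _distance(requested, v1)
    d2 = _distance(requested, v2)
    if d1 + d2 == 0.0:
        # Both sources sit exactly at the requested point; return a copy of the first.
        return [row[:] for row in img1]
    w1 = d2 / (d1 + d2)   # closer to v1 -> larger weight on img1
    w2 = 1.0 - w1
    return [[w1 * p + w2 * q for p, q in zip(r1, r2)]
            for r1, r2 in zip(img1, img2)]


# Example: a third client requests a viewpoint halfway between the two sources.
first_frame = [[0.0, 0.0]]
second_frame = [[1.0, 1.0]]
third_frame = synthesize_view(first_frame, (0.0, 0.0),
                              second_frame, (2.0, 0.0),
                              requested=(1.0, 0.0))
```

In a real system the server would run this step per frame for each requesting client and stream the result to that client's GUI; the blend here only stands in for that generation step.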
-
Publication No.: US20240249741A1
Publication Date: 2024-07-25
Application No.: US18159679
Filing Date: 2023-01-25
Applicant: Google LLC
Inventor: George Chiachi Sung, Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin
IPC: G10L21/0232, G10L15/06, G10L15/16, G10L15/22, G10L21/0308, G10L25/18
CPC classification number: G10L21/0232, G10L15/063, G10L15/16, G10L15/22, G10L21/0308, G10L25/18, G10L2021/02082
Abstract: A method includes receiving, as input, reference audio data representing a reference audio signal captured by an audio input device. The method also includes receiving, as input, from a beamformer, spatially-filtered audio data representing an output of the beamformer, the beamformer configured to spatially filter, based on additional audio data captured by one or more additional audio input devices, the reference audio data to attenuate one or more interfering signals in the spatially-filtered audio data. The method processes, using a trained guided speech-enhancement network, the reference audio data and the spatially-filtered audio data to generate, as output, enhanced audio data, the guided speech-enhancement network processing the reference audio data and the spatially-filtered audio data to further attenuate, in the enhanced audio data, the one or more interfering signals attenuated by the beamformer.
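Illustrative sketch (not taken from the application): the two-input data flow above can be outlined with a simple delay-and-sum beamformer feeding a guided enhancement step. The abstract specifies a trained network, which cannot be reproduced here; `guided_enhance` is a hand-written heuristic standing in for it, and all function names, the integer-delay model, and the frame size are assumptions.

```python
from typing import List, Sequence


def delay_and_sum(reference: Sequence[float],
                  additional: Sequence[Sequence[float]],
                  delays: Sequence[int]) -> List[float]:
    """Shift each additional channel by an integer sample delay and average it
    with the reference channel (a basic delay-and-sum beamformer)."""
    out = list(reference)
    for channel, delay in zip(additional, delays):
        for i in range(len(out)):
            j = i + delay
            if 0 <= j < len(channel):
                out[i] += channel[j]
    return [v / (1 + len(additional)) for v in out]


def guided_enhance(reference: Sequence[float],
                   beamformed: Sequence[float],
                   frame: int = 4,
                   eps: float = 1e-8) -> List[float]:
    """Heuristic placeholder for the trained guided speech-enhancement network.

    Per frame, the reference is scaled by min(1, sqrt(E_beamformed / E_reference)),
    so frames where the beamformer removed energy are attenuated further.
    """
    enhanced: List[float] = []
    for start in range(0, len(reference), frame):
        ref = reference[start:start + frame]
        bf = beamformed[start:start + frame]
        e_ref = sum(x * x for x in ref)
        e_bf = sum(x * x for x in bf)
        gain = min(1.0, (e_bf / (e_ref + eps)) ** 0.5)
        enhanced.extend(gain * x for x in ref)
    return enhanced


# Contrived example: the interferer reaches the second microphone with opposite
# sign, so delay-and-sum cancels it while the speech adds coherently.
speech = [1.0, -1.0, 1.0, -1.0, 0.0, 0.0, 0.0, 0.0]
interference = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0]
reference_mic = [s + n for s, n in zip(speech, interference)]
second_mic = [s - n for s, n in zip(speech, interference)]

beamformed = delay_and_sum(reference_mic, [second_mic], delays=[0])
enhanced = guided_enhance(reference_mic, beamformed)
```

The point of the sketch is the interface: the enhancement stage receives both the raw reference audio and the beamformer output, and uses the second to decide where to suppress the first.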
-
Publication No.: US20240111572A1
Publication Date: 2024-04-04
Application No.: US17935709
Filing Date: 2022-09-27
Applicant: Google LLC
Inventor: Jamie Menjay Lin, Chuo-Ling Chang
CPC classification number: G06F9/4881, G06F9/463
Abstract: A method includes processing a stream of data in a sequence of tasks. The processing includes receiving a first block of data of the stream of data; determining features associated with the first block of data; selecting, based on the features, one of a first task to process the first block of data or a second task to process the first block of data; and, if the second task is selected, shifting an output of the second task in time to align the output of the second task with a predicted output of the first task processing a second block of data of the stream of data.
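Illustrative sketch (not taken from the application): one way to read the select-then-shift step is that the two tasks have different latencies, so the output of the second task must be moved on the output timeline to stay contiguous with what the first task would emit for the next block. The mean-energy feature, the threshold, the latency constants, and both task bodies below are assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import List

BLOCK_LEN = 4             # samples per block (assumed)
FIRST_TASK_LATENCY = 2    # assumed output delay of the first task, in samples
SECOND_TASK_LATENCY = 0   # assumed output delay of the second task, in samples


@dataclass
class Output:
    start: int            # sample index where this output begins on the output timeline
    samples: List[float]
    task: str


def block_energy(block: List[float]) -> float:
    """Feature used for task selection: mean squared amplitude."""
    return sum(x * x for x in block) / len(block)


def first_task(block: List[float]) -> List[float]:
    return [2.0 * x for x in block]   # stand-in for a heavier task


def second_task(block: List[float]) -> List[float]:
    return list(block)                # stand-in for a lighter task


def process_stream(blocks: List[List[float]], threshold: float = 0.5) -> List[Output]:
    outputs: List[Output] = []
    for index, block in enumerate(blocks):
        t = index * BLOCK_LEN         # input time of this block
        if block_energy(block) >= threshold:
            outputs.append(Output(t + FIRST_TASK_LATENCY, first_task(block), "first"))
            continue
        # Second task selected: predict where the first task would place the
        # NEXT block's output, then shift this output to end exactly there.
        natural_start = t + SECOND_TASK_LATENCY
        predicted_next_first_start = (t + BLOCK_LEN) + FIRST_TASK_LATENCY
        shift = predicted_next_first_start - (natural_start + BLOCK_LEN)
        outputs.append(Output(natural_start + shift, second_task(block), "second"))
    return outputs


stream = [[1.0] * BLOCK_LEN, [0.1] * BLOCK_LEN, [1.0] * BLOCK_LEN]
results = process_stream(stream)
```

With these assumed latencies the shift is two samples, so the outputs stay evenly spaced regardless of which task handled each block.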
-