-
Publication No.: US20240297959A1
Publication date: 2024-09-05
Application No.: US18116981
Filing date: 2023-03-03
Applicant: Zoom Video Communications, Inc.
Inventor: Qiang GAO , Zhaofeng Jia
IPC: H04N7/15 , G10L21/013 , G10L21/0208 , H04N7/14
CPC classification number: H04N7/152 , G10L21/013 , G10L21/0208 , H04N7/147 , G10L2021/0135 , G10L2021/02082
Abstract: Systems and methods for providing voice feedback for video conferences are provided. A computer-implemented method includes joining, by a client device, a video conference hosted by a video conference provider, the video conference having a plurality of participants using a plurality of client devices. With voice feedback enabled, the client device may receive a first audio stream from the video conference provider, the first audio stream associated with a first participant of the plurality of participants and a second audio stream from a user of the client device, the second audio stream comprising a voice of the user of the client device. The client device can play the first audio stream on one or more channels of one or more audio devices connected to the client device and play the second audio stream on the one or more channels of the one or more audio devices.
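A minimal sketch of playing both streams on the same output channel, assuming the two streams are simply summed before playback. The abstract does not say how the streams are combined; the function name `mix_with_voice_feedback`, the `feedback_gain` parameter, and the 16-bit sample format are illustrative assumptions, not details from the application.

```python
INT16_MIN, INT16_MAX = -32768, 32767


def mix_with_voice_feedback(remote, local, feedback_gain=0.5):
    """Sum a remote participant's samples with the user's own voice.

    `remote` and `local` are lists of 16-bit PCM samples. `feedback_gain`
    (hypothetical) scales the user's own voice before mixing.
    """
    length = max(len(remote), len(local))
    mixed = []
    for i in range(length):
        r = remote[i] if i < len(remote) else 0  # pad the shorter stream with silence
        v = local[i] if i < len(local) else 0
        sample = int(round(r + feedback_gain * v))
        # Clamp to the 16-bit range so the sum cannot overflow.
        mixed.append(max(INT16_MIN, min(INT16_MAX, sample)))
    return mixed
```

The result would then be handed to whatever audio output API the client uses; that step is platform specific and omitted here.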
-
Publication No.: US20240195530A1
Publication date: 2024-06-13
Application No.: US18533088
Filing date: 2023-12-07
Applicant: Zoom Video Communications, Inc.
Inventor: Jing Wu , Zhaofeng Jia , Bo Ling , Qiyong Liu
IPC: H04L1/08 , H04L1/00 , H04L1/1867 , H04L43/0829 , H04L65/403 , H04L65/70 , H04L65/80 , H04N7/15 , H04N19/136 , H04N19/172 , H04N19/177 , H04N19/179
CPC classification number: H04L1/08 , H04L1/0009 , H04L1/0045 , H04L1/0076 , H04L1/1867 , H04L43/0829 , H04L65/403 , H04L65/70 , H04L65/80 , H04N7/15 , H04N7/152 , H04N19/136 , H04N19/172 , H04N19/177 , H04N19/179 , H04L1/007 , H04L2001/0093
Abstract: Scrolling motion is detected within a video stream to output an indication of a scrolling motion vector for use in encoding a current picture of the video stream. A first line of pixels within a motion region of the current picture is identified. A second line of pixels matching the first line of pixels is identified within a last played picture of the video stream. The scrolling motion vector is determined based on a comparison of lines of pixels nearby the second line of pixels within the last played picture. The indication of the scrolling motion vector is then output for use in encoding the current picture.
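The line-matching steps in this abstract can be sketched as follows, assuming pictures are lists of pixel rows, scrolling is purely vertical, and matching means exact row equality. The function name, the `confirm_lines` parameter, and the exact-match criterion are illustrative assumptions; the abstract does not specify them.

```python
def estimate_scroll_offset(last_picture, current_picture,
                           region_top, region_bottom, confirm_lines=3):
    """Return offset such that current_picture[y] == last_picture[y + offset]
    for rows near region_top, or None if no confirmed match is found.

    region_top/region_bottom bound the motion region in the current picture
    (bottom exclusive). confirm_lines is a hypothetical tuning parameter.
    """
    probe = current_picture[region_top]  # first line of the motion region
    for matched_y, row in enumerate(last_picture):
        if row != probe:
            continue
        # Confirm the candidate by comparing the lines just after it.
        confirmed = True
        for k in range(1, confirm_lines + 1):
            cur_y = region_top + k
            last_y = matched_y + k
            if cur_y >= region_bottom or last_y >= len(last_picture):
                confirmed = False
                break
            if current_picture[cur_y] != last_picture[last_y]:
                confirmed = False
                break
        if confirmed:
            return matched_y - region_top
    return None
```

A positive offset here means the content moved up by that many rows. An encoder could use the value as a candidate motion vector for the whole region instead of searching block by block.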
-
Publication No.: US20240146876A1
Publication date: 2024-05-02
Application No.: US17978760
Filing date: 2022-11-01
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Yuhui Chen
IPC: H04N7/15 , G10L21/003 , H04L12/18 , H04N7/14
CPC classification number: H04N7/152 , G10L21/003 , H04L12/1822 , H04N7/147
Abstract: Various embodiments of an apparatus, method(s), system(s) and computer program product(s) described herein are directed to a Visualization Engine. The Visualization Engine receives audio data associated with a user account accessing a virtual meeting via a communications environment client software application. The Visualization Engine detects presence of a pre-selected type(s) of audio event(s) in the received audio data. The Visualization Engine generates a visualization representative of at least one attribute of the detected audio event(s). During playback of the audio data in the virtual meeting, the Visualization Engine renders the visualization within the communications environment client software application of the user account.
-
Publication No.: US11847999B2
Publication date: 2023-12-19
Application No.: US17512509
Filing date: 2021-10-27
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Yang Liu , Qiyong Liu
IPC: G10K15/08
CPC classification number: G10K15/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.
-
Publication No.: US11039015B2
Publication date: 2021-06-15
Application No.: US16824642
Filing date: 2020-03-19
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Huipin Zhang
Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is returned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.
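A minimal sketch of the two steps described here, delay identification and far-end removal, assuming the delay is found by cross-correlating the sent and returned calibration signals and that the far-end audio appears in the mix at unit gain. Both assumptions and the function names are illustrative; the abstract does not state how the delay is computed or how the removal is performed, and practical echo cancellers typically use adaptive filtering rather than plain subtraction.

```python
def estimate_delay(sent, returned, max_lag):
    """Return the lag (in samples) at which `returned` best matches `sent`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        # Correlation of the sent signal with the returned signal shifted by `lag`.
        score = sum(s * returned[i + lag]
                    for i, s in enumerate(sent)
                    if i + lag < len(returned))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def remove_far_end(mixed, far_end, delay):
    """Subtract the far-end audio, shifted by `delay`, from the mixed buffer."""
    cleaned = list(mixed)
    for i, sample in enumerate(far_end):
        j = i + delay
        if j < len(cleaned):
            cleaned[j] -= sample
    return cleaned
```

For example, with a calibration signal that comes back three samples late, `estimate_delay` returns 3, and that value is then used to align the far-end audio before subtracting it from the buffer.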
-
Publication No.: US10348454B2
Publication date: 2019-07-09
Application No.: US15727907
Filing date: 2017-10-09
Applicant: Zoom Video Communications, Inc.
Inventor: Qiyong Liu , Zhaofeng Jia , Kai Jin , Jing Wu , Huipin Zhang
Abstract: An error resilience method comprising: using a computer, creating and storing, in computer memory, one or more FEC filter tables for use by the FEC filter for selectively forwarding a FEC packet; selectively forwarding a request for the FEC packet through a FEC filter based on the FEC table and a dynamic packet loss level at a receiver; limiting a re-transmission request for a particular packet through the FEC filter based on a number of re-transmission requests for the particular packet; and selectively skipping a key frame request based on a number of key frame requests received from a plurality of receiver devices, wherein the method is performed by one or more special-purpose computing devices.
-
Publication No.: US11881945B2
Publication date: 2024-01-23
Application No.: US17591346
Filing date: 2022-02-02
Applicant: Zoom Video Communications, Inc.
Inventor: Jing Wu , Zhaofeng Jia , Bo Ling , Qiyong Liu
IPC: H04L1/08 , H04L1/00 , H04L1/1867 , H04N7/15 , H04L65/403 , H04L65/80 , H04N19/136 , H04N19/172 , H04N19/177 , H04N19/179 , H04L65/70 , H04L43/0829
CPC classification number: H04L1/08 , H04L1/0009 , H04L1/0045 , H04L1/0076 , H04L1/1867 , H04L43/0829 , H04L65/403 , H04L65/70 , H04L65/80 , H04N7/15 , H04N7/152 , H04N19/136 , H04N19/172 , H04N19/177 , H04N19/179 , H04L1/007 , H04L2001/0093
Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.
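The "percentage of static macroblocks" step can be sketched as below, assuming a macroblock counts as static when it is pixel-for-pixel identical to the co-located block in the previous picture. The 16x16 default block size, the exact-equality test, and the function name are illustrative assumptions. The abstract does not list the conditions used to pick the coding type, so that selection step is not shown.

```python
def static_macroblock_percentage(previous, current, block=16):
    """Return the percentage of macroblocks unchanged since `previous`.

    Both pictures are lists of pixel rows with identical dimensions.
    Blocks on the right and bottom edges may be smaller than `block`.
    """
    height, width = len(current), len(current[0])
    total = 0
    static = 0
    for top in range(0, height, block):
        for left in range(0, width, block):
            total += 1
            rows = range(top, min(top + block, height))
            # A block is static if every row slice matches the previous picture.
            if all(previous[y][left:left + block] == current[y][left:left + block]
                   for y in rows):
                static += 1
    return 100.0 * static / total
```

A high percentage suggests mostly unchanged screen content, which is the kind of signal the abstract says is combined with the content type when choosing a picture coding type.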
-
Publication No.: US11870940B2
Publication date: 2024-01-09
Application No.: US17341242
Filing date: 2021-06-07
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Huipin Zhang
Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is returned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.
-
Publication No.: US20230100986A1
Publication date: 2023-03-30
Application No.: US17512509
Filing date: 2021-10-27
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Yang Liu , Qiyong Liu
IPC: G10K15/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.
-
Publication No.: US20210297534A1
Publication date: 2021-09-23
Application No.: US17341242
Filing date: 2021-06-07
Applicant: Zoom Video Communications, Inc.
Inventor: Zhaofeng Jia , Huipin Zhang
Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is returned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.