Distance-based framing for an online conference session

    公开(公告)号:US11563783B2

    公开(公告)日:2023-01-24

    申请号:US16993955

    申请日:2020-08-14

    Abstract: Distance-based framing includes obtaining at least a video stream during an online conference session. The video stream, an audio stream received with the video stream, or both the video stream and the audio stream are analyzed and a framing that either focuses on a speaker in the video stream or provides an overview of participants in the video stream, the framing being is composed based on the analyzing. A potential error in the framing is detected based on further analysis of at least one of the video stream, the audio stream, or distance sensor data received with the video stream. The potential error may be contradicted or confirmed based on an amount of motion when the framing focuses on the speaker. If the distance sensor data contradicts the potential error, the framing is maintained, but if the distance sensor data confirms the potential error, a new framing is generated.

    Multiple simultaneous framing alternatives using speaker tracking

    公开(公告)号:US11418758B2

    公开(公告)日:2022-08-16

    申请号:US17112091

    申请日:2020-12-04

    Abstract: In one embodiment, a video conference endpoint may detect a one or more participants within a field of view of a camera of the video conference endpoint. The video conference endpoint may determine one or more alternative framings of an output of the camera of the video conference endpoint based on the detected one or more participants. The video conference endpoint may send the output of the camera of the video conference endpoint to one or more far-end video conference endpoints participating in a video conference with the video conference endpoint. The video conference endpoint may send data descriptive of the one or more alternative framings of the output of the camera to the far-end video conference endpoints. The far-end video conference endpoints may utilize the data to display one of the one or more alternative framings.

    MULTIPLE SIMULTANEOUS FRAMING ALTERNATIVES USING SPEAKER TRACKING

    公开(公告)号:US20210144337A1

    公开(公告)日:2021-05-13

    申请号:US17112091

    申请日:2020-12-04

    Abstract: In one embodiment, a video conference endpoint may detect a one or more participants within a field of view of a camera of the video conference endpoint. The video conference endpoint may determine one or more alternative framings of an output of the camera of the video conference endpoint based on the detected one or more participants. The video conference endpoint may send the output of the camera of the video conference endpoint to one or more far-end video conference endpoints participating in a video conference with the video conference endpoint. The video conference endpoint may send data descriptive of the one or more alternative framings of the output of the camera to the far-end video conference endpoints. The far-end video conference endpoints may utilize the data to display one of the one or more alternative framings.

    Multiple simultaneous framing alternatives using speaker tracking

    公开(公告)号:US10917612B2

    公开(公告)日:2021-02-09

    申请号:US16665386

    申请日:2019-10-28

    Abstract: In one embodiment, a video conference endpoint may detect a one or more participants within a field of view of a camera of the video conference endpoint. The video conference endpoint may determine one or more alternative framings of an output of the camera of the video conference endpoint based on the detected one or more participants. The video conference endpoint may send the output of the camera of the video conference endpoint to one or more far-end video conference endpoints participating in a video conference with the video conference endpoint. The video conference endpoint may send data descriptive of the one or more alternative framings of the output of the camera to the far-end video conference endpoints. The far-end video conference endpoints may utilize the data to display one of the one or more alternative framings.

    Defining content of interest for video conference endpoints with multiple pieces of content

    公开(公告)号:US10397519B1

    公开(公告)日:2019-08-27

    申请号:US16005971

    申请日:2018-06-12

    Abstract: A video conference system may include two or more video conference endpoints, each having a display configured to display content. The video conference system may detect a plurality of participants within a field of view of a camera of the system. The video conference system may determine an attention score for each endpoint based on the participants. The video conference system may determine whether the content of the first endpoint and/or the content of the second endpoint are active content based on whether the attention scores exceed a predetermined threshold value. The video conference system may send to secondary video conference systems an indication of the active content to enable the secondary video conference systems to display the active content.

    AUTOMATIC SWITCHING BETWEEN DYNAMIC AND PRESET CAMERA VIEWS IN A VIDEO CONFERENCE ENDPOINT

    公开(公告)号:US20170099462A1

    公开(公告)日:2017-04-06

    申请号:US15383231

    申请日:2016-12-19

    CPC classification number: H04N7/152 H04N5/23219 H04N7/142 H04N7/147

    Abstract: A video conference endpoint includes a camera to capture video and a microphone array to sense audio. One or more preset views are defined. Images in the captured video are processed with a face detection algorithm to detect faces. Active talkers are detected from the sensed audio. The camera is controlled to capture video from the preset views, and from dynamic views created without user input and which include a dynamic overview and a dynamic close-up view. The camera is controlled to dynamically adjust each of the dynamic views to track changing positions of detected faces over time, and dynamically switch the camera between the preset views, the dynamic overview, and the dynamic close-up view over time based on positions of the detected faces and the detected active talkers relative to the preset views and the dynamic views.

    USE OF FACE AND MOTION DETECTION FOR BEST VIEW FRAMING IN VIDEO CONFERENCE ENDPOINT
    49.
    发明申请
    USE OF FACE AND MOTION DETECTION FOR BEST VIEW FRAMING IN VIDEO CONFERENCE ENDPOINT 审中-公开
    在视频会议终点中使用最佳视图框架进行脸部和运动检测

    公开(公告)号:US20160227163A1

    公开(公告)日:2016-08-04

    申请号:US15059386

    申请日:2016-03-03

    CPC classification number: H04N7/147 G06K9/00255 H04N5/23219 H04N7/15

    Abstract: A video conference endpoint detects faces at associated face positions in video frames capturing a scene. The endpoint frames the video frames to a view of the scene encompassing all of the detected faces. The endpoint detects that a previously detected face is no longer detected. In response, a timeout period is started and independently of detecting faces, motion is detected across the view. It is determined if any detected motion (i) coincides with the face position of the previously detected face that is no longer detected, and (ii) occurs before the timeout period expires. If conditions (i) and (ii) are not both met, the endpoint reframes the view.

    Abstract translation: 视频会议终端检测拍摄场景的视频帧中相关联的脸部位置处的脸部。 端点将视频帧框架到包含所有检测到的面部的场景视图。 端点检测到先前检测到的脸部不再被检测到。 作为响应,开始超时时段并且独立于检测面,在整个视图中检测到运动。 确定任何检测到的运动(i)是否与不再检测到的先前检测到的面部的面部位置一致,并且(ii)在超时时段到期之前发生。 如果条件(i)和(ii)都不满足,则端点重新构造视图。

    Ultrasonic echo canceler-based technique to detect participant presence at a video conference endpoint
    50.
    发明授权
    Ultrasonic echo canceler-based technique to detect participant presence at a video conference endpoint 有权
    基于超声波回波消除器的技术来检测视频会议端点的参与者存在

    公开(公告)号:US09319633B1

    公开(公告)日:2016-04-19

    申请号:US14662691

    申请日:2015-03-19

    Abstract: A loudspeaker transmits an ultrasonic signal into a spatial region. A microphone transduces ultrasonic sound, including an echo of the transmitted ultrasonic signal, received from the spatial region into a received ultrasonic signal. A controller transforms the ultrasonic signal and the received ultrasonic signal into respective time-frequency domains that cover respective ultrasound frequency ranges. The controller computes an error signal, representative of an estimate of an echo-free received ultrasonic signal, based on the transformed ultrasonic signal and the transformed received ultrasonic signal. The controller computes power estimates of the error signal over time, and detects a change in people presence in the spatial region based on a change in the power estimates of the error signal over time.

    Abstract translation: 扬声器将超声波信号发送到空间区域。 麦克风将从空间区域接收的包括发送的超声信号的回波的超声波转换为接收的超声波信号。 控制器将超声波信号和接收到的超声波信号变换成覆盖各个超声频率范围的各个时域。 控制器基于变换的超声波信号和经变换的接收到的超声信号,计算代表无回波接收的超声波信号的估计的误差信号。 控制器随时间计算误差信号的功率估计,并且基于随时间的误差信号的功率估计值的变化来检测人员在空间区域中的存在的变化。

Patent Agency Ranking