Coherence groups: region descriptors for low bit rate encoding
    111.
    Granted invention patent (status: in force)

    Publication No.: US08923640B1

    Publication date: 2014-12-30

    Application No.: US13913169

    Filing date: 2013-06-07

    Applicant: Apple Inc.

    Abstract: The invention is directed to an efficient way of encoding and decoding video. Embodiments include identifying different coding units that share a similar characteristic. The characteristic can be, for example: quantization values, modes, block sizes, color space, motion vectors, depth, facial and non-facial regions, and filter values. An encoder may then group the units together as a coherence group. An encoder may similarly create a table or other data structure of the coding units. An encoder may then extract the commonly repeating characteristic or attribute from the coding units. The encoder may transmit the coherence groups along with the data structure, and other coding units which were not part of a coherence group. The decoder may receive the data, utilize the shared characteristic by storing it locally in a cache for faster repeated decoding, and decode the coherence group together.
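The grouping step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `CodingUnit` type, the choice of the quantization parameter (`qp`) as the shared characteristic, and the `min_size` threshold are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CodingUnit:
    """Hypothetical coding unit; `qp` stands in for the shared characteristic."""
    unit_id: int
    qp: int
    payload: str


def build_coherence_groups(units, min_size=2):
    """Group units by QP. Units whose QP is not shared stay ungrouped."""
    by_qp = defaultdict(list)
    for unit in units:
        by_qp[unit.qp].append(unit)

    groups = {}      # shared QP -> list of (unit_id, payload); QP stored once
    ungrouped = []   # units sent individually, each with its own QP
    for qp, members in by_qp.items():
        if len(members) >= min_size:
            groups[qp] = [(u.unit_id, u.payload) for u in members]
        else:
            ungrouped.extend(members)
    return groups, ungrouped


def decode(groups, ungrouped):
    """Decode each group together, reading the shared QP once per group."""
    decoded = {}
    for qp, members in groups.items():
        for unit_id, payload in members:
            decoded[unit_id] = (qp, payload)
    for unit in ungrouped:
        decoded[unit.unit_id] = (unit.qp, unit.payload)
    return decoded


units = [
    CodingUnit(0, 26, "a"),
    CodingUnit(1, 30, "b"),
    CodingUnit(2, 26, "c"),
    CodingUnit(3, 22, "d"),
]
groups, ungrouped = build_coherence_groups(units)
decoded = decode(groups, ungrouped)
```

In this sketch the saving comes from storing the shared value once per group instead of once per unit; a real encoder would pick the characteristic and the group layout based on bit cost.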

    ERROR RESILIENT MANAGEMENT OF PICTURE ORDER COUNT IN PREDICTIVE CODING SYSTEMS
    112.
    Invention patent application (status: in force)

    Publication No.: US20140086315A1

    Publication date: 2014-03-27

    Application No.: US13626730

    Filing date: 2012-09-25

    Applicant: APPLE INC.

    Abstract: Coding techniques for input video may include assigning picture identifiers to input frames in either long-form or short-form formats. If a network error has occurred that results in loss of previously-coded video data, a new input frame may be assigned a picture identifier that is coded in a long-form coding format. If no network error has occurred, the input frame may be assigned a picture identifier that is coded in a short-form coding format. Long-form coding may mitigate against loss of synchronization between an encoder and a decoder by picture identifiers.
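The long-form versus short-form choice can be illustrated with a small sketch. The bit widths (`SHORT_BITS`, `LONG_BITS`) and the decoder's reconstruction rule are assumptions chosen for the example; the abstract does not specify either.

```python
SHORT_BITS = 4    # assumed width of the short-form identifier
LONG_BITS = 16    # assumed width of the long-form identifier


def encode_picture_id(picture_number, network_error):
    """Use the long form after a reported network error, else the short form."""
    if network_error:
        return ("long", picture_number % (1 << LONG_BITS))
    return ("short", picture_number % (1 << SHORT_BITS))


def decode_picture_id(coded, last_full_id):
    """Recover the full identifier.

    A short-form value is resolved to the smallest identifier greater than
    `last_full_id` with matching low bits (an assumed rule for this sketch).
    """
    form, value = coded
    if form == "long":
        return value
    modulus = 1 << SHORT_BITS
    candidate = (last_full_id & ~(modulus - 1)) | value
    if candidate <= last_full_id:
        candidate += modulus
    return candidate


# No loss: the decoder saw picture 36, so short form resolves picture 37.
in_sync = decode_picture_id(encode_picture_id(37, False), last_full_id=36)

# Pictures 11..36 were lost: short form resolves to the wrong picture.
out_of_sync = decode_picture_id(encode_picture_id(37, False), last_full_id=10)

# Same loss, but the encoder switches to long form after the error report.
recovered = decode_picture_id(encode_picture_id(37, True), last_full_id=10)
```

The second call shows why a short identifier becomes ambiguous once more pictures are lost than the short form can count, and the third shows the long form restoring agreement.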

    TEMPORAL ALIASING REDUCTION AND CODING OF UPSAMPLED VIDEO
    114.
    Invention patent application (status: in force)

    Publication No.: US20130329064A1

    Publication date: 2013-12-12

    Application No.: US13691163

    Filing date: 2012-11-30

    Applicant: APPLE INC.

    Abstract: Embodiments of the invention provide techniques for upsampling a video sequence for coding. According to the method, an estimate of camera motion may be obtained from motion sensor data. Video data may be analyzed to detect motion within frames output from a camera that is not induced by the camera motion. When non-camera motion falls within a predetermined operational limit, video upsampling processes may be engaged. In another embodiment, video upsampling may be performed by twice estimating image content for a hypothetical new frame using two different sources as inputs. A determination may be made whether the two estimates of the frame match each other sufficiently well. If so, the two estimates may be merged to yield a final estimated frame and the new frame may be integrated into a stream of video data.
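The "estimate twice, compare, then merge" step can be sketched as below. Frames are flat lists of pixel values, the match test is a mean absolute difference against `max_diff`, and the merge is a plain average; all three choices are assumptions for illustration, since the abstract does not name a metric or a merge rule.

```python
def mean_abs_diff(frame_a, frame_b):
    """Mean absolute pixel difference between two equally sized frames."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def merge_estimates(est_a, est_b, max_diff=4.0):
    """Merge two estimates of a new frame if they agree closely enough.

    Returns the averaged frame, or None when the estimates disagree and
    no new frame should be inserted into the stream.
    """
    if not est_a or len(est_a) != len(est_b):
        raise ValueError("estimates must be non-empty and the same size")
    if mean_abs_diff(est_a, est_b) > max_diff:
        return None
    return [(a + b) / 2 for a, b in zip(est_a, est_b)]


agreeing = merge_estimates([100, 102, 98], [102, 100, 100])
disagreeing = merge_estimates([0, 0], [50, 50])
```

Returning `None` for mismatched estimates models the case where upsampling is skipped for that frame rather than risking a visibly wrong interpolation.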

    Auto Exposure Techniques for Variable Lighting Conditions
    115.
    Invention patent application (status: in force)

    Publication No.: US20130070151A1

    Publication date: 2013-03-21

    Application No.: US13677887

    Filing date: 2012-11-15

    Applicant: Apple Inc.

    CPC classification number: H04N5/235 H04N5/2351 H04N5/243 H04N2007/145

    Abstract: Systems, methods, and a computer readable medium for performing auto exposure (AE) techniques that are beneficial in variable lighting conditions—and particularly applicable to handheld and/or mobile videoconferencing applications—are disclosed herein. Handheld and/or mobile videoconferencing applications—unlike their fixed camera counterparts—are often exposed to a wide variety of rapidly changing lighting and scene conditions, and thus face a difficult trade-off between adjusting exposure parameter values too frequently or not frequently enough. In personal electronic devices executing such handheld and/or mobile videoconferencing applications, it may be desirable to: use a small, centered, and center-weighted exposure metering region; set a relatively low brightness target value; and adjust the camera's exposure parameter values according to a distance-dependent convergence speed function. The use of such techniques, in conjunction with a relatively large stability region, may also improve the quality of a video encoder's temporal predictions—and thus video quality—in videoconferencing applications.
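One way to picture a distance-dependent convergence speed combined with a stability region is the sketch below. Here "distance" is taken to mean the gap between the measured brightness and the target, and the speed is assumed to grow linearly with that gap; the abstract states neither, so the function shapes and all constants are hypothetical.

```python
def convergence_speed(distance, min_speed=0.1, max_speed=0.5, full_scale=100.0):
    """Assumed speed curve: slow near the target, faster when far away."""
    fraction = min(distance / full_scale, 1.0)
    return min_speed + (max_speed - min_speed) * fraction


def next_brightness(current, target, stability=8.0):
    """Advance brightness one step toward the target.

    Inside the stability region nothing changes, which keeps successive
    frames similar and helps the encoder's temporal prediction.
    """
    distance = abs(target - current)
    if distance <= stability:
        return current
    return current + convergence_speed(distance) * (target - current)


held = next_brightness(100.0, 105.0)     # within the stability region
far_step = next_brightness(0.0, 100.0)   # far from target: large step
near_step = next_brightness(50.0, 100.0) # closer: smaller fractional step
```

A wider `stability` value trades exposure accuracy for fewer brightness changes between frames, which is the trade-off the abstract describes.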

    Immersive video streaming using view-adaptive prefetching and buffer control

    Publication No.: US11570417B2

    Publication date: 2023-01-31

    Application No.: US17326204

    Filing date: 2021-05-20

    Applicant: Apple Inc.

    Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
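The two-stage buffering described here can be sketched as follows. The tile names, the `"low"`/`"high"` labels, and the dictionary used as the video buffer are assumptions for the example; the abstract does not say how the content is partitioned.

```python
def fill_buffer(tiles, predicted_viewport):
    """Buffer low detail for every tile, then upgrade the predicted viewport."""
    # First times: store the lower level of detail for all content.
    buffer = {tile: "low" for tile in tiles}
    # Second times: store higher detail only where the user is predicted to look.
    for tile in predicted_viewport:
        if tile in buffer:
            buffer[tile] = "high"
    return buffer


def render(buffer, actual_viewport):
    """At display time, show the best detail buffered for each visible tile."""
    return [(tile, buffer[tile]) for tile in actual_viewport]


buffer = fill_buffer(["A", "B", "C", "D"], predicted_viewport=["B", "C"])
shown = render(buffer, actual_viewport=["C", "D"])
```

When the prediction misses, as for tile `D` above, the viewer still sees the low-detail version instead of a gap, which is the reason for buffering the first data everywhere.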

    Instant Video Communication Connections

    Publication No.: US20220286644A1

    Publication date: 2022-09-08

    Application No.: US17826116

    Filing date: 2022-05-26

    Applicant: Apple Inc.

    Abstract: Computing devices may implement instant video communication connections for video communications. Connection information for mobile computing devices may be maintained. A request to initiate an instant video communication may be received, and if authorized, the connection information for the particular recipient mobile computing device may be accessed. Video communication data may then be sent to the recipient mobile computing device according to the connection information so that the video communication data may be displayed at the recipient device as it is received. New connection information for different mobile computing devices may be added, or updates to existing connection information may also be performed. Connection information for some mobile computing devices may be removed.

    SPHERE PROJECTED MOTION ESTIMATION/COMPENSATION AND MODE DECISION

    Publication No.: US20210321133A1

    Publication date: 2021-10-14

    Application No.: US17207267

    Filing date: 2021-03-19

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
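The final step, turning a spherical-domain offset into a source-domain motion vector, can be sketched under the assumption that the source picture uses an equirectangular layout. The abstract does not name a projection, so the mapping functions below are illustrative only, and the spherical search itself is not shown.

```python
import math


def to_sphere(x, y, width, height):
    """Map an equirectangular pixel position to (longitude, latitude) in radians."""
    lon = (x / width) * 2 * math.pi - math.pi
    lat = math.pi / 2 - (y / height) * math.pi
    return lon, lat


def to_source(lon, lat, width, height):
    """Inverse of to_sphere: map (longitude, latitude) back to pixel position."""
    x = (lon + math.pi) / (2 * math.pi) * width
    y = (math.pi / 2 - lat) / math.pi * height
    return x, y


def motion_vector(block_xy, sphere_offset, width, height):
    """Convert a spherical offset (d_lon, d_lat) into a source-domain vector."""
    lon, lat = to_sphere(block_xy[0], block_xy[1], width, height)
    # Wrap longitude into [-pi, pi) and clamp latitude at the poles.
    lon2 = (lon + sphere_offset[0] + math.pi) % (2 * math.pi) - math.pi
    lat2 = max(-math.pi / 2, min(math.pi / 2, lat + sphere_offset[1]))
    x2, y2 = to_source(lon2, lat2, width, height)
    return x2 - block_xy[0], y2 - block_xy[1]


# Block at the picture centre of a 360x180 picture, matched 10 degrees east
# and 5 degrees south on the sphere.
mv = motion_vector((180, 90), (math.radians(10), math.radians(-5)), 360, 180)
```

With one pixel per degree, the example yields a vector of roughly (10, 5); image rows grow downward, so a southward shift gives a positive vertical component. A match that crosses the longitude seam would need extra handling that this sketch omits.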
