-
公开(公告)号:US20110249729A1
公开(公告)日:2011-10-13
申请号:US12794580
申请日:2010-06-04
IPC分类号: H04N7/26
CPC分类号: H04N19/105 , H04N19/114 , H04N19/166 , H04N19/58 , H04N19/89
摘要: Embodiments of the present invention provide a video encoding system that codes video sequence into a multi-level hierarchy based on levels of long term reference (LTR) frames. According to the present invention, an encoder designates a reference frame as a long term reference (LTR) frame and transmits the LTR frame to a receiver. Upon receiving feedback from the receiver acknowledging receipt of the LTR frame, the encoder periodically codes subsequent frames as reference frames using the acknowledged LTR frame as a reference and designates subsequent reference frames as secondary LTR frames. A determined number of frames after each secondary LTR frame may be coded using a preceding secondary LTR frame as a reference.
摘要翻译: 本发明的实施例提供了一种视频编码系统,其基于长期参考(LTR)帧的级别将视频序列编码为多级层级。 根据本发明,编码器将参考帧指定为长期参考(LTR)帧,并将LTR帧发送到接收机。 在接收到来自确认接收到LTR帧的接收机的反馈时,编码器使用确认的LTR帧作为参考,将后续帧定时作为参考帧,并将后续参考帧指定为辅助LTR帧。 每个辅助LTR帧之后的确定数量的帧可以使用先前的次级LTR帧作为参考进行编码。
-
公开(公告)号:US10205953B2
公开(公告)日:2019-02-12
申请号:US13359377
申请日:2012-01-26
申请人: Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang
发明人: Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang
IPC分类号: H04N19/20 , H04N19/176 , H04N19/115 , H04N19/124 , H04N19/137 , H04N19/17 , G06K9/34 , G06K9/00 , G06K9/46 , G06K9/62 , G06T7/10 , G06T7/11 , G06T7/12 , G06T7/13 , G06T7/136 , G06T7/181 , G06T7/143 , G06T7/187 , G06T7/194
摘要: Embodiments of the present invention provide techniques for coding video data efficiently based on detection of objects within video sequences. A video coder may perform object detection on the frame and when an object is detected, develop statistics of an area of the frame in which the object is located. The video coder may compare pixels adjacent to the object location to the object's statistics and may define an object region to include pixel blocks corresponding to the object's location and pixel blocks corresponding to adjacent pixels having similar statistics as the detected object. The coder may code the video frame according to a block-based compression algorithm wherein pixel blocks of the object region are coded according to coding parameters generating relatively high quality coding and pixel blocks outside the object region are coded according to coding parameters generating relatively lower quality coding.
-
公开(公告)号:US20130195178A1
公开(公告)日:2013-08-01
申请号:US13359377
申请日:2012-01-26
申请人: Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang
发明人: Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang
IPC分类号: H04N7/26
CPC分类号: H04N19/20 , G06K9/00221 , G06K9/00234 , G06K9/00248 , G06K9/0061 , G06K9/342 , G06K9/4638 , G06K9/6202 , G06T7/10 , G06T7/11 , G06T7/12 , G06T7/13 , G06T7/136 , G06T7/143 , G06T7/181 , G06T7/187 , G06T7/194 , G06T2207/20012 , G06T2207/20016 , G06T2207/20116 , G06T2207/20121 , G06T2207/20124 , G06T2207/20128 , G06T2207/20132 , G06T2207/20152 , G06T2207/20156 , G06T2207/20161 , G06T2207/20164 , G06T2207/20168 , G06T2207/30201 , H04N19/115 , H04N19/124 , H04N19/137 , H04N19/17 , H04N19/176
摘要: Embodiments of the present invention provide techniques for coding video data efficiently based on detection of objects within video sequences. A video coder may perform object detection on the frame and when an object is detected, develop statistics of an area of the frame in which the object is located. The video coder may compare pixels adjacent to the object location to the object's statistics and may define an object region to include pixel blocks corresponding to the object's location and pixel blocks corresponding to adjacent pixels having similar statistics as the detected object. The coder may code the video frame according to a block-based compression algorithm wherein pixel blocks of the object region are coded according to coding parameters generating relatively high quality coding and pixel blocks outside the object region are coded according to coding parameters generating relatively lower quality coding.
摘要翻译: 本发明的实施例提供了基于视频序列内的对象的检测来有效地对视频数据进行编码的技术。 视频编码器可以在帧上执行对象检测,并且当检测到对象时,开发对象所在的帧的区域的统计。 视频编码器可以将与物体位置相邻的像素与对象的统计信息进行比较,并且可以定义对象区域以包括对应于对象的位置的像素块和对应于具有与检测对象相似的统计信息的相邻像素的像素块。 编码器可以根据基于块的压缩算法对视频帧进行编码,其中根据产生相对较高质量编码的编码参数来编码对象区域的像素块,并且根据生成相对较低质量的编码参数对目标区域外的像素块进行编码 编码。
-
4.
公开(公告)号:US20130329799A1
公开(公告)日:2013-12-12
申请号:US13755928
申请日:2013-01-31
申请人: Yao-Chung Lin , Xiaosong Zhou , Hsi-Jung Wu , Douglas Scott Price , Chris Y. Chung , Dazhong Zhang
发明人: Yao-Chung Lin , Xiaosong Zhou , Hsi-Jung Wu , Douglas Scott Price , Chris Y. Chung , Dazhong Zhang
IPC分类号: H04N7/26
CPC分类号: H04N19/51 , H04N19/503
摘要: Video coders may perform perspective transformation of reference frames during coding in a manner that conserves processing resources. When a new input frame is available for coding, a camera position for the input frame may be estimated. A video coder may search for reference pictures having similar camera positions as the position of the input frame and, for each reference picture identified, the video coder may perform a prediction search to identify a reference picture that is the best prediction match for the input frame. Once the video coder identifies a reference picture to serve as a prediction source for the input frame, the video coder may derive a transform to match the reference frame data to the input frame data and may transform the reference picture accordingly. The video coder may code the input frame using the transformed reference picture as a prediction reference and may transmit coded frame data and the camera position of the input frame to a decoder. Thus, the video coder may perform derivation and execution of transforms on a limited basis which conserves system resources.
摘要翻译: 视频编码器可以在编码期间以保存处理资源的方式执行参考帧的透视变换。 当新的输入帧可用于编码时,可以估计用于输入帧的摄像机位置。 视频编码器可以搜索具有与输入帧的位置相似的相机位置的参考图像,并且对于识别的每个参考图像,视频编码器可以执行预测搜索以识别作为输入帧的最佳预测匹配的参考图像 。 一旦视频编码器识别用作输入帧的预测源的参考图像,则视频编码器可以导出将参考帧数据与输入帧数据相匹配的变换,并且可以相应地变换参考图像。 视频编码器可以使用变换的参考图片作为预测参考来对输入帧进行编码,并且可以将编码的帧数据和输入帧的摄像机位置发送到解码器。 因此,视频编码器可以在有限的基础上进行变换的推导和执行,从而节省系统资源。
-
公开(公告)号:US08842723B2
公开(公告)日:2014-09-23
申请号:US12986703
申请日:2011-01-07
申请人: Ke Zhang , Dazhong Zhang , Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou
发明人: Ke Zhang , Dazhong Zhang , Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou
摘要: A video coding/decoding system builds implied reference frames from a plurality of reference frames developed during coding. Coded data of reference pictures are decoded and stored in a reference picture cache. An implied reference frame may be derived from a plurality of reference frames and may be stored in the reference picture cache. Thereafter, coding of new input data may proceed using the implied reference frame as a source of prediction. The method may be used to identify visual elements such as background elements that may persist in video over a longer period that can be captured by the system under conventional reference frames assembly and eviction. Implied reference frames may be built at both an encoder and a decoder to serve as sources of prediction.
摘要翻译: 视频编码/解码系统从编码期间开发的多个参考帧构建隐含的参考帧。 参考图像的编码数据被解码并存储在参考图像缓存中。 可以从多个参考帧导出隐含的参考帧,并且可以将其存储在参考图像高速缓存中。 此后,可以使用隐含参考帧作为预测源来进行新输入数据的编码。 该方法可以用于识别视觉元素,例如可以在可在常规参考帧组装和驱逐下由系统捕获的较长时间段内持续存在视频的背景元素。 可以在编码器和解码器两者构建隐含的参考帧以用作预测的源。
-
公开(公告)号:US08493499B2
公开(公告)日:2013-07-23
申请号:US12794475
申请日:2010-06-04
申请人: Xiaosong Zhou , Douglas Scott Price , Hsi-Jung Wu , Dazhong Zhang
发明人: Xiaosong Zhou , Douglas Scott Price , Hsi-Jung Wu , Dazhong Zhang
CPC分类号: H04N5/232 , H04N19/102 , H04N19/117 , H04N19/134 , H04N19/146 , H04N19/154 , H04N19/85
摘要: Embodiments of the present invention provide a video encoding system in which a video coding engine establishes coding quality metrics that govern its own operation as well as the operation of a camera and/or a pre-processor. An imaging system may include an image acquisition system, a pre-processor and a coding engine. The coding engine may output a quality indicator identifying, for each portion of a video sequence currently being coded, a relatively level of coding quality that is being achieved. The imaging system further may include an image acquisition controller and a pre-processor controller that impose respective operating parameters upon the image acquisition system and the pre-processor in response to these quality indicators. In this manner, overall performance of the imaging system may be improved.
摘要翻译: 本发明的实施例提供了一种视频编码系统,其中视频编码引擎建立控制其自身操作以及照相机和/或预处理器的操作的编码质量度量。 成像系统可以包括图像采集系统,预处理器和编码引擎。 编码引擎可以输出质量指示符,对于正在编码的视频序列的每个部分,识别正在实现的相对级别的编码质量。 成像系统还可以包括图像采集控制器和预处理器控制器,其响应于这些质量指示器,在图像采集系统和预处理器上施加相应的操作参数。 以这种方式,可以提高成像系统的整体性能。
-
公开(公告)号:US20120170654A1
公开(公告)日:2012-07-05
申请号:US12986703
申请日:2011-01-07
申请人: Ke Zhang , Dazhong Zhang , Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou
发明人: Ke Zhang , Dazhong Zhang , Douglas Scott Price , Hsi-Jung Wu , Xiaosong Zhou
摘要: A video coding/decoding system builds implied reference frames from a plurality of reference frames developed during coding. Coded data of reference pictures are decoded and stored in a reference picture cache. An implied reference frame may be derived from a plurality of reference frames and may be stored in the reference picture cache. Thereafter, coding of new input data may proceed using the implied reference frame as a source of prediction. The method may be used to identify visual elements such as background elements that may persist in video over a longer period that can be captured by the system under conventional reference frames assembly and eviction. Implied reference frames may be built at both an encoder and a decoder to serve as sources of prediction.
摘要翻译: 视频编码/解码系统从编码期间开发的多个参考帧构建隐含的参考帧。 参考图像的编码数据被解码并存储在参考图像缓存中。 可以从多个参考帧导出隐含的参考帧,并且可以将其存储在参考图像高速缓存中。 此后,可以使用隐含参考帧作为预测源来进行新输入数据的编码。 该方法可以用于识别视觉元素,例如可以在可在常规参考帧组装和驱逐下由系统捕获的较长时间段内持续存在视频的背景元素。 可以在编码器和解码器两者构建隐含的参考帧以用作预测的源。
-
公开(公告)号:US09402034B2
公开(公告)日:2016-07-26
申请号:US13558309
申请日:2012-07-25
申请人: Douglas Scott Price , Xiaosong Zhou , Hsi-Jung Wu
发明人: Douglas Scott Price , Xiaosong Zhou , Hsi-Jung Wu
IPC分类号: H04N5/235 , H04N19/172 , H04N19/169 , H04N19/61 , H04N19/87
CPC分类号: H04N5/2351 , H04N5/2353 , H04N19/169 , H04N19/172 , H04N19/61 , H04N19/87
摘要: Techniques for adjusting exposure parameters of a camera such that video data captured by the camera may be coded efficiently. A camera with auto exposure control may capture and output frames of video. A pre-processor may estimate brightness of the frames of the video output from the camera. A controller may estimate a rate of brightness change among the frames, and when the rate of change is lower than a predetermined threshold, the controller may reduce sensitivity of the auto exposure control. A coding engine may predictively code the video.
摘要翻译: 用于调整照相机的曝光参数的技术,使得由相机拍摄的视频数据可以被有效地编码。 具有自动曝光控制的照相机可以捕获和输出视频帧。 预处理器可以估计从相机输出的视频的帧的亮度。 控制器可以估计帧之间的亮度变化率,并且当变化率低于预定阈值时,控制器可以降低自动曝光控制的灵敏度。 编码引擎可以预测性地对视频进行编码。
-
公开(公告)号:US20130051467A1
公开(公告)日:2013-02-28
申请号:US13591637
申请日:2012-08-22
申请人: Xiaosong Zhou , Douglas Scott Price , Hsi-Jung Wu
发明人: Xiaosong Zhou , Douglas Scott Price , Hsi-Jung Wu
IPC分类号: H04N7/34
CPC分类号: H04N19/182 , H04N19/105 , H04N19/107 , H04N19/14
摘要: Embodiments of the present invention provide techniques for efficiently coding/decoding video data during circumstances where no single coding mode is appropriate. A coder may predict content of an input pixel block according to a prediction technique for intra-coding and obtain a first predicted pixel block therefrom. The coder may predict content of the input pixel block according to a prediction technique for inter-coding and obtain a second predicted pixel block therefrom. The coder may average the first and second predicted pixel blocks by weighted averaging. The weight of the first predicted pixel block may be inversely proportional to the weight of the second predicted pixel block coding. The coder may predictively code the input pixel block based on a third predicted pixel block obtained by the averaging.
摘要翻译: 本发明的实施例提供了在没有单个编码模式是适当的情况下有效地对视频数据进行编码/解码的技术。 编码器可以根据用于帧内编码的预测技术来预测输入像素块的内容,并从其获得第一预测像素块。 编码器可以根据用于帧间编码的预测技术来预测输入像素块的内容,并从其获得第二预测像素块。 编码器可以通过加权平均来平均第一和第二预测像素块。 第一预测像素块的权重可以与第二预测像素块编码的权重成反比。 编码器可以基于通过平均获得的第三预测像素块来预测性地对输入像素块进行编码。
-
公开(公告)号:US20130027581A1
公开(公告)日:2013-01-31
申请号:US13558309
申请日:2012-07-25
申请人: Douglas Scott Price , Xiaosong Zhou , Hsi-Jung Wu
发明人: Douglas Scott Price , Xiaosong Zhou , Hsi-Jung Wu
CPC分类号: H04N5/2351 , H04N5/2353 , H04N19/169 , H04N19/172 , H04N19/61 , H04N19/87
摘要: Techniques for adjusting exposure parameters of a camera such that video data captured by the camera may be coded efficiently. A camera with auto exposure control may capture and output frames of video. A pre-processor may estimate brightness of the frames of the video output from the camera. A controller may estimate a rate of brightness change among the frames, and when the rate of change is lower than a predetermined threshold, the controller may reduce sensitivity of the auto exposure control. A coding engine may predictively code the video.
摘要翻译: 用于调整照相机的曝光参数的技术,使得由相机拍摄的视频数据可以被有效地编码。 具有自动曝光控制的照相机可以捕获和输出视频帧。 预处理器可以估计从相机输出的视频的帧的亮度。 控制器可以估计帧之间的亮度变化率,并且当变化率低于预定阈值时,控制器可以降低自动曝光控制的灵敏度。 编码引擎可以预测性地对视频进行编码。
-
-
-
-
-
-
-
-
-