ACCELEROMETER / GYRO-FACILITATED VIDEO STABILIZATION
    21.
    Invention application
    Legal status: in force

    Publication No.: US20110234825A1

    Publication date: 2011-09-29

    Application No.: US12755620

    Filing date: 2010-04-07

    IPC classification: H04N5/228

    Abstract: Embodiments of the present invention provide a control system for video processes that selectively controls the operation of motion stabilization processes. According to the present invention, motion sensor data indicative of motion of a mobile device may be received and processed. A determination may be made by comparing the processed motion sensor data to a threshold. Based on the determination, motion stabilization may be suspended on select portions of a captured video sequence.
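
The threshold test described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed method: the `GyroSample` type, the mean-magnitude processing step, and the choice to suspend stabilization when motion falls below the threshold are all assumptions, since the abstract does not state how the sensor data is processed or in which direction the comparison runs.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class GyroSample:
    """One gyroscope reading (hypothetical type; angular rates in rad/s)."""
    x: float
    y: float
    z: float


def motion_magnitude(samples: Sequence[GyroSample]) -> float:
    """Reduce the samples captured during one frame to a mean rate magnitude."""
    if not samples:
        return 0.0
    total = sum(math.sqrt(s.x ** 2 + s.y ** 2 + s.z ** 2) for s in samples)
    return total / len(samples)


def stabilization_mask(frames: Sequence[Sequence[GyroSample]],
                       threshold: float) -> List[bool]:
    """Return one flag per frame: True = stabilize, False = suspend.

    Assumption: stabilization is suspended when processed motion is below
    the threshold (the device is considered steady enough).
    """
    return [motion_magnitude(samples) >= threshold for samples in frames]
```

A caller would pass the per-frame sensor samples and skip the stabilization stage for every frame whose flag is `False`.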

    H.264/AVC coder incorporating rate and quality controller
    22.
    Granted invention patent
    Legal status: in force

    Publication No.: US07986731B2

    Publication date: 2011-07-26

    Application No.: US10811983

    Filing date: 2004-03-30

    IPC classification: H04N7/12

    Abstract: A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual-buffer-based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.
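
The two-proposal structure in this abstract can be illustrated with a short sketch. Everything below is an assumption made for illustration: the linear mapping from buffer fullness to a quantizer parameter, the least-squares extrapolation over the picture index, and the clamping rule used as the "comparison" are not taken from the patent. The 0 to 51 range is the H.264 quantization parameter range.

```python
from typing import Sequence

QP_MIN, QP_MAX = 0, 51  # H.264 quantization parameter range


def qp_from_virtual_buffer(fullness_bits: float, buffer_size_bits: float) -> float:
    """First proposal: a fuller virtual buffer asks for coarser quantization."""
    ratio = min(max(fullness_bits / buffer_size_bits, 0.0), 1.0)
    return QP_MIN + ratio * (QP_MAX - QP_MIN)


def qp_from_regression(history: Sequence[float]) -> float:
    """Second proposal: least-squares line through the QPs of previously
    coded pictures of the same type, extrapolated to the next picture."""
    n = len(history)
    if n == 0:
        raise ValueError("need at least one previously coded picture")
    if n == 1:
        return float(history[0])
    mean_x = (n - 1) / 2
    mean_y = sum(history) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(history))
    return mean_y + (sxy / sxx) * (n - mean_x)


def final_qp(buffer_qp: float, regression_qp: float, max_delta: float = 2.0) -> int:
    """Hypothetical policy: follow the buffer-based proposal, but keep it
    within max_delta of the regression proposal to limit quality swings."""
    qp = min(max(buffer_qp, regression_qp - max_delta), regression_qp + max_delta)
    return int(round(min(max(qp, QP_MIN), QP_MAX)))
```

Clamping is only one way to reconcile the two proposals; a weighted average or a per-picture-type rule would fit the abstract equally well.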

    Joint Bandwidth Detection Algorithm For Real-Time Communication
    23.
    Invention application
    Legal status: in force

    Publication No.: US20110149087A1

    Publication date: 2011-06-23

    Application No.: US12646065

    Filing date: 2009-12-23

    IPC classification: H04N7/26 H04N5/225

    Abstract: A video coding system and method for increasing a transmitted output bit rate of a video encoding system by altering the content of the bit stream. A video encoder may receive a coding mode signal from a computer application for coding source video data, the coding mode signal indicating a target bit rate having a risk factor related to transmission error associated with the target bit rate. The coded bitstream may be modified based on the risk factor indicated in the coding mode signal. A modified coded bitstream may be outputted at the target bit rate and at a reduced coding efficiency, and the channel may be tested for transmission errors. Based on the test results, a revised coding mode signal indicating the same target bit rate but a revised risk factor may be provided. The coded bitstream may be revised by removing the modifications previously made to the coded bitstream, and a revised coded bitstream having greater coding efficiency may be output at the target bit rate.

    Facial Pose Improvement with Perspective Distortion Correction
    24.
    Invention application
    Legal status: in force

    Publication No.: US20110090303A1

    Publication date: 2011-04-21

    Application No.: US12581043

    Filing date: 2009-10-16

    IPC classification: H04N7/15 H04N5/217

    Abstract: Methods, systems, and apparatus are presented for reducing distortion in an image, such as a video image. A video image can be captured by an image capture device, e.g., during a video conferencing session. Distortion correction processing, such as the application of one or more warping techniques, can be applied to the captured image to produce a distortion-corrected image, which can be transmitted to one or more participants. The warping techniques can be performed in accordance with one or more warp parameters specifying a transformation of the captured image. Further, the warp parameters can be generated in accordance with an orientation of the image capture device, which can be determined based on sensor data or can be a fixed value. Additionally or alternatively, the warp parameters can be determined in accordance with a reference image or model to which the captured image should be warped.

    IMAGE ACQUISITION AND ENCODING SYSTEM
    25.
    Invention application
    Legal status: pending (published)

    Publication No.: US20100309987A1

    Publication date: 2010-12-09

    Application No.: US12533927

    Filing date: 2009-07-31

    IPC classification: H04N7/26 H04N7/24

    Abstract: A method and system are provided to encode a video sequence into a compressed bitstream. An encoder receives a video sequence from an image-capture device, together with metadata associated with the video sequence, and codes the video sequence into a first compressed bitstream using the metadata to select or revise a coding parameter associated with a coding operation. Optionally, the video sequence may be conditioned for coding by a preprocessor, which also may use the metadata to select or revise a preprocessing parameter associated with a preprocessing operation. The encoder may itself generate metadata associated with the first compressed bitstream, which may be used together with any metadata received by the encoder, to transcode the first compressed bitstream into a second compressed bitstream. The compressed bitstreams may be decoded by a decoder to generate recovered video data, and the recovered video data may be conditioned for viewing by a postprocessor, which may use the metadata to select or revise a postprocessing parameter associated with a postprocessing operation.

    DYNAMIC DITHERING FOR VIDEO COMPRESSION
    27.
    Invention application
    Legal status: in force

    Publication No.: US20100246689A1

    Publication date: 2010-09-30

    Application No.: US12463859

    Filing date: 2009-05-11

    IPC classification: H04N7/26

    Abstract: A method and device for dithering video from a decoder. The method and device provide: selecting a segment of video frames from the video; computing a noise map for the segment of the video, the noise map computed from differences among pixels selected from spatially-distributed sampling patterns in the segment; determining contours of quantization artifacts within the video frame based on the noise map and a histogram of pixel values in the video frame; computing a gradient measurement of pixel values along the contours; identifying regions along the contours of quantization artifacts to apply dithering based on the noise map and the gradient measurement; determining a dithering strength based on the noise map and the gradient measurement; and applying dithering noise to the identified regions at the determined dithering strength.

    Method for implementing a quantizer in a multimedia compression and encoding system
    28.
    Granted invention patent
    Legal status: in force

    Publication No.: US07769084B1

    Publication date: 2010-08-03

    Application No.: US10427843

    Filing date: 2003-04-30

    IPC classification: H04N7/18

    Abstract: A method for implementing a quantizer in a multimedia compression and encoding system is disclosed. In the quantizer system of the present invention, several new quantization ideas are disclosed. In one embodiment, adjacent macroblocks are grouped together into macroblock groups. The macroblock groups are then assigned a common quantizer value. The common quantizer value may be selected based upon how the macroblocks are encoded, the type of macroblocks within the macroblock group (intra-blocks or inter-blocks), the history of the motion vectors associated with the macroblocks in the macroblock group, the residuals of the macroblocks in the macroblock group, and the energy of the macroblocks in the macroblock group. The quantizer value may be adjusted in a manner that is dependent on the current quantizer value. Specifically, if the quantizer value is at the low end of the quantizer scale, then only small adjustments are made. If the quantizer value is at the high end, then larger adjustments may be made to the quantizer. Finally, in one embodiment, the quantizer is implemented along with an inverse quantizer for efficient operation.
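
The value-dependent adjustment described in this abstract might look like the sketch below. The 1 to 31 quantizer scale, the breakpoints at 8 and 16, and the step sizes are hypothetical values chosen for illustration; the abstract only states that adjustments are small at the low end of the scale and may be larger at the high end.

```python
Q_MIN, Q_MAX = 1, 31  # assumed quantizer scale; not stated in the abstract


def max_step(q: int) -> int:
    """Largest allowed change for the current quantizer value.

    The breakpoints and step sizes are illustrative assumptions.
    """
    if q <= 8:
        return 1   # low end of the scale: small adjustments only
    if q <= 16:
        return 2
    return 4       # high end of the scale: larger adjustments allowed


def adjust_quantizer(q: int, direction: int) -> int:
    """Move q up (direction=+1) or down (direction=-1) by the allowed step,
    keeping the result inside the quantizer scale."""
    if direction not in (-1, 1):
        raise ValueError("direction must be -1 or +1")
    return min(max(q + direction * max_step(q), Q_MIN), Q_MAX)
```

In a macroblock-group scheme, the adjusted value would be applied once per group rather than once per macroblock.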

    VIDEO ENCODING IN A VIDEO CONFERENCE
    29.
    Invention application
    Legal status: in force

    Publication No.: US20100189178A1

    Publication date: 2010-07-29

    Application No.: US12702287

    Filing date: 2010-02-08

    IPC classification: H04N7/12 H04N7/14

    CPC classification: H04N7/15

    Abstract: Some embodiments provide an architecture for establishing multi-participant video conferences. This architecture has a central distributor that receives video images from two or more participants. From the received images, the central distributor generates composite images that the central distributor transmits back to the participants. Each composite image includes a set of sub-images, where each sub-image belongs to one participant. In some embodiments, the central distributor saves network bandwidth by removing each particular participant's image from the composite image that the central distributor sends to the particular participant. In some embodiments, images received from each participant are arranged in the composite in a non-interleaved manner. For instance, in some embodiments, the composite image includes at most one sub-image for each participant, and no two sub-images are interleaved.
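
The per-recipient composition in this abstract can be sketched as follows. The data model is an assumption: images are represented as opaque `bytes` values keyed by a participant ID, and a composite is modelled as an ordered list of whole sub-images rather than an actual pixel layout.

```python
from typing import Dict, List, Tuple

SubImage = Tuple[str, bytes]  # (participant id, that participant's image)


def build_composites(received: Dict[str, bytes]) -> Dict[str, List[SubImage]]:
    """Build one composite per recipient from the images the distributor received.

    Each composite holds at most one sub-image per participant, kept whole
    (non-interleaved), and omits the recipient's own image to save bandwidth.
    """
    order = sorted(received)  # fixed ordering; an arbitrary choice for this sketch
    return {
        recipient: [(pid, received[pid]) for pid in order if pid != recipient]
        for recipient in order
    }
```

With three participants, each recipient gets a two-entry composite containing only the other two participants' images.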

    VIDEO CODING SYSTEM USING SUB-CHANNELS AND CONSTRAINED PREDICTION REFERENCES TO PROTECT AGAINST DATA TRANSMISSION ERRORS
    30.
    Invention application
    Legal status: pending (published)

    Publication No.: US20100150230A1

    Publication date: 2010-06-17

    Application No.: US12337273

    Filing date: 2008-12-17

    IPC classification: H04N7/32

    Abstract: A coding technique is disclosed in which frames of a video sequence are assigned to one of a plurality of sub-channels to be transmitted to a decoder. The frames are coded according to predictive coding techniques such that, ordinarily, prediction references of the frames in each sub-channel only reach the reference frames that occur within the same sub-channel. Thus, if transmission errors arise with respect to one sub-channel, decoding may occur for other sub-channels until the transmission error is detected and corrected. The decoder may also try to reconstruct the frames in the failed sub-channel by interpolating from the frames in other sub-channels. Furthermore, when a feedback scheme is available between the encoder and decoder, the encoder may restart the failed sub-channel by coding the next frames in the sub-channel by predicting from correctly received frames in other sub-channels. The encoder and decoder may resume normal encoding and decoding once the restart frame is sent and received, respectively. Additionally, the encoder and decoder can maintain an identical and correctly received long-term reference frame that can be used to restart all sub-channels in case all sub-channels are corrupted at one point. The long-term reference frame can be refreshed periodically.
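
The sub-channel constraint and the restart step in this abstract can be sketched as follows. Round-robin assignment, the rule that each frame references the previous frame of its own sub-channel, and the "most recent correctly received frame" restart choice are assumptions made for illustration; the abstract does not fix any of them.

```python
from typing import List, Optional, Set


def assign_subchannels(num_frames: int, num_channels: int) -> List[int]:
    """Assign frame i to sub-channel i mod num_channels (assumed round-robin)."""
    return [i % num_channels for i in range(num_frames)]


def reference_for(frame: int, num_channels: int) -> Optional[int]:
    """Normal operation: predict from the previous frame of the same sub-channel.

    Returns None for the first frame of a sub-channel, which would need
    intra coding or a long-term reference instead.
    """
    ref = frame - num_channels
    return ref if ref >= 0 else None


def restart_reference(frame: int, num_channels: int,
                      received: Set[int]) -> Optional[int]:
    """Restart a failed sub-channel: predict its next frame from the most
    recent correctly received frame that belongs to a different sub-channel."""
    failed = frame % num_channels
    for candidate in range(frame - 1, -1, -1):
        if candidate % num_channels != failed and candidate in received:
            return candidate
    return None  # nothing usable; fall back to the long-term reference frame
```

With two sub-channels, frame 4 normally references frame 2; if sub-channel 0 has failed and frames 1 and 3 arrived intact, the restart frame references frame 3 instead.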
