Abstract:
An image processing apparatus is provided that comprises an input configured to receive an image and a Laplacian generator configured to generate, from the image, a Laplacian pyramid that represents the image as a series of frames that contain different frequency components of the image. The image processing apparatus also comprises a compressor configured to compress the Laplacian pyramid for writing to memory.
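The pyramid construction described above can be illustrated with a minimal sketch. It is not the patented apparatus: the 2x2 box filter, the pixel-repetition upsampling and the function names are assumptions chosen for brevity, and the compression step is omitted. Image dimensions are assumed to be divisible by 2 for every level.

```python
def downsample(img):
    """Halve each dimension by averaging 2x2 blocks (assumed box filter)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[(img[2 * y][2 * x] + img[2 * y][2 * x + 1]
              + img[2 * y + 1][2 * x] + img[2 * y + 1][2 * x + 1]) / 4.0
             for x in range(w)] for y in range(h)]


def upsample(img):
    """Double each dimension by repeating every pixel (assumed interpolation)."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def build_laplacian_pyramid(img, levels):
    """Return `levels` detail frames followed by one low-frequency residual."""
    pyramid = []
    current = [[float(v) for v in row] for row in img]
    for _ in range(levels):
        smaller = downsample(current)
        predicted = upsample(smaller)
        # The detail frame holds what the coarser level cannot represent.
        detail = [[c - p for c, p in zip(c_row, p_row)]
                  for c_row, p_row in zip(current, predicted)]
        pyramid.append(detail)
        current = smaller
    pyramid.append(current)
    return pyramid


def reconstruct(pyramid):
    """Invert the pyramid by upsampling and adding the detail frames back."""
    current = pyramid[-1]
    for detail in reversed(pyramid[:-1]):
        up = upsample(current)
        current = [[u + d for u, d in zip(u_row, d_row)]
                   for u_row, d_row in zip(up, detail)]
    return current
```

Each detail frame carries one band of higher-frequency content, and the final entry carries the remaining low-frequency content, so the frames can be compressed separately before being written to memory.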
Abstract:
The system provides a method and apparatus for constructing a composite video and for dynamically rearranging the order of its content. The re-ordering of clips in the composite video can be based on one or more weighting factors associated with each clip. These factors can include freshness or newness of the clip, popularity based on the number of "likes" of a clip by others, the content of the clip (e.g. celebrity creator or presence), paid boosting (e.g. for commercial concerns), and other factors. Each clip has associated metadata that can be used to assign a weight value to the clip for purposes of reordering the composite video.
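A weight-based reordering of this kind might look like the following sketch. The `Clip` fields, the scoring formula and the coefficients are all hypothetical; the abstract names the factors but does not say how they are combined.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    # Hypothetical metadata fields, one per factor named in the abstract.
    clip_id: str
    age_hours: float
    likes: int
    has_celebrity: bool = False
    paid_boost: float = 0.0


def clip_weight(clip):
    """Combine the factors into one weight (assumed linear combination)."""
    freshness = 1.0 / (1.0 + clip.age_hours)   # newer clips score higher
    popularity = clip.likes / 100.0            # assumed normalisation
    celebrity = 1.0 if clip.has_celebrity else 0.0
    return 2.0 * freshness + 1.0 * popularity + 0.5 * celebrity + clip.paid_boost


def reorder(clips):
    """Return the clips sorted from highest to lowest weight."""
    return sorted(clips, key=clip_weight, reverse=True)
```

Because the weight is recomputed from metadata on every call, rerunning `reorder` after a clip gains likes or ages gives the dynamic rearrangement the abstract describes.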
Abstract:
For a video sequence including a composite image having a foreground image and a transparency mask, a video encoder determines whether the transparency mask is the same as a transparency mask of a preceding image. Where the transparency mask is not the same, the transparency mask is encoded as an image. The encoder then transmits the encoded foreground image, any encoded transparency mask and a flag signifying whether the encoded transparency mask for a preceding image is to be used in association with the encoded foreground image of the current image.
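The mask-reuse signalling can be sketched as below. The packet layout and field names are assumptions, and the actual image encoding of the foreground and mask is replaced by passing the data through unchanged.

```python
def encode_sequence(frames):
    """Turn (foreground, mask) pairs into packets with a mask-reuse flag."""
    packets = []
    previous_mask = None
    for foreground, mask in frames:
        # Reuse is only possible when a preceding mask exists and matches.
        reuse = previous_mask is not None and mask == previous_mask
        packets.append({
            "foreground": foreground,        # stand-in for the encoded foreground
            "reuse_previous_mask": reuse,    # the flag described in the abstract
            "mask": None if reuse else mask  # mask is sent only when it changed
        })
        previous_mask = mask
    return packets


def decode_sequence(packets):
    """Rebuild (foreground, mask) pairs, keeping the last transmitted mask."""
    frames = []
    current_mask = None
    for packet in packets:
        if not packet["reuse_previous_mask"]:
            current_mask = packet["mask"]
        frames.append((packet["foreground"], current_mask))
    return frames
```

When the mask is static over many frames, only the first packet carries it, which is where the bitrate saving comes from.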
Abstract:
The present invention relates to a method for decoding an encoded video bitstream with multiple frames for displaying a video on a transparent display, the method comprising the following steps: receiving the encoded video bitstream, wherein the video bitstream comprises image data comprising multiple image values representing an image to be displayed and transparency data representing the intended transparency level of at least a part of the image to be displayed; for each frame of the video bitstream: decoding the received encoded image data to obtain decoded image data; determining whether the current frame comprises transparency data; in case the current frame comprises transparency data: adjusting the decoded image data, depending on the received transparency data, to obtain transparency-adjusted image data.
Abstract:
To represent a 3D scene, the multiplane image (MPI) format uses a set of fronto-parallel planes. Unlike MPI, the current MPEG Immersive Video (MIV) standard accepts as input a 3D scene represented as a sequence of pairs of texture and depth pictures. To enable transmission of an MPI cube via the MIV-V3C standard, in one embodiment, an MPI cube is divided into empty regions and local MPI partitions that contain 3D objects. Each partition in the MPI cube can be projected to one or more patches. For a patch, the geometry is generated as well as the texture and alpha attributes, and the alpha attributes may be represented as a peak and a width of an impulse. In another embodiment, an MPI RGBA layer of the MPI is cut into sub-images. Each sub-image may correspond to a patch, and the RGB and alpha information of the sub-image is assigned to the patch.
Abstract:
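A method for performing image alignment automatically, without user input, is provided. An image alignment method according to an embodiment of the present invention, performed by an image alignment apparatus, may include: recognizing at least one person in an input image; determining a person of interest among the recognized persons; and performing image alignment on the input image with reference to the person of interest, wherein the image alignment is performed without any input for the image alignment from a user of the image alignment apparatus.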
Abstract:
A feature encoding section (5) extracts a feature of a video signal (102), encodes it, and produces a feature stream (103). A feature identification section (11) collates a feature (109) encoded from the feature stream (103) with a search key (108) entered by a user to find video contents (111) that the user needs.
Abstract:
Multiplexing means on the side of a picture encoding device multiplexes display speed information or information expressing the absolute time, and a picture decoding device carries out processing based on the multiplexed display speed information or information expressing the absolute time. Thus, picture decoding processing may be carried out smoothly and precisely.
Abstract:
Methods and devices for generating multiplane images (MPIs) from a representation of a 3D scene are disclosed. When an MPI is generated by the system, an image of the accumulation of the alpha values is computed. A convolutional neural network (CNN) is trained with a penalization function that penalizes MPIs with high values of accumulated alpha according to a parameter selected or trained to reduce the redundancy of information between layers of the MPI. When generating an MPI from a subset of the views of a newly captured 3D scene, a first MPI is generated with the trained CNN with the associated parameter. An error is calculated between the other views and the corresponding views synthesized with the first MPI. This error is used to modify the parameter. Then, a second MPI is generated with the trained CNN with the new parameter.
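The accumulated-alpha image and a penalty built on it could be sketched as follows. The abstract does not give the penalization formula, so the threshold of 1.0, the averaging and the `strength` argument (standing in for the selected or trained parameter) are assumptions. Each layer is taken to be a 2D list of alpha values in [0, 1].

```python
def accumulate_alpha(layers):
    """Sum the alpha values of all MPI layers at each pixel position."""
    height, width = len(layers[0]), len(layers[0][0])
    return [[sum(layer[y][x] for layer in layers) for x in range(width)]
            for y in range(height)]


def alpha_penalty(layers, strength):
    """Penalise pixels whose accumulated alpha exceeds 1.0 (assumed form).

    An accumulated alpha above 1.0 suggests that several layers describe
    the same surface, i.e. redundant information between layers.
    """
    accumulated = accumulate_alpha(layers)
    excess = [max(0.0, value - 1.0) for row in accumulated for value in row]
    return strength * sum(excess) / len(excess)
```

In a training loop such a term would be added to the reconstruction loss, and raising or lowering `strength` would trade view-synthesis error against redundancy between layers.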
Abstract:
The invention is directed to a method (300) of encoding (340) a mask (4) of a frame (2), wherein the mask (4) comprises a plurality of pixels with at least one component, wherein the value of each component of a pixel is set to a first value if the pixel is within an area of interest or to a second value if the pixel is outside the area of interest, the method comprising: selecting (342) at least one mask window (14) overlapping with an area of interest; and extracting (344) the at least one mask window (14) to form an encoded mask (4*).
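The select-and-extract steps can be illustrated with a small sketch. Choosing the tight bounding box of the area of interest as the single mask window is an assumption, as are the function names and the single-component mask; the abstract only requires that the window overlap the area of interest.

```python
def select_window(mask, first_value=1):
    """Return (top, left, bottom, right), inclusive, around all pixels
    equal to first_value, or None if the mask has no area of interest."""
    rows = [y for y, row in enumerate(mask) if first_value in row]
    if not rows:
        return None
    cols = [x for row in mask for x, value in enumerate(row)
            if value == first_value]
    return (min(rows), min(cols), max(rows), max(cols))


def extract_window(mask, window):
    """Crop the mask to the selected window to form the encoded mask."""
    top, left, bottom, right = window
    return [row[left:right + 1] for row in mask[top:bottom + 1]]
```

A decoder would need the window position alongside the cropped data in order to place it back into a full-size mask filled with the second value.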