摘要:
Provided is a scalable video coding method and apparatus. Motion data of a high-quality fine grain scalability (FGS) layer is used for interlayer coding in order to remove redundancy between coarse grain scalability (CGS) layers or layers having different spatial resolutions, and information indicating that data of the FGS layer has been used for interlayer motion prediction is inserted for Moving Picture Expert Group (MPEG)-4 scalable video encoding. A bitstream extractor checks the information and performs extraction to maintain the data of the FGS layer. MPEG-4 scalable video decoding is performed using the information. By using the FGS layer, interlayer redundancy can be efficiently removed, thereby improving encoding efficiency.
摘要:
An apparatus for and method of adapting a bitstream to which scalable video coding (SVC) technology is applied are provided. The apparatus for adapting a bitstream includes: an Adaptation QoS information extraction unit extracting SVC adaptation operators, and relationships between the SVC adaptation operators and the usage environment information of a terminal from the Adaptation QoS information on the bitstream to which SVC technology is applied; an Adaptation Decision Taking Engine(ADTE) unit determining the SVC adaptation operators corresponding to the usage environment of the terminal receiving the transmitted bitstream among the SVC adaptation operators; and a SVC bitstream extraction unit extracting the bitstream based on the determined SVC adaptation operator. According to the apparatus and method, scalable video can be efficiently provided for changing network environments and multimedia usage environments, through adaptation of scalable video streams using an adaptation operator suggested in Classification Scheme (AQoSJDS).
摘要:
A multiple ROI (region of interest) setting method and apparatus in scalable video coding and an ROI reconstructing method and apparatus are provided. The multiple ROI setting apparatus includes: an ROI setting unit which sets at least one or more R Ols and allocates ROI identification numbers to the each of ROIs; a mapping unit which allocates at least one or more slice group identification numbers to the at least one or more ROI identification numbers; and a message generating unit which generates a me ssage including ROI-associated information, slice-group-associated information, mappi ng information on mapping of the ROI identification number to the at least one or more slice group identification numbers, and scalability information.
摘要:
Provided is a hierarchical video encoding/decoding method for complete spatial scalability and apparatus thereof . The apparatus for encoding a video image including: an overlapped region (OR) detector for receiving coding region information about a plurality of regions of interest (ROI) in the video image to encode and detecting overlapped regions (OR) in the ROI regions; a region arranger for arranging the video image, the regions of interest and the detected overlapped regions into a plurality of layers according to a resolution; and a region encoder for encoding the video image, the regions of interest and the detected overlapped regions according to a resolution of a corresponding layer arranged at the region arranger. The coding region information may include information about locations of the regions of interest in the video image and a coding resolution of the regions of interest. A video encoding/decoding apparatus according to the present invention provides a complete scalability of a spatial domain by defining a region of interest (ROI) in a video image. Also, the video encoding/decoding apparatus according to the present invention provides an improved coding rate by encoding video image in consideration of spatial redundancy among a plurality of regions of interest.
摘要:
The present invention relates to a method of systematically and synthetically accessing modality conversion that is an important part in the contents adaptive conversion process of a universal multimedia access system. The present invention provides an effective method of solving a problem, which is incurred at the time of modality conversion and still remains as one of difficult problems incurred during adaptive contents conversion. For this purpose, the present invention includes overlapped contents modeling newly proposed to determine modality conversion, a method of flexibly and clearly expressing and applying user preference for the modality conversion, and a resource allocation method of distributing resources among complicated contents based on the user preference. As a result, the integration of the above three methods provides a synthetic solution, particularly, to a problem incurred in the modality conversion and, generally, to a problem incurred in the adaptive conversion of contents.
摘要:
Provided are a combined file format for DMB contents which automatically filters contents desired by a user, stores the desired contents and shows the contents to the user when the user is available to watch them, and enhances applicability of contents through sharing and exchanging the contents; and an apparatus and method for processing the DMB contents of the combined file format. The file format for DMB contents is capable of combining into one file a Digital Multimedia Broadcasting (DMB) content including at least one of DMB video, DMB audio, data and a combination thereof; detailed information metadata for describing detailed information of the DMB content; and protection and governance metadata for describing information on protecting and governing the DMB content.
摘要:
Disclosed are a method and a system that could adaptively improve the visual quality of people with low-vision impairment, regardless of network and terminal. The low-vision impairment is described by a set of "symptoms" that is semantically defined. As the description tool of low vision impairments, it is flexible and reliable to use the proposed "symptoms" based descriptions rather than individually identified names of eye disease, because the user can describe his/her low-vision impairment by specifying associated symptoms based on his/her own experience. The inputted visual contents are adaptively transformed according to the low vision-impairment.
摘要:
Provided are an apparatus and method for coding and decoding a multi object audio signal with multi channel. The apparatus includes: a multi channel encoding means for down-mixing an audio signal including a plurality of channels, generating a spatial cue for the audio signal including the plurality of channels, and generating first rendering information including the generated spatial cue; and a multi object encoding unit for down-mixing an audio signal including a plurality of objects, which includes the down-mixed signal from the multi channel encoding unit, generating a spatial cue for the audio signal including the plurality of objects, and generating second rendering information including the generated spatial cue, wherein the multichannel encoding unit generates a spatial cue for the audio signal including the plurality of objects regardless of a Coder-DECoder (CODEC) scheme the limits the multi channel encoding unit.
摘要:
Provided are a method and apparatus for displaying lightweight applications scene representation (LASeR) content. A LASeR markup language (ML) that is based on a LASeR binary stream or a LASeR extensible markup language (XML) is parsed so as to generate a LASeR document object model (DOM). A LASeR application program interface (API) is used to generate a LASeR DOM object tree. A LASeR player accesses the LASeR DOM to display LASeR DOM scene information.