Abstract:
As information to be processed at an object-based video or audio-visual (AV) terminal, an object-oriented bitstream includes objects, composition information, and scene demarcation information. Such a bitstream structure allows on-line editing, e.g. cut and paste, insertion/deletion, grouping, and special effects. In the interest of ease of editing, AV objects and their composition information are transmitted or accessed on separate logical channels (LCs). Objects which have a lifetime in the decoder beyond their initial presentation time are cached for reuse until a selected expiration time. The system includes a de-multiplexer, a controller which controls the operation of the AV terminal, input buffers, AV object decoders, buffers for decoded data, a composer, a display, and an object cache.
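The caching behavior described above (objects kept for reuse until a selected expiration time) can be sketched as follows. This is only an illustrative sketch: the class name `ObjectCache`, its methods, and the use of plain numbers for time are assumptions made for the example and are not taken from the abstract.

```python
class ObjectCache:
    """Hypothetical cache of decoded AV objects, each with an expiration time."""

    def __init__(self):
        # Maps object_id -> (decoded_data, expiration_time).
        self._entries = {}

    def put(self, object_id, data, expiration_time):
        """Store a decoded object so it can be reused until expiration_time."""
        self._entries[object_id] = (data, expiration_time)

    def get(self, object_id, now):
        """Return the cached object, or None if it is absent or has expired."""
        entry = self._entries.get(object_id)
        if entry is None:
            return None
        data, expiration_time = entry
        if now >= expiration_time:
            # The object's lifetime is over: evict it.
            del self._entries[object_id]
            return None
        return data


cache = ObjectCache()
cache.put("logo", b"decoded-pixels", expiration_time=10.0)
reused = cache.get("logo", now=5.0)    # still valid, returned for reuse
expired = cache.get("logo", now=10.0)  # expired, evicted from the cache
```

In this sketch eviction happens lazily on lookup; a real terminal might instead purge expired objects under control of its controller.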
Abstract:
An interactive video/multimedia application (IVM application) may specify one or more media assets for playback. The IVM application may define the rendering, composition, and interactivity of one or more of the assets, such as video. Video multimedia application data (IVMA data) may be used to define the behavior of the IVM application. The IVMA data may be embodied as a standalone file in a text or binary, compressed format. Alternatively, the IVMA data may be embedded within other media content. A video asset used in the IVM application may include embedded, content-aware metadata that is tightly coupled to the asset. The IVM application may reference the content-aware metadata embedded within the asset to define the rendering and composition of application display elements and user-interactivity features. The interactive video/multimedia application (defined by the video and multimedia application data) may be presented to a viewer in a player application.
Abstract:
A video coding for machines (VCM) encoder includes a first video encoder, the first video encoder configured to encode an input video into a bitstream. The VCM encoder includes a feature extractor, the feature extractor configured to detect at least a feature in the input video. The VCM encoder includes a second encoder, the second encoder configured to encode a feature bitstream as a function of the input video and at least a feature.
Abstract:
A method for compressing and transmitting a sequence of video frames represented by arrays of digital pixel values includes the following steps: transmitting a representation of a first frame (I1) of the sequence; deriving a sorting permutation P1 of the first frame; using the sorting permutation of the first frame, P1, to approximately sort a second frame (I2) of the sequence, to obtain approximately sorted frame P1(I2); and compressing and transmitting the approximately sorted frame P1(I2).
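The sorting-permutation steps above can be illustrated with a small sketch. It assumes each frame is flattened to a one-dimensional list of pixel values; the function names are hypothetical, and the actual compression and transmission of the approximately sorted frame is left out because the abstract does not specify it.

```python
def sorting_permutation(frame):
    """Return the index order that sorts the frame's pixel values ascending."""
    return sorted(range(len(frame)), key=lambda i: frame[i])


def apply_permutation(frame, perm):
    """Reorder the frame's pixels according to perm."""
    return [frame[i] for i in perm]


def invert_permutation(permuted, perm):
    """Undo apply_permutation, restoring the original pixel order."""
    restored = [0] * len(perm)
    for position, source_index in enumerate(perm):
        restored[source_index] = permuted[position]
    return restored


i1 = [52, 10, 200, 97, 10, 180]   # first frame (flattened), toy values
i2 = [50, 12, 198, 99, 11, 183]   # second frame, similar to the first

p1 = sorting_permutation(i1)      # derived from the first frame only
p1_i2 = apply_permutation(i2, p1) # approximately sorted second frame
print(p1_i2)                      # [12, 11, 50, 99, 183, 198]
```

Because the receiver already holds the first frame, it can derive the same P1 and invert it, so in this sketch only P1(I2) would need to be sent. The example relies on both sides using the same tie-breaking rule when sorting (Python's `sorted` is stable).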
Abstract:
A method for receiving input video having a sequence of input video frames, and producing a compact video signature as an identifier of the input video, includes the following steps: generating a processed video tomograph using an arrangement of corresponding lines of pixels from the respective frames of the sequence of video frames; measuring characteristics of the processed video tomograph; and producing the video signature from the measured characteristics.
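As a rough illustration of the tomograph idea, the sketch below stacks the same row of pixels from every frame and then derives a short list of numbers from the result. The choice of a single horizontal row, the threshold-count measurement, and the function names are all assumptions for the example; the abstract does not say which lines, processing, or characteristics are used.

```python
def video_tomograph(frames, row):
    """Stack one row of pixels from each frame; output has one line per frame."""
    return [list(frame[row]) for frame in frames]


def signature(tomograph, threshold):
    """Toy measurement: count the above-threshold pixels in each tomograph line."""
    return [sum(1 for pixel in line if pixel > threshold) for line in tomograph]


# Three tiny 2x3 grayscale frames with toy values.
frames = [
    [[0, 0, 0], [10, 200, 30]],
    [[0, 0, 0], [210, 220, 30]],
    [[0, 0, 0], [10, 20, 30]],
]

tomo = video_tomograph(frames, row=1)
sig = signature(tomo, threshold=128)
print(sig)  # [1, 2, 0]
```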
Abstract:
A method for encoding frames of input video signals, including the following steps: implementing a learning/configuring stage that includes the following steps: providing frames of training video signals; determining training statistical parameters for groups of pixels of the frames of training video signals, and also encoding the frames of training video signals to obtain training modes; configuring a decision tree in response to the training statistical parameters and the training modes; and implementing an operating/encoding stage that includes the following steps: determining operating statistical parameters for groups of pixels of the frames of input video signals, and applying the operating statistical parameters to the configured decision tree to obtain operating modes; and encoding the frames of input video signals using the frames of input video signals and the operating modes.
Abstract:
A video coding for machines (VCM) encoder for combined lossless and lossy encoding includes a feature encoder, the feature encoder configured to encode a sub-picture containing a feature in an input video and provide an indication of the sub-picture, and a video encoder, the video encoder configured to receive an indication of the sub-picture from the feature encoder and encode the sub-picture using a lossy encoding protocol.
Abstract:
A method for receiving encoded H.264 video signals and transcoding the received encoded signals to encoded MPEG-2 video signals, including the following steps: decoding the encoded H.264 video signals to obtain uncompressed video signals and to also obtain H.264 feature signals; deriving MPEG-2 feature signals from the H.264 feature signals; and producing the encoded MPEG-2 video signals using the uncompressed video signals and the MPEG-2 feature signals. The H.264 feature signals include H.264 macro block modes and include H.264 motion vectors.
Abstract:
A method for producing compressed video signals representative of a sequence of video frames, including the following steps: determining the value of a temporal variation parameter between successive frames, or portions thereof, of the sequence of frames; determining when the temporal variation parameter meets a predetermined criterion and indexing the frame transitions where the criterion is met; and digitally encoding the sequence of frames with relative reduction of the bitrate for at least a portion of the earlier-occurring frame of each indexed transition.
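The three steps above can be sketched as follows. The mean absolute pixel difference as the temporal variation parameter, the fixed threshold as the predetermined criterion, and the per-frame bit budget are assumptions chosen for illustration; the abstract does not name a specific parameter, criterion, or encoder, and all function names are hypothetical.

```python
def temporal_variation(prev_frame, curr_frame):
    """Mean absolute pixel difference between two equally sized flattened frames."""
    total = sum(abs(a - b) for a, b in zip(prev_frame, curr_frame))
    return total / len(prev_frame)


def index_transitions(frames, threshold):
    """Return each index i where the change from frame i to i+1 meets the criterion."""
    return [
        i
        for i in range(len(frames) - 1)
        if temporal_variation(frames[i], frames[i + 1]) >= threshold
    ]


def allocate_bits(num_frames, transitions, base_bits, reduction):
    """Give every frame base_bits, reduced for the earlier frame of each transition."""
    budgets = [base_bits] * num_frames
    for i in transitions:
        budgets[i] = int(base_bits * reduction)
    return budgets


frames = [
    [10, 10, 10, 10],
    [12, 10, 11, 10],
    [200, 190, 210, 205],  # abrupt change relative to the previous frame
    [201, 191, 209, 205],
]

transitions = index_transitions(frames, threshold=50)
budgets = allocate_bits(len(frames), transitions, base_bits=1000, reduction=0.5)
print(transitions, budgets)  # [1] [1000, 500, 1000, 1000]
```

Here the whole earlier frame gets a reduced budget; the abstract also allows the reduction to apply to only a portion of that frame.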