摘要:
The invention relates to a coding method applied to digital video data available in the form of a video stream consisting of consecutive frames. These frames, divided into macroblocks, include at least I-frames, independently coded, or P-frames, temporally disposed between said I-frames and predicted from at least a previous I- or P-frame, or B-frames, temporally disposed between an I-frame and a P-frame, or between two P-frames, and bidirectionally predicted from at least these two frames between which they are disposed, said predictions of P- and B-frames being performed by means of a weighted prediction with unequal amount of prediction from the past and the future. According to the invention, this coding method comprises the following steps: a structuring step, provided for capturing coding parameters characterizing the said weighted prediction; a computing step, for delivering statistics related to said parameters; an analyzing step for determining a change of preference regarding the direction of prediction; a step provided for detecting the occurrences of gradual scene changes; a step provided for generating description data of said occurrences; and a step for encoding the description data thus obtained and the original digital video data.
摘要:
A device (DVR) handles data items that can be rendered to a user. Such a device may be, for example, a digital video recorder. The device (DVR) comprises a network interface (NWIC, NWIM) that couples the device (DVR) to a network, which comprises other devices. A content analysis initiator (ECF) within the device (DVR) detects that another device, which forms part of the network, comprises a content analyzer. The content analysis initiator (ECF) causes the content analyzer (AVCA) of the other device to be applied to a data item (AVF).
摘要:
The invention relates to a coding method applied to digital video data available in the form of a video stream consisting of consecutive frames divided into macroblocks. These frames are coded in the form of at least I-frames, coded independently, P-frames, predicted from at least a previous I- or P-frame, and B-frames, bidirectionally predicted from at least two frames between which they are disposed. According to the invention, the coding method comprises the following steps: a structuring step, provided for capturing for all the macroblocks of the current frame related coding parameters characterizing the fact that they have been coded, or not, according to a predetermined intra prediction mode; a computing step, for delivering statistics related to said parameters; an analyzing step, provided for analyzing said statistics for determining the number of blocks which exhibit, or not, said intra prediction mode; a detecting step, provided for detecting, each time said number is greater than a given threshold, the occurrence of an image, or of a sub-region of an image, which is either monochrome or with a repetitive pattern; a description step, provided for generating description data of said occurrences of images or sub-images either monochrome or with a repetitive pattern; a coding step, for coding both description data and original data.
摘要:
The invention relates to an apparatus (300) and a method for analyzing a content stream (201) comprising a content item, and to a computer program product enabling a programmable device. The apparatus comprises a content analysis processor (310) for identifying an exact indicator of a boundary (221, 222) of the content item in the content stream, wherein identifying comprises determining a remote indicator (231) being remote from the boundary and analyzing the content stream starting from the remote indicator towards the boundary to identify the exact indicator.
摘要:
The invention relates to a system (101) for content analysis. The system (101) comprises an interface receiving a video signal in accordance with a first encoding standard, such as H.264. The interface is coupled to an extraction processor (107) which extracts video coding data from the video signal. The video coding data is fed to a conversion processor (109) which converts the video coding data to video coding data according to a second video encoding standard, such as MPEG-2. The conversion converts the extracted video data to video coding data related to a common encoding block size, for example, by grouping smaller blocks and averaging the video parameters to provide video coding parameters related to larger block sizes. The converted data is fed to a content analysis processor (111) which performs content analysis based on the converted data. A content analysis algorithm for one video encoding standard may thus be used for a different video encoding standard.
摘要:
2D/3D video conversion using a method for providing an estimation of visual depth for a video sequence, the method comprises an audio scene classification (34) in which a visual depth categorization index of visual depth (37) of a scene is made on basis of an analysis of audio information (32) for the scene, wherein the visual depth categorization index (37) is used in a 5 following visual depth estimation (38) based on video information (33) for the same scene, thereby reducing the calculation load and speeding up the processing.
摘要:
The present invention relates to a method of processing digital coded video data available in the form of a video stream consisting of consecutive frames divided into slices. The frames include at least I-frames, coded without any reference to other frames, P-frames, temporally disposed between said I-frames and predicted from at least a previous I- or P-frame, and B-frames, temporally disposed between an I-frame and a P-frame, or between two P-frames, and bidirectionally predicted from at least these two frames between which they are disposed. The processing method comprises the steps of determining for each slice of the current frame related slice coding parameters and parameters related to spatial relationships between the regions that are coded in each slice, collecting said parameters for all the successive slices of the current frame, for delivering statistics related to said parameters, analyzing said statistics for determining regions of interest (ROIs) in said current frame, and enabling a selective use of the coded data, targeted on the regions of interest thus determined.
摘要:
The invention relates to a method of informing a user about a category (152) of a media content item. The method comprises the steps of: identifying the category of the media content item, and enabling a user to obtain an audible signal (156) having an audio parameter (153) in accordance with the category of the media content item. The invention further relates to a device, which is capable of functioning in accordance with the method. The invention also relates to audio data comprising an audible signal informing a user about a category of a media content item, a database comprising a plurality of the audio data, and a computer program product. In a recommender system, the audible signal may be reproduced by the recommender system when a user interaction with the recommender system relates to the media content item of a particular genre. The invention may be used in the EPG user interface.
摘要:
The invention relates to a detection method applied to digital coded video data available in the form of a video stream comprising consecutive frames divided into macroblocks themselves subdivided into contiguous blocks. These frames comprise I-frames, coded independently, P-frames, predicted from a previous I- or P-frame, and B-frames, bidirectionally predicted from at least two frames between which they are disposed. According to the invention, the processing method comprises the steps of determining for each block of the current frame if it has been coded, or not, according to a predetermined intra prediction mode, collecting similar information for all the blocks of the current frame, for delivering statistics related to said intra prediction mode, analyzing said statistics for determining the number of blocks of said current frame which exhibit, or not, said intra prediction mode, and detecting in the sequence of frames, each time said number is greater than a given threshold, the occurrence of an image, or a sub-region of an image, which is either monochrome or with a repetitive pattern.
摘要:
The invention relates to a video encoding apparatus (100) comprising a video analysis processor (101) and a video encoder (103). The video analysis processor (101) comprises a segmentation processor (109) which divides a picture into a plurality of picture regions. A picture characteristic processor (111) determines picture characteristic, such as a texture level, for one of the regions, and in response a video encoding selector (113) selects a video encoding parameter for that region. The video encoding parameter is fed to the video encoder (103) wherein a video encode processor (I 19) encodes the picture using the video encoding parameter determined by the external analysis by the video analysis processor (101). The encoded picture is fed back to the video analysis processor (101) and the process is iterated until a desired encoding performance is achieved. The apparatus is particularly suitable for H.264 encoding and allows for improved performance from a selection of encoding parameters based on an external analysis.