摘要:
In video quality control and estimation, the joint evaluation of compression artifacts and channel artifacts is a problem. The invention considers the joint perception of both a compression artifact level Dm(V) and a channel artifact level Dh(V) in a method for calculating overall distortion in a video being affected by compression artifacts and channel artifacts. The method comprises a step of adding a compression artifact level Dm(V) and a summand S that is derived from log (Dh(V)) multiplied with a factor that decreases with increasing compression artifact level Dm, such as D(V)=Dm(V)+c·(K−Dm(V))·log(Dh(V)). The result obtained is a measure for the overall distortion.
摘要:
The invention concerns a method for embedding video annotation data into a coded video stream. The method comprises the step of—encapsulating said video annotation data into a unit, so-called video annotation unit, of the coded video data stream which format corresponds to at least one format used for sending the associated video data,—inserting an identifiable synchronizing code enabling the identification of said video annotation unit into the video data stream.
摘要:
In video quality control and estimation, the joint evaluation of compression artifacts and channel artifacts is a problem. The invention considers the joint perception of both a compression artifact level Dm(V) and a channel artifact level Dh(V) in a method for calculating overall distortion in a video being affected by compression artifacts and channel artifacts. The method comprises a step of adding a compression artifact level Dm(V) and a summand S that is derived from log (Dh(V)) multiplied with a factor that decreases with increasing compression artifact level Dm, such as D(V)=Dm(V)+c·(K−Dm(V))·log(Dh(V)). The result obtained is a measure for the overall distortion.
摘要:
A method and apparatus are disclosed for predicting subjective quality of a video contained in a bit stream on a packet layer. Header information of the bit-stream is parsed and frame layer information, such as frame type, is estimated. Visible artifact levels are then estimated based on frame layer information. An overall artifact level and quality metric are estimated based on artifact levels for individual frames with other parameters. Specifically, different weighting factors are used for different frame types when estimating the levels of initial visible artifacts and propagated visible artifacts. The number of slices per frame is used as a parameter when estimating the overall artifact level for the video. Moreover, the quality assessment model considers quality loss caused by both coding and channel artifacts.
摘要:
Objective video quality assessment models at media-layer or at packet-layer are known for estimating audio/video quality of experience. Existing models are not able to provide stable performance. A method for enabling quality assessment of a stream of frames of video data comprises receiving a sequence of packets, generating a set of parameters and inserting said generated set of parameters as side information into said stream of frames, wherein at least one parameter refers to a video slice level. A method for assessing the quality of a stream of frames of video data comprises receiving a sequence of packets, extracting a set of parameters from said sequence of packets and generating an estimated mean opinion score, wherein the video data comprise a slice level and wherein the extracted set of parameters comprises at least one parameter that refers to a video slice level.
摘要:
Spatial distortion (i.e., when a frame is viewed independently of other frames in a video sequence) may be quite different from temporal distortion (i.e., when frames are viewed continuously). To estimate temporal distortion, a sliding window approach is used. Specifically, multiple sliding windows around a current frame are considered. Within each sliding window, a large distortion density is calculated and a sliding window with the highest large distortion density is selected. A distance between the current frame and the closest frame with large distortion in the selected window is calculated. Subsequently, the temporal distortion is estimated as a function of the highest large distortion ratio, the spatial distortion for the current frame, and the distance. In another embodiment, a median of spatial distortion values is calculated for each sliding window and the maximum of median spatial distortion values is used to estimate the temporal distortion.
摘要:
Objective video quality assessment models at media-layer or at packet-layer are known for estimating audio/video quality of experience. Existing models are not able to provide stable performance. A method for enabling quality assessment of a stream of frames of video data comprises receiving a sequence of packets, generating a set of parameters and inserting said generated set of parameters as side information into said stream of frames, wherein at least one parameter refers to a video slice level. A method for assessing the quality of a stream of frames of video data comprises receiving a sequence of packets, extracting a set of parameters from said sequence of packets and generating an estimated mean opinion score, wherein the video data comprise a slice level and wherein the extracted set of parameters comprises at least one parameter that refers to a video slice level.
摘要:
Various implementations address distortion and quality measurements related to, for example, freeze-with-skip and/or a freeze-without-skip events. In several implementations, information is accessed indicating that a first and second set of one or more consecutive pictures are not to be displayed. A first and second indicator are determined. In one such implementation, the first and second indicators indicate an amount of distortion across one or more types of distortive effects that result from displaying substantially a first, or second, same picture during a display time for the first, or second, set of pictures. The first and second indicators are combined in a non-linear manner. In another such implementation, the distortion is associated with a given type of distortive effect, from among multiple types of distortive effects, and the first and second indicators are combined for the given type of distortive effect.
摘要:
A particular implementation decomposes an image into a structure component and a texture component. An edge strength map is calculated for the structure component, and a texture strength map is calculated for the texture component. Using the edge strength and the texture strength, texture masking weights are calculated. The stronger the texture strength is, or the weaker the edge strength is, the more distortion can be tolerated by human eyes, and thus, the smaller the texture masking weight is. The local distortions are then weighted by the texture masking weights to generate an overall distortion level or an overall quality metric.
摘要:
In 3D mesh coding, the geometry data is compressed by spatial tree based approaches. Bitstreams that result from the traversal of a tree structure of spatial tree based approaches for encoding 3D mesh models have systematically special redundancies, which is exploited for further improving the mesh model compression. A method for encoding a bitstream comprises steps of defining at least a first and a second symbol group of binary symbols, with S1 being a subset of S2, determining within the bitstream first portions, second portions and third portions, wherein first portions have Th1 or more consecutive S1 symbols and second portions have Th2 or more consecutive S2 symbols, encoding the bitstream, wherein first portions, second portions and third portions are encoded using different codes, and encoding values indicating the boundary positions between the first, second and third portions in the bitstream.