Abstract:
A video is resized while preserving salient regions within the video. The dimensions of the video are reduced by selectively removing unimportant, or less salient, pixels from the video in a manner that diminishes any resulting spatial and temporal artifacts. More specifically, “seams” of pixels are selectively removed from frames of the video, where a “seam” is a horizontal or vertical chain of pixels through a frame. A seam can be selected from among all possible pixels in a row or column, thereby allowing discontinuous seams. Seams are selected using a technique that encourages the seam to pass through less-salient regions of the frame, and that reduces spatial artifacts when the video is played.
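The seam selection described above can be illustrated for a single frame. This is only a minimal sketch under stated assumptions, not the method of the abstract: gradient magnitude stands in for saliency, the temporal terms are omitted, and the names `gradient_energy`, `find_vertical_seam`, `remove_vertical_seam`, and `max_step` are hypothetical. Setting `max_step` to the frame width lets the seam jump to any pixel in the next row, which mirrors the discontinuous seams mentioned in the abstract.

```python
def gradient_energy(frame):
    """Per-pixel energy: absolute horizontal plus vertical intensity difference.
    Assumption: gradient magnitude is used here as a simple saliency proxy."""
    h, w = len(frame), len(frame[0])
    energy = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            left, right = frame[y][max(x - 1, 0)], frame[y][min(x + 1, w - 1)]
            up, down = frame[max(y - 1, 0)][x], frame[min(y + 1, h - 1)][x]
            energy[y][x] = abs(right - left) + abs(down - up)
    return energy


def find_vertical_seam(energy, max_step=1):
    """Return one column index per row such that the summed energy is minimal.
    max_step bounds the column shift between consecutive rows (hypothetical
    parameter); a value >= frame width permits fully discontinuous seams."""
    h, w = len(energy), len(energy[0])
    cost = [list(energy[0])]   # cumulative cost of the best seam ending at (y, x)
    back = []                  # back[y - 1][x]: predecessor column in row y - 1
    for y in range(1, h):
        row_cost, row_back = [], []
        for x in range(w):
            lo, hi = max(0, x - max_step), min(w - 1, x + max_step)
            best = min(range(lo, hi + 1), key=lambda c: cost[y - 1][c])
            row_cost.append(energy[y][x] + cost[y - 1][best])
            row_back.append(best)
        cost.append(row_cost)
        back.append(row_back)
    # Walk back from the cheapest end point in the last row.
    x = min(range(w), key=lambda c: cost[-1][c])
    seam = [x]
    for row_back in reversed(back):
        x = row_back[x]
        seam.append(x)
    seam.reverse()
    return seam


def remove_vertical_seam(frame, seam):
    """Drop the seam pixel from every row, reducing the width by one."""
    return [row[:x] + row[x + 1:] for row, x in zip(frame, seam)]
```

Calling `remove_vertical_seam(frame, find_vertical_seam(gradient_energy(frame)))` repeatedly narrows a frame one column at a time. A video version would additionally have to penalize seams that differ strongly between neighboring frames.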
Abstract:
A video is segmented to produce volumetric video regions. Descriptors are created for the video regions. A region graph is created for the video, where the region graph has weighted edges incident to video regions and the weight of an edge is calculated responsive to the descriptors of the video regions incident to the edge. The region graph is segmented responsive to the weights of the edges incident to the video regions to produce a new region graph having new volumetric video regions comprised of merged video regions of the first region graph. Descriptions of the region graphs are stored in data storage.
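One level of the region-graph merging can be sketched as follows. This is an illustrative simplification, not the patented procedure: the descriptor is assumed to be a plain numeric vector (for example a color histogram), the edge weight is assumed to be an L1 distance, and regions are merged with a fixed `threshold` via union-find. All function and parameter names are hypothetical.

```python
def descriptor_distance(a, b):
    """Edge weight: L1 distance between two region descriptors (assumed metric)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def build_region_graph(descriptors, adjacency):
    """descriptors: {region_id: vector}; adjacency: iterable of (u, v) pairs
    of regions that touch in space or time. Returns (weight, u, v) edges."""
    return [(descriptor_distance(descriptors[u], descriptors[v]), u, v)
            for u, v in adjacency]


def merge_regions(descriptors, edges, threshold):
    """Merge regions joined by an edge whose weight is at most `threshold`.
    Returns {region_id: merged_region_id} describing the new regions."""
    parent = {r: r for r in descriptors}

    def find(r):
        # Follow parent links to the representative, compressing the path.
        while parent[r] != r:
            parent[r] = parent[parent[r]]
            r = parent[r]
        return r

    for weight, u, v in sorted(edges):  # process most similar pairs first
        if weight <= threshold:
            ru, rv = find(u), find(v)
            if ru != rv:
                parent[rv] = ru
    return {r: find(r) for r in descriptors}
```

To build a hierarchy, the merged regions would receive new descriptors and the same steps would be repeated on the coarser graph.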
Abstract:
An image comprising color pixels with varying illumination is selected. Instances of a repeating pattern in the image are determined. Illumination values for illuminated pixels at locations within instances of the repeating pattern are calculated based on pixel intensities of non-illuminated pixels at corresponding locations in other instances of the repeating pattern. The illumination variation is removed from the illuminated pixels based on the calculated illumination values to produce enhanced pixels. Color from the non-illuminated pixels at the corresponding locations in other instances of the repeating pattern is propagated to the enhanced pixels.
Abstract:
An exemplar dictionary is built from exemplars of digital content for determining predictor blocks for encoding and decoding digital content. The exemplar dictionary organizes the exemplars as clusters of similar exemplars. Each cluster is mapped to a label. Machine learning techniques are used to generate a prediction model for predicting a label for an exemplar. The exemplar dictionary is used to encode digital content. Clusters of exemplars are obtained by applying a prediction model to a target block of digital content for encoding. A predictor block is selected for encoding the target block based on frequency of occurrence of exemplars in the clusters. The target block is encoded using the predictor block.
Abstract:
An image comprising varying illumination is selected. Instances of a repeating pattern in the image are determined. Illumination values for pixels at locations within instances of the repeating pattern are calculated responsive to pixel intensities of pixels at corresponding locations in other instances of the repeating pattern. The varying illumination is removed from the image responsive to the illumination values.
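The idea of comparing each pixel with pixels at corresponding locations in other pattern instances can be shown on a single row of intensities. The sketch below rests on several assumptions that the abstract does not state: the pattern period is already known, the row length is a multiple of that period, there are at least two instances, and illumination acts as one multiplicative gain per instance. The function names are hypothetical.

```python
from statistics import mean, median


def estimate_instance_gains(row, period):
    """Estimate one illumination gain per pattern instance.
    Each pixel is divided by the median of the pixels at the same offset
    in all other instances; the ratios are averaged within the instance."""
    instances = [row[i:i + period] for i in range(0, len(row), period)]
    gains = []
    for k, inst in enumerate(instances):
        others = [o for m, o in enumerate(instances) if m != k]
        ratios = []
        for j, value in enumerate(inst):
            reference = median(o[j] for o in others)
            if reference > 0:
                ratios.append(value / reference)
        gains.append(mean(ratios) if ratios else 1.0)
    return gains


def remove_illumination(row, period):
    """Divide every pixel by the estimated gain of its pattern instance."""
    gains = estimate_instance_gains(row, period)
    return [value / gains[i // period] if gains[i // period] > 0 else value
            for i, value in enumerate(row)]
```

For example, a row with four copies of the pattern 2, 4, 6, where the third copy is half as bright, yields gains of 1.0, 1.0, 0.5, 1.0, and the corrected row restores the shadowed copy. The median keeps a single shadowed instance from skewing the reference used for the others.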
Abstract:
Clustering algorithms such as the k-means clustering algorithm are used in applications that process entities with spatial and/or temporal characteristics, for example, media objects representing audio, video, or graphical data. Feature vectors representing characteristics of the entities are partitioned using clustering methods that produce results sensitive to an initial set of cluster seeds. The set of initial cluster seeds is generated using principal component analysis of either the complete feature vector set or a subset thereof. The feature vector set is divided into a desired number of initial clusters, and a seed is determined from each initial cluster.
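A possible reading of the seeding step is sketched below. The abstract does not say how the feature set is divided, so this example assumes one specific scheme: project the vectors onto the first principal component, sort them, cut the sorted list into `k` equal-sized groups, and use each group's centroid as a seed. The principal component is approximated with power iteration; `first_principal_component` and `pca_seeds` are hypothetical names, and `k` is assumed not to exceed the number of vectors.

```python
import random


def first_principal_component(vectors, iterations=100, seed=0):
    """Approximate the dominant eigenvector of the covariance matrix
    by power iteration. Returns (mean_vector, unit_direction)."""
    n, d = len(vectors), len(vectors[0])
    mean = [sum(v[i] for v in vectors) / n for i in range(d)]
    centered = [[v[i] - mean[i] for i in range(d)] for v in vectors]
    rng = random.Random(seed)  # seeded start keeps the result reproducible
    direction = [rng.gauss(0, 1) for _ in range(d)]
    for _ in range(iterations):
        # Apply X^T (X w) without forming the covariance matrix explicitly.
        scores = [sum(c[i] * direction[i] for i in range(d)) for c in centered]
        updated = [sum(s * c[i] for s, c in zip(scores, centered))
                   for i in range(d)]
        norm = sum(u * u for u in updated) ** 0.5
        if norm == 0:
            break  # degenerate data (no variance along the current direction)
        direction = [u / norm for u in updated]
    return mean, direction


def pca_seeds(vectors, k):
    """Split the vectors into k groups along the first principal component
    and return the centroid of each group as an initial cluster seed."""
    mean, direction = first_principal_component(vectors)
    d = len(mean)

    def projection(v):
        return sum((v[i] - mean[i]) * direction[i] for i in range(d))

    ordered = sorted(vectors, key=projection)
    n = len(ordered)
    seeds = []
    for c in range(k):
        group = ordered[c * n // k:(c + 1) * n // k]
        seeds.append([sum(v[i] for v in group) / len(group) for i in range(d)])
    return seeds
```

The resulting seeds can be passed to any k-means implementation that accepts explicit initial centers. Because the seeds are derived deterministically from the data, repeated runs start from the same configuration.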
Abstract:
An image comprising varying illumination is selected. Patches of pixels from among the plurality of pixels within the image are identified. Similarities between pairs of patches of pixels are calculated based on pixel intensities associated with the pairs of patches of pixels. Illumination values for the plurality of pixels within the image are calculated based on the calculated similarities between the pairs of patches of pixels. The illumination variation is removed from the image based on the calculated illumination values for the plurality of pixels within the image.
Abstract:
An image processing system is provided for encoding images based on example-based compression. The system selects a set of candidate dictionary predictor entries to encode a portion of an image based at least in part on the neighbors of the portion. The spatial continuity between portions of the image is exploited by the image processing system by selecting corresponding dictionary predictor entries that have the same offset vector as the portion of the image and its neighboring portions.
Abstract:
An image comprising varying illumination is selected. Patches of pixels from among the plurality of pixels within the image are identified. Similarities between pairs of patches of pixels are calculated based on pixel intensities associated with the pairs of patches of pixels. Illumination values for the plurality of pixels within the image are calculated based on the calculated similarities between the pairs of patches of pixels. The illumination variation is removed from the image based on the calculated illumination values for the plurality of pixels within the image.
Abstract:
An image processing system is provided for encoding videos based on example-based compression. To select the dictionary predictor entries to encode a video, the image processing system reduces the complexity of the video by decomposing the video into smaller pieces. By breaking the video into simpler pieces, it is easier to locate dictionary predictor entries that are similar to the pieces of the video. The image processing system may decompose the video into one or more space-time tubes. For each space-time tube, the image processing system selects dictionary predictor entries to encode the tube.