Abstract:
A method includes identifying an optimal backlight value for at least one quality level of a first video segment.The method also includes transmitting data for thefirst video segment. The transmitted data for the first video segment includes a message containinga first set of display adaptation information. The first set of display adaptation information includes the optimal backlight value for the at least one quality level of the first video segment.The method further includes identifying a backlight value for the at least one quality level of a second video segment.The method also includes determining a maximum backlight value change threshold between successive video segments. In addition, the method includes applying temporal smoothing between the optimal backlight value and the backlight value based on the maximum backlight value change threshold.
Abstract:
A method and apparatus encode and decode video by determining whether to use discrete cosine transform (DCT) and DST for each of the horizontal and vertical transforms. During encoding, an intra-prediction is performed based on an intra-prediction mode determined for an MxN input image block to obtain an MxN intra-prediction residue matrix (E). Based on the intra-prediction mode, each of a horizontal transform and a vertical transform is performed using one of DCT and DST according to the intra-prediction mode. During decoding, the intra-prediction mode is determined from an incoming video bitstream. The MxN transformed coefficient matrix of the error residue is obtained from the video bitstream using an inverse quantizer. Based on the intra prediction mode, one of DCT and DST is performed for each of an inverse vertical transform and an inverse horizontal transform.
Abstract:
A decoder for video processing includes a receiver configured to receive a bitstream associated with a video from a coder. The decoder also includes a processor configured to parse the bitstream to determine a percentage of at least one a number of six tap filterings or a number of alpha point deblocking instances, in a specified period. The processor is further configured to determine a voltage and frequency to be used for decoding the video proportional to the percentage of the at least one of the number of six tap filterings or the number of alpha point deblocking instances. The processor is configured to decode the video at the determined voltage and frequency. Other embodiments including a encoder and method also are disclosed.
Abstract:
A method and apparatus for improving matching performance and compression efficiency with escriptor code segment collision probanility optimization. The method include extracting a global descriptor from a query image with a plurality of segments, identifying segments with a desirable discriminating potential by analyzing data of the plurality of segments based on an available image database, creating a bitmask where the identified segments are active, and masking any segment of the plurality of segments of the global descriptor that are inactive according to the bitmask.
Abstract:
A method comprises computing a color factor value indicating an amount of color gradients in at least one color channel from the query image. The method comprises combining the color-keypoints with the gray-keypoints when the color factor value is greater than a threshold. A method for performing a visual search comprises extracting a plurality of local descriptors from a query image and then selecting a subset of them based on various criteria's such as visual meaning score. A method comprises aggregating each mean vector for each visual codeword from distances between each visual codeword and local descriptors. The method comprises aggregating variance vector for each visual codeword from the distance between each visual codeword, and local descriptors. The method comprises transmitting aggregated mean vector information and aggregated variance vector information to a search server for efficient image retrieval.
Abstract:
An apparatus and method for processing video data are provided. The method includes determining a primary transform C N for application to residual data at the encoder, determining a secondary transform Tr K for application to the residual data, applying the primary transform C N to the residual data, and selectively applying the secondary transform Tr K to the residual data, wherein N denotes the length size of the input vector on which the primary transform C N is applied, and K denotes the length of the first few coefficients of the primary transform output on which the secondary transform Tr K is applied. Similar inverse operations are performed at the decoder, viz., selectively applying an inverse secondary transform inv(Tr K ) at the decoder for the input residual data, followed by application of the inverse primary transform inv(C N ).
Abstract:
In accordance with an aspect of the present disclosure, a method to perform a visual search by a visual search server is provided. The method includes receiving a visual search request including information relating to distinctive features within a query image, performing a forward matching operation on the distinctive features within the query image to distinctive features within each of the one or more repository images, if predetermined criteria are met, performing a backward matching operation on distinctive features within each of the one or more repository images to the distinctive features within the query image, and determining whether each of the one or more repository images correlates to the query image based upon results of at least one of the first matching operation and the second matching operation.
Abstract:
A video decoder performs a method of dynamic voltage/frequency scaling (DVFS) for video processing. The method includes parsing a bitstream associated with a video to determine embedded information associated with a complexity of a first portion of the video, estimating the complexity of the first portion of the video using the embedded information, determining a voltage and frequency to be used for decoding the first portion of the video based on the estimated complexity, and decoding the first portion of the video at the determined voltage and frequency.
Abstract:
An apparatus and method for video fingerprinting are provided. The method includes, for each frame of a video sequence including a plurality of frames, removing a portion of the frame, dividing a remaining portion of the frame into blocks, dividing each block into sub-blocks, computing a block level feature as a mean of pixels in each sub-block within the block, concatenating all block level features in the frame, and concatenating features of all frames in the video sequence.
Abstract:
A video processing system includes prediction primary transforms, quantization, entropy coding and filtering configured to receive and compress video information and output compressed video information corresponding to the received video information. The compressed video information comprising prediction mode, transform block size, quantization parameter, and filtering type. The video processing system also includes a secondary transform configured to receive and compress the compressed video information. The video processing system also includes a quantization stage configured to receive and compress the transformed coefficients. The video processing system also includes an entropy coding stage configured to convert the compressed video information into binary bits. The video processing system also includes a filtering stage configured to improve the reconstructed video information for better prediction.