Abstract:
A mechanism is described for facilitating adaptive resolution and viewpoint-prediction for immersive media in computing environments. An apparatus of embodiments, as described herein, includes one or more processors to receive viewing positions associated with a user with respect to a display, and analyze relevance of media contents based on the viewing positions, where the media content includes immersive videos of scenes captured by one or more cameras. The one or more processors are further to predict portions of the media contents as relevant portions based on the viewing positions and transmit the relevant portions to be rendered and displayed.
Abstract:
A mechanism is described for facilitating adaptive resolution and viewpoint-prediction for immersive media in computing environments. An apparatus of embodiments, as described herein, includes one or more processors to receive viewing positions associated with a user with respect to a display, and analyze relevance of media contents based on the viewing positions, where the media content includes immersive videos of scenes captured by one or more cameras. The one or more processors are further to predict portions of the media contents as relevant portions based on the viewing positions and transmit the relevant portions to be rendered and displayed.
Abstract:
Techniques related to video pre-processing for video coding are discussed. Such video pre-processing techniques may include applying adaptive temporal and spatial filtering to pixel values of video frames of input video to generate pre-processed video such that the adaptive temporal and spatial filtering includes blending spatial and temporal filtering of the individual pixel value when the block of pixels is a non-motion block and spatial-only filtering the individual pixel value when the block of pixels is a motion block.
Abstract:
A mechanism is described for facilitating adaptive resolution and viewpoint-prediction for immersive media in computing environments. An apparatus of embodiments, as described herein, includes one or more processors to receive viewing positions associated with a user with respect to a display, and analyze relevance of media contents based on the viewing positions, where the media content includes immersive videos of scenes captured by one or more cameras. The one or more processors are further to predict portions of the media contents as relevant portions based on the viewing positions and transmit the relevant portions to be rendered and displayed.
Abstract:
Techniques related to object detection using binary coded images are discussed. Such techniques may include performing object detection based on multiple spatial correlation mappings between a generated binary coded image and a binary coded image based object detection model and nesting look up tables such that binary coded representations are grouped and such groups are associated with confidence values for performing object detection.
Abstract:
Techniques for inter-layer residual prediction are described. In one embodiment, for example, an apparatus may comprise an encoding component to determine whether a predicted motion for an enhancement layer block is consistent with a predicted motion for a collocated lower-layer block, determine whether to apply inter-layer residual prediction to the enhancement layer block based on whether the predicted motion for the enhancement layer block is consistent with the predicted motion for the collocated lower-layer block, and in response to a determination that inter-layer residual prediction is to be applied to the enhancement layer block, generate a predicted residual for the enhancement layer block based on a residual for the collocated lower-layer block and generate a second-order residual for the enhancement layer block by comparing a calculated residual to the predicted residual. Other embodiments are described and claimed.