Abstract:
A method, one or more non-transitory computer readable media, and an apparatus for implementing a reduced precision bounding volume hierarchy ray traversal for graphics processing are disclosed. The method includes the step of reusing, in a child node, a computation for a parent node in a reduced precision bounding volume hierarchy ray traversal for graphics processing. The computational cost of the reduced precision bounding volume hierarchy ray traversal can be reduced by reusing, in the child node, the computation for the parent node.
Abstract:
By determining if there is redundancy between sub-samples in a multi-sampled anti-aliasing shader, determining which of at least two optimization techniques to use to reduce redundancy and eliminating a redundant shader thread using the selected technique, performance can be improved and power consumption may be reduced when sampling at the sample frequency in multi-sampled anti-aliasing.
Abstract:
Attributes of large scale computer vision systems may be made available to users of more limited processer-based systems by dynamically and adaptively updating recognizers into a smaller scale device from a connected larger scale device, based on the user's situational context and behavior. A recognizer is a hardware, software or firmware module specialized to use computer vision to recognize a defined class of imaged objects.
Abstract:
The power consumption of processor-based devices may be reduced by reducing the consumption of power during graphics processing. In some embodiments, the precision of pixel shading in parts of images where artifacts are less objectionable may be reduced. For example, in areas the user is not directly looking at, precision may be reduced to save power. At the same time, because a person is not focusing on those regions, even if usually perceptible artifacts occur because of the reduced precision, an overall pleasing depiction may be achieved.
Abstract:
A tracking algorithm includes a tracking failure detection technique using a key points selection algorithm. As a result, the described system is able to work on an embedded device in real time, providing high quality vehicle detection and tracking, in some embodiments. A vision system detects and tracks vehicles from sequences of images taken from another moving vehicle in real-time on an embedded platform. The system can achieve real-time performance on an embedded platform in some embodiments, taking into account that modern boosting detectors are too slow for use in such a system. A tracker includes an algorithm for tracking and an algorithm for detection of tracking failures. The tracking algorithm is based on an optical flow calculation for key points selected based on the distribution of features from the last detection. The algorithm for detection of tracking failures is based on an estimation of low confidence detections.
Abstract:
In accordance with some embodiments, multi-sampling may be used together with texture filtering and particularly texture filtering that generally uses rectangular grids of samples. This is accomplished by performing the texture filtering before doing the resolve, while conventionally the resolve is done and then the texture filtering is done. In addition, each sample is filtered as if it were the only sample.
Abstract:
In accordance with some embodiments, compaction, as contrasted with compression, is used to reduce the footprint of a near memory. In compaction, the density of data storage within a storage device is increased. In compression, the number of bits used to represent information is reduced. Thus you can have compression while still having sparse or non-contiguously arranged storage. As a result, compression may not always reduce the memory footprint. By compacting compressed data, the footprint of the information stored within the memory may be reduced. Compaction may reduce the need for far memory accesses in some cases.
Abstract:
A thread dependency scheme may significantly reduce the dependency penalty and improve the parallelism efficiency in some embodiments in video compression techniques with relatively high dependencies, such as VP9. One fundamental feature is to split an individual large kernel into multiple, less dependent, smaller kernels, thereby significantly increasing the number of software threads that can potentially run in parallel. Another feature is to define the larger number of thread dependencies (superset of all the dependency candidates for each thread), with the specific thread's spatial position and associated context, and mask out some of the unnecessary thread dependencies.
Abstract:
By converting a first color space to a second color space, using a two-dimensional lookup table in said second color space, and converting from said second color space to said first color space, it may be possible to use one or more two-dimensional lookup tables (LUTs) to do a task conventionally handled by three-dimensional lookup tables. This may reduce storage requirements and memory bandwidth requirements in some embodiments. In general a color pixel with N color components can be processed with n number of M dimensional LUT where M
Abstract:
A frame of pixel data may be burst at a higher frame rate to create a lower effective refresh rate when the actual image update rate is lower than the frame rate. This results in the ability to power manage the transmit portion of the display engine and the receive portion of the panel electronics, while reducing display initiated repetitive memory traffic to conserve power.