Abstract:
Several methods and systems for facilitating multimedia data encoding are disclosed. In an embodiment, a plurality of picture buffers associated with multimedia data are received in an order of capture associated with the plurality of picture buffers. Buffer information is configured for each picture buffer from among the plurality of picture buffers comprising at least one of a metadata associated with the corresponding picture buffer and one or more encoding parameters for the corresponding picture buffer. A provision of picture buffers in an order of encoding is facilitated based on the configured buffer information.
Abstract:
A matching accelerator in the form of a hardware accelerator configured to perform matrix multiplication and/or additional operations is used to optimize keypoint matching. An SSE calculation may be determined by utilizing the matching accelerator to perform matrix multiplication to obtain a cost matrix for two sets of keypoint descriptors from two images. The hardware accelerator may determine a best cost calculation for each keypoint in each direction, which is utilized to perform keypoint matching.
Abstract:
The disclosure provides a noise filter. The noise filter includes a motion estimation (ME) engine. The ME receives a current frame and a reference frame. The current frame comprising a current block and the reference frame includes a plurality of reference blocks. The ME engine generates final motion vectors. The current block comprises a plurality of current pixels. A motion compensation unit generates a motion compensated block based on the final motion vectors and the reference frame. The motion compensated block includes a plurality of motion compensated pixels. A weighted average filter multiplies each current pixel of the plurality of current pixels and a corresponding motion compensated pixel of the plurality of motion compensated pixels with a first weight and a second weight respectively. The weighted average filter generates a filtered block. A blockiness removal unit is coupled to the weighted average filter and removes artifacts in the filtered block.
Abstract:
A method of determining a summation of pixel characteristics for a rectangular region of a digital image includes determining if a base address for a data element in an integral image buffer is aligned for an SIMD operation by a processor embedded in an electronic assembly configured to perform Haar-like feature calculations. The data element represents a corner of the rectangular region of an integral image. The integral image is a representation of the digital image. The integral image is formed by data elements stored in the integral image buffer. The data element is loaded from the integral image buffer to the processor when the base address is aligned for the SIMD operation. An offset data element of an offset integral image is loaded from an offset integral buffer when the base address is non-aligned for the SIMD operation. The offset data element represents the corner of the rectangular region.
Abstract:
A method for sparse optical flow based tracking in a computer vision system is provided that includes detecting feature points in a frame captured by a monocular camera in the computer vision system to generate a plurality of detected feature points, generating a binary image indicating locations of the detected feature points with a bit value of one, wherein all other locations in the binary image have a bit value of zero, generating another binary image indicating neighborhoods of currently tracked points, wherein locations of the neighborhoods in the binary image have a bit value of zero and all other locations in the binary image have a bit value of one, and performing a binary AND of the two binary images to generate another binary image, wherein locations in the binary image having a bit value of one indicate new feature points detected in the frame.
Abstract:
A method of determining a summation of pixel characteristics for a rectangular region of a digital image includes determining if a base address for a data element in an integral image buffer is aligned for an SIMD operation by a processor embedded in an electronic assembly configured to perform Haar-like feature calculations. The data element represents a corner of the rectangular region of an integral image. The integral image is a representation of the digital image. The integral image is formed by data elements stored in the integral image buffer. The data element is loaded from the integral image buffer to the processor when the base address is aligned for the SIMD operation. An offset data element of an offset integral image is loaded from an offset integral buffer when the base address is non-aligned for the SIMD operation. The offset data element represents the corner of the rectangular region.
Abstract:
A method of determining a summation of pixel characteristics for a rectangular region of a digital image includes determining if a base address for a data element in an integral image buffer is aligned for an SIMD operation by a processor embedded in an electronic assembly configured to perform Haar-like feature calculations. The data element represents a corner of the rectangular region of an integral image. The integral image is a representation of the digital image. The integral image is formed by data elements stored in the integral image buffer. The data element is loaded from the integral image buffer to the processor when the base address is aligned for the SIMD operation. An offset data element of an offset integral image is loaded from an offset integral buffer when the base address is non-aligned for the SIMD operation. The offset data element represents the corner of the rectangular region.
Abstract:
Estimation of the ground plane of a three dimensional (3D) point cloud based modifications to the random sample consensus (RANSAC) algorithm is provided. The modifications may include applying roll and pitch constraints to the selection of random planes in the 3D point cloud, using a cost function based on the number of inliers in the random plane and the number of 3D points below the random plane in the 3D point cloud, and computing a distance threshold for the 3D point cloud that is used in determining whether or not a 3D point in the 3D point cloud is an inlier of a random plane.
Abstract:
This invention transforms a list of feature points in raster scan order into a list of maxima suppressed feature points. A working buffer has two more entries than the width of the original image. Each entry is assigned to an x coordinate of the original image. Each entry stores a combined y coordinate and reliability score for each feature point in the original list. This process involves a forward scan and a backward scan. For each original feature point its x coordinate defines the location within the working buffer where neighbor feature points would be stored if they exist. The working buffer initial data and the y coordinates assure a non suppress comparison result if the potential neighbors are not actual neighbors. For actual neighbor data, the y coordinates match and the comparison result depends solely upon the relative reliability scores.
Abstract:
A method for estimating time to collision (TTC) of a detected object in a computer vision system is provided that includes determining a three dimensional (3D) position of a camera in the computer vision system, determining a 3D position of the detected object based on a 2D position of the detected object in an image captured by the camera and an estimated ground plane corresponding to the image, computing a relative 3D position of the camera, a velocity of the relative 3D position, and an acceleration of the relative 3D position based on the 3D position of the camera and the 3D position of the detected object, wherein the relative 3D position of the camera is relative to the 3D position of the detected object, and computing the TTC of the detected object based on the relative 3D position, the velocity, and the acceleration.