Abstract:
Methods and apparatus for determining a depth of an object within a scene are provided. Image data of a scene can be captured using a lens configured to project an image of the scene onto an image sensor. The lens has a known focal length and is movable between at least a first lens position and a second lens position. A first image of the scene is captured with the lens at a first lens position, and a second image of the scene is captured with the lens at a second, different position. By measuring a first dimension of the object using the first image and a second dimension of the object using the second image, a depth of the object may be determined based upon a ratio of the first and second dimensions, the focal length of the lens, and a distance between the first and second lens positions.
Abstract:
A method performed by an electronic device is described. The method includes obtaining a first image from a first camera, the first camera having a first focal length and a first field of view. The method also includes obtaining a second image from a second camera, the second camera having a second focal length and a second field of view disposed within the first field of view. The method further includes aligning at least a portion of the first image and at least a portion of the second image to produce aligned images. The method additionally includes fusing the aligned images based on a diffusion kernel to produce a fused image. The diffusion kernel indicates a threshold level over a gray level range. The method also includes outputting the fused image. The method may be performed for each of a plurality of frames of a video feed.
Abstract:
A method includes identifying one or more codewords of a bit sequence that fail to satisfy at least one codeword constraint. The method also includes removing the one or more codewords from the bit sequence to generate a punctured bit sequence. The method includes, in response to determining that the punctured bit sequence is symmetric, generating a hermitian symmetric codebook primitive based at least in part on the punctured bit sequence, where the hermitian symmetric codebook primitive is useable to form a diffractive optical element (DOE) of a structured light depth sensing system.
Abstract:
Systems and method for generating depth maps using active sensing technology, for scenes with moving objects, is disclosed. One aspect provides for a method that includes estimating areas in adjacent frames that correspond to a moving object by generating a probability map for each received frame, the probability map comprising a probability value at each pixel. The method also includes computing a convex temporal average map using a plurality of the reflected structured light frames including at least the prior frame received at time t−1, the received frame received at time t, and the next frame received at time t+1, the value at each pixel of the convex temporal average map weighted and normalized by the probability map at each pixel at each time. The method also includes determining the codewords at each pixel in the convex temporal average map, and generating a depth map from the determined codewords.
Abstract:
A method operational on a receiver device for decoding a codeword is provided. At least a portion of a composite code mask is obtained, via a receiver sensor, and projected on the surface of a target object. The composite code mask may be defined by a code layer and a carrier layer. A code layer of uniquely identifiable spatially-coded codewords may be defined by a plurality of symbols. A carrier layer may be independently ascertainable and distinct from the code layer and may include a plurality of reference objects that are robust to distortion upon projection. At least one of the code layer and carrier layer may have been pre-shaped by a synthetic point spread function prior to projection. The code layer may be adjusted, at a processing circuit, for distortion based on the reference objects within the portion of the composite code mask.
Abstract:
Aspects relate to a method of generating a high-resolution image containing depth information of an object. In one aspect, the method includes downsampling a first reference image and a second reference image from a first resolution to a second resolution, wherein the first resolution is higher than the second resolution, and wherein the first reference image and the second reference image comprising a stereo image pair. The method further includes generating a depth map at the second resolution based on global minimization techniques, using the downsampled stereo image pair. The method also includes upsampling the depth map from the second resolution to the first resolution and using a guided filter to align contours of the upsampled depth map to contours of the first reference image.
Abstract:
Aspects relate to an depth sensing system for capturing an image containing depth information of an object. In one embodiment, a depth sensing device for use in conjunction with multiple depth sensing devices for capturing an image containing depth information of an object comprises a near-infrared transmitter comprising a laser capable of producing a near infra-red light beam, a diffractive optical element positioned to receive a light beam emitted from the laser, the diffractive optical element, and a collimating lens, and a near-infrared receiver coupled to the transmitter in a relative position, the receiver comprising a sensor assembly capable of producing an image of the received light, the depth sensing device being configured to transmit and receive near infra-red light beams during a time period that is different than any of the other of two or more transmitter-receiver pairs of devices in communication with the depth sensing device.
Abstract:
Certain aspects relate to systems and techniques for performing local intensity equalization on images in a set of images exhibiting local intensity variations. For example, the local intensity equalization can be used to perform accurate region matching and alignment of the images. The images can be partitioned into regions of pixel blocks, for instance based on location, shape, and size of identified keypoints in the images. Regions depicting the same feature in the images can be equalized with respect to intensity. Region matching based on the keypoints in the intensity-equalized regions can be performed with accuracy even in images captured by asymmetric sensors or exhibiting spatially varying intensity.
Abstract:
Described are systems and methods for measuring objects using stereoscopic imaging. After determining keypoints within a set of stereoscopic images, a user may select a desired object within an imaged scene to be measured. Using depth map information and information about the boundary of the selected object, the desired measurement may be calculated and displayed to the user on a display device. Tracking of the object in three dimensions and continuous updating of the measurement of a selected object may also be performed as the object or the imaging device is moved.
Abstract:
A method for generating codes for a code mask is provided. A plurality of symbols may be arranged into an n1 by n2 symbol structure, where n1 and n2 are integer values. A plurality of codewords may be defined from different overlapping k1 by k2 windows within the symbol structure, wherein co-linear and spatially overlapping windows define unique codewords, and the codewords are unique in a first direction of the symbol structure but are repeated in a second direction that is perpendicular to the first direction. A plurality of the symbol structures as a code mask, wherein symbols in two adjacent k1 by k2 windows are selected so as to avoid codeword aliasing of codewords in the two adjacent k1 by k2 windows.