Abstract:
Methods and systems are provided for performing material capture to determine properties of an imaged surface. A plurality of images can be received depicting a material surface. The plurality of images can be calibrated to align corresponding pixels of the images and determine reflectance information for at least a portion of the aligned pixels. After calibration, a set of reference materials from a material library can be selected using the calibrated images. The set of reference materials can be used to determine a material model that accurately represents properties of the material surface.
Abstract:
Embodiments of the present invention facilitate automatically white-balancing images captured under mixed lighting conditions. More particularly, some embodiments are directed to leveraging flash photography to capture two images in quick succession, one with the flash activated and one without. By combining these two images, a per-pixel white balance kernel can be automatically determined and used to generate a white-balanced image without requiring any user input or user assumptions about the lighting in the scene. In embodiments, the white balance kernels for pixels that lie in the shadow or appear as specular spots are masked and can be interpolated from the white balance kernels at unmasked kernels. Motion effects between the flash and non-flash images may also be reduced.
Abstract:
Systems and methods are disclosed herein for 3-Dimensional portrait reconstruction from a single photo. A face portion of a person depicted in a portrait photo is detected and a 3-Dimensional model of the person depicted in the portrait photo constructed. In one embodiment, constructing the 3-Dimensional model involves fitting hair portions of the portrait photo to one or more helices. In another embodiment, constructing the 3-Dimensional model involves applying positional and normal boundary conditions determined based on one or more relationships between face portion shape and hair portion shape. In yet another embodiment, constructing the 3-Dimensional model involves using shape from shading to capture fine-scale details in a form of surface normals, the shape from shading based on an adaptive albedo model and/or a lighting condition estimated based on shape fitting the face portion.
Abstract:
Image editing techniques are disclosed that support a number of physically-based image editing tasks, including object insertion and relighting. The techniques can be implemented, for example in an image editing application that is executable on a computing system. In one such embodiment, the editing application is configured to compute a scene from a single image, by automatically estimating dense depth and diffuse reflectance, which respectively form the geometry and surface materials of the scene. Sources of illumination are then inferred, conditioned on the estimated scene geometry and surface materials and without any user input, to form a complete 3D physical scene model corresponding to the image. The scene model may include estimates of the geometry, illumination, and material properties represented in the scene, and various camera parameters. Using this scene model, objects can be readily inserted and composited into the input image with realistic lighting, shadowing, and perspective.
Abstract:
Systems and methods are provided for depth map estimation using three-dimensional epipolar data structures. The image manipulation application receives image data depicting an image space from a multiple perspectives. The image manipulation application generates at least one three-dimensional epipolar data structure from the image data. The at least one three-dimensional epipolar data structure includes data describing the difference in position of at least one object between the perspectives. The at least one three-dimensional epipolar data structure corresponds to at least one region of the image space. The image manipulation application generates a depth map based on the at least one three-dimensional epipolar data structure.
Abstract:
Systems and methods are provided for depth map estimation using three-dimensional epipolar data structures. The image manipulation application receives image data depicting an image space from a multiple perspectives. The image manipulation application generates at least one three-dimensional epipolar data structure from the image data. The at least one three-dimensional epipolar data structure includes data describing the difference in position of at least one object between the perspectives. The at least one three-dimensional epipolar data structure corresponds to at least one region of the image space. The image manipulation application generates a depth map based on the at least one three-dimensional epipolar data structure.
Abstract:
Techniques are disclosed for performing manipulation of facial images using an artificial neural network. A facial rendering and generation network and method learns one or more compact, meaningful manifolds of facial appearance, by disentanglement of a facial image into intrinsic facial properties, and enables facial edits by traversing paths of such manifold(s). The facial rendering and generation network is able to handle a much wider range of manipulations including changes to, for example, viewpoint, lighting, expression, and even higher-level attributes like facial hair and age—aspects that cannot be represented using previous models.
Abstract:
Embodiments of the present invention facilitate automatically white-balancing images captured under mixed lighting conditions. More particularly, some embodiments are directed to leveraging flash photography to capture two images in quick succession, one with the flash activated and one without. By combining these two images, a per-pixel white balance kernel can be automatically determined and used to generate a white-balanced image without requiring any user input or user assumptions about the lighting in the scene. In embodiments, the white balance kernels for pixels that lie in the shadow or appear as specular spots are masked and can be interpolated from the white balance kernels at unmasked kernels. Motion effects between the flash and non-flash images may also be reduced.
Abstract:
Embodiments of the present invention provide systems, methods, and computer storage media directed at relighting a target image based on a lighting effect from a reference image. In one embodiment, a target image and a reference image are received, the reference image includes a lighting effect desired to be applied to the target image. A lighting transfer is performed using color data and geometrical data associated with the reference image and color data and geometrical data associated with the target image. The lighting transfer causes generation of a relit image that corresponds with the target image having a lighting effect of the reference image. The relit image is provided for display to a user via one or more output devices. Other embodiments may be described and/or claimed.
Abstract:
Systems and methods are disclosed herein for 3-Dimensional portrait reconstruction from a single photo. A face portion of a person depicted in a portrait photo is detected and a 3-Dimensional model of the person depicted in the portrait photo constructed. In one embodiment, constructing the 3-Dimensional model involves fitting hair portions of the portrait photo to one or more helices. In another embodiment, constructing the 3-Dimensional model involves applying positional and normal boundary conditions determined based on one or more relationships between face portion shape and hair portion shape. In yet another embodiment, constructing the 3-Dimensional model involves using shape from shading to capture fine-scale details in a form of surface normals, the shape from shading based on an adaptive albedo model and/or a lighting condition estimated based on shape fitting the face portion.