摘要:
An image is segmented into superpixels by constructing a graph with vertices connected by edges, wherein each vertex corresponds to a pixel in the image, and each edge is associated with a weight indicating a similarity of the corresponding pixels, A subset of edges in the graph are selected to segment the graph into subgraphs, wherein the selecting maximizes an objective function based on an entropy rate and a balancing term. The edges with maximum gains are added to the graph until a number of subgraphs is equal to some threshold.
摘要:
An image is segmented into superpixels by constructing a graph with vertices connected by edges, wherein each vertex corresponds to a pixel in the image, and each edge is associated with a weight indicating a similarity of the corresponding pixels, A subset of edges in the graph are selected to segment the graph into subgraphs, wherein the selecting maximizes an objective function based on an entropy rate and a balancing term. The edges with maximum gains are added to the graph until a number of subgraphs is equal to some threshold.
摘要:
A pose of an object is determine by acquiring sets of images of the object by a camera, wherein the object has a thread arranged on a surface such that a local region of the object appears substantially spherical, wherein the camera is at a different point of view for each set, and wherein each image in each set is acquired while the scene is illuminated from a different direction. A set of features is extracted from each image, wherein the features correspond to points on the surface having normals towards the camera. A parametric line is fitted to the points for each image, wherein the line lies on a plane joining a center of the camera and an axis of the object. Then, geometric constraints are applied to lines to determine the pose of the object.
摘要:
A pose of an object is estimated by first defining a set of pair features as pairs of geometric primitives, wherein the geometric primitives include oriented surface points, oriented boundary points, and boundary line segments. Model pair features are determined based on the set of pair features for a model of the object. Scene pair features are determined based on the set of pair features from data acquired by a 3D sensor, and then the model pair features are matched with the scene pair features to estimate the pose of the object.
摘要:
A pose of an object is estimated by first defining a set of pair features as pairs of geometric primitives, wherein the geometric primitives include oriented surface points, oriented boundary points, and boundary line segments. Model pair features are determined based on the set of pair features for a model of the object. Scene pair features are determined based on the set of pair features from data acquired by a 3D sensor, and then the model pair features are matched with the scene pair features to estimate the pose of the object.
摘要:
A three-dimensional (3D) pose of a 3D object in an environment is determined by extracting features from an image acquired of the environment by a camera. The features are matched to a 3D model of the environment to determine correspondences. A camera reference frame of the image and a world reference frame of the environment are transformed to a corresponding intermediate camera reference frame and a corresponding world reference frame using the correspondences. Geometrical constraints are applied to the intermediate camera reference frame and the intermediate world reference frame to obtain a constrained intermediate world reference frame and a constrained world reference frame. The 3D pose is then determined from parameters of the constrained intermediate world reference frame and the constrained world reference frame.
摘要:
A location and orientation in an environment is determined by first acquiring a real omni-directional image of an unknown skyline in the environment. A set of virtual omni-directional images of known skylines are synthesized from a 3D model of the environment, wherein each virtual omni-directional image is associated with a known location and orientation. The real omni-directional image with each virtual omni-directional images to determine a best matching virtual omni-directional image with the associated known location and orientation.
摘要:
Poses of a movable camera relative to an environment are obtained by determining point correspondences from a set of initial images and then applying 2-point motion estimation to the point correspondences to determine a set of initial poses of the camera. A point cloud is generated from the set of initial poses and the point correspondences. Then, for each next image, the point correspondences and corresponding poses are determined, while updating the point cloud.
摘要:
A three-dimensional (3D) location of a reflection point of a ray between a point in a scene (PS) and a center of projection (COP) of a camera of a catadioptric system is determined. The catadioptric system is non-central and includes the camera and a reflector, wherein a surface of the reflector is a quadric surface rotationally symmetric around an axis of symmetry. The 3D location of the reflection point is determined based on a law of reflection, an equation of the reflector, and an equation describing a reflection plane defined by the COP, the PS, and a point of intersection of a normal to the reflector at the reflection point with the axis of symmetry.
摘要:
Embodiment of invention discloses a system and a method for determining a three-dimensional (3D) location of a folding point of a ray between a point in a scene (PS) and a center of projection (COP) of a camera of a catadioptric system. One embodiment maps the catadioptric system, including 3D locations of the PS and the COP on a two-dimensional (2D) plane defined by an axis of symmetry of a folding optical element and the PS to produce a conic and 2D locations of the PS and COP on the 2D plane, and determines a 2D location of the folding point on the 2D plane based on the conic, the 2D locations of the PS and the COP. Next, the embodiment determines the 3D location of the folding point from the 2D location of the folding point on the 2D plane.