摘要:
A homography between two captured images of a planar object is decomposed into at least one possible solution, and typically at least two ambiguous solutions. The ambiguity between the two solutions is removed, or a single solution validated, using measurements from orientation sensors. The measurements from orientation sensors may be used by comparing at least one of the yaw, pitch, and/or roll angles derived from a relative rotation matrix for the one or more solutions to a corresponding at least one of the yaw, pitch, and/or roll angles derived from the measurements from the orientation sensors.
摘要:
A homography between two captured images of a planar object is decomposed into at least one possible solution, and typically at least two ambiguous solutions. The ambiguity between the two solutions is removed, or a single solution validated, using measurements from orientation sensors. The measurements from orientation sensors may be used by comparing at least one of the yaw, pitch, and/or roll angles derived from a relative rotation matrix for the one or more solutions to a corresponding at least one of the yaw, pitch, and/or roll angles derived from the measurements from the orientation sensors.
摘要:
A multi-user augmented reality (AR) system operates without a previously acquired common reference by generating a reference image on the fly. The reference image is produced by capturing at least two images of a planar object and using the images to determine a pose (position and orientation) of a first mobile platform with respect to the planar object. Based on the orientation of the mobile platform, an image of the planar object, which may be one of the initial images or a subsequently captured image, is warped to produce the reference image of a front view of the planar object. The reference image may be produced by the mobile platform or by, e.g., a server. Other mobile platforms may determine their pose with respect to the planar object using the reference image to perform a multi-user augmented reality application.
摘要:
A reference patch of an unknown environment is generated on the fly for positioning and tracking. The reference patch is generated using a captured image of a planar object with two perpendicular sets of parallel lines. The planar object is detected in the image and axes of the world coordinate system are defined using the vanishing points for the two sets of parallel lines. The camera rotation is recovered based on the defined axes, and the reference patch of at least a portion of the image of the planar object is generated using the recovered camera rotation. The reference patch can then be used for vision based detection and tracking. The planar object may be detected in the image as sets of parallel lines or as a rectangle.
摘要:
A reference patch of an unknown environment is generated on the fly for positioning and tracking. The reference patch is generated using a captured image of a planar object with two perpendicular sets of parallel lines. The planar object is detected in the image and axes of the world coordinate system are defined using the vanishing points for the two sets of parallel lines. The camera rotation is recovered based on the defined axes, and the reference patch of at least a portion of the image of the planar object is generated using the recovered camera rotation. The reference patch can then be used for vision based detection and tracking. The planar object may be detected in the image as sets of parallel lines or as a rectangle.
摘要:
A multi-user augmented reality (AR) system operates without a previously acquired common reference by generating a reference image on the fly. The reference image is produced by capturing at least two images of a planar object and using the images to determine a pose (position and orientation) of a first mobile platform with respect to the planar object. Based on the orientation of the mobile platform, an image of the planar object, which may be one of the initial images or a subsequently captured image, is warped to produce the reference image of a front view of the planar object. The reference image may be produced by the mobile platform or by, e.g., a server. Other mobile platforms may determine their pose with respect to the planar object using the reference image to perform a multi-user augmented reality application.
摘要:
A mobile platform efficiently processes sensor data, including image data, using distributed processing in which latency sensitive operations are performed on the mobile platform, while latency insensitive, but computationally intensive operations are performed on a remote server. The mobile platform acquires sensor data, such as image data, and determines whether there is a trigger event to transmit the sensor data to the server. The trigger event may be a change in the sensor data relative to previously acquired sensor data, e.g., a scene change in an image. When a change is present, the sensor data may be transmitted to the server for processing. The server processes the sensor data and returns information related to the sensor data, such as identification of an object in an image or a reference image or model. The mobile platform may then perform reference based tracking using the identified object or reference image or model.
摘要:
Entropy based image segmentation determines entropy values for pixels in an image based on intensity or edge orientation. One or more threshold values are determined as a fraction of the entropy distribution over the image. For example, high and/or low thresholds may be generated to identify regions in the image associated with trees or sky, respectively. The entropy values are compared to the threshold(s) from which regions within the image can be segmented. Intensity based entropy has no structural information, and thus, proximity based clustering and pruning of the entropy points is performed. A mask may be applied to the segmented regions to remove the regions from the image, which is useful in, e.g., objection recognition processes. Additionally, separate buildings may be identified and segmented using edge orientation entropy with clustering and pruning.
摘要:
A mobile platform efficiently processes sensor data, including image data, using distributed processing in which latency sensitive operations are performed on the mobile platform, while latency insensitive, but computationally intensive operations are performed on a remote server. The mobile platform acquires sensor data, such as image data, and determines whether there is a trigger event to transmit the sensor data to the server. The trigger event may be a change in the sensor data relative to previously acquired sensor data, e.g., a scene change in an image. When a change is present, the sensor data may be transmitted to the server for processing. The server processes the sensor data and returns information related to the sensor data, such as identification of an object in an image or a reference image or model. The mobile platform may then perform reference based tracking using the identified object or reference image or model.
摘要:
A mobile station determines its orientation using an image of an object produced by the mobile station and a top view of that object obtained from an online server. The mobile station image is analyzed to identify lines on the object and to determine the direction of the lines with respect to the mobile station. The top view image, which may be a satellite image, is also analyzed to identify lines on the object that correspond to the lines identified in the mobile station image. The direction of the lines in the top view image are compared to the direction of lines in the mobile station image and based on their relative orientation the orientation of the mobile station may be determined. For example, the difference between the preliminary and corrected orientations may be stored as a calibration factor and used to correct subsequent orientation measurements from orientation sensors.