Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to detect object in images. One of the methods includes receiving a training image and object location data for the training image; providing the training image to a neural network and obtaining bounding box data for the training image from the neural network, wherein the bounding box data comprises data defining a plurality of candidate bounding boxes in the training image and a respective confidence score for each candidate bounding box in the training image; determining an optimal set of assignments using the object location data for the training image and the bounding box data for the training image, wherein the optimal set of assignments assigns a respective candidate bounding box to each of the object locations; and training the neural network on the training image using the optimal set of assignments.
Abstract:
An embodiment provides for enabling retrieval of a collection of captured images that form at least a portion of a library of images. For each image in the collection, a captured image may be analyzed to recognize information from image data contained in the captured image, and an index may be generated, where the index data is based on the recognized information. Using the index, functionality such as search and retrieval is enabled. Various recognition techniques, including those that use the face, clothing, apparel, and combinations of characteristics may be utilized. Recognition may be performed on, among other things, persons and text carried on objects.
Abstract:
A system and method are presented for estimating the orientation of a panoramic camera mounted on a vehicle relative to the vehicle coordinate frame. An initial pose estimate of the vehicle is determined based on global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. Image data from images captured by the camera is processed to obtain one or more tracks, each track including a sequence of matched feature points stemming from a same three-dimensional location. A correction parameter determined from the initial pose estimate and tracks can then be used to correct the orientations of the images captured by the camera. The correction parameter can be optimized by deriving a correction parameter for each of a multitude of distinct subsequences of one or more runs. Statistical analysis can be performed on the determined correction parameters to produce robust estimates.
Abstract:
A hierarchy of clusters is determined, where each leave of the hierarchy corresponds to one of the images in a group, and each cluster in the hierarchy identifies images in the group that are deemed similar to one another. The hierarchy identifies a similarity between each of the plurality of clusters.
Abstract:
Methods and systems permit automatic matching of videos with images from dense image-based geographic information systems. In some embodiments, video data including image frames is accessed. The video data may be segmented to determine a first image frame of a segment of the video data. Data representing information from the first image frame may be automatically compared with data representing information from a plurality of image frames of an image-based geographic information data system. Such a comparison may, for example, involve a search for a best match between geometric features, histograms, color data, texture data, etc. of the compared images. Based on the automatic comparing, an association between the video and one or more images of the image-based geographic information data system may be generated. The association may represent a geographic correlation between selected images of the system and the video data.
Abstract:
Initial trajectory data that provides an initial description of an approximate trajectory of a device during a time period, and correction data that indicates a location of the device outside the approximate trajectory of the device within the time period, are received. A modified trajectory data that provides a modified description of a corrected trajectory of the device is generated. In particular, terms to express (i) location constraints that limit deformation of the approximate trajectory of the device and (ii) a modification constraint that limits departure of the corrected trajectory of the device from the location indicated by the correction data are generated, and the initial description of the approximate trajectory of the device is modified using the generated terms.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to detect object in images. One of the methods includes receiving a training image and object location data for the training image; providing the training image to a neural network and obtaining bounding box data for the training image from the neural network, wherein the bounding box data comprises data defining a plurality of candidate bounding boxes in the training image and a respective confidence score for each candidate bounding box in the training image; determining an optimal set of assignments using the object location data for the training image and the bounding box data for the training image, wherein the optimal set of assignments assigns a respective candidate bounding box to each of the object locations; and training the neural network on the training image using the optimal set of assignments.
Abstract:
A hierarchy of clusters is determined, where each leave of the hierarchy corresponds to one of the images in a group, and each cluster in the hierarchy identifies images in the group that are deemed similar to one another. The hierarchy identifies a similarity between each of the plurality of clusters.
Abstract:
An embodiment provides for enabling retrieval of a collection of captured images that form at least a portion of a library of images. For each image in the collection, a captured image may be analyzed to recognize information from image data contained in the captured image, and an index may be generated, where the index data is based on the recognized information. Using the index, functionality such as search and retrieval is enabled. Various recognition techniques, including those that use the face, clothing, apparel, and combinations of characteristics may be utilized. Recognition may be performed on, among other things, persons and text carried on objects.
Abstract:
Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.