Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for object detection are disclosed. Methods can include, for each of a plurality of locations in one or more positive images, image filters are identified, each image filter representing visual features of a location in a positive image (e.g., an image that includes a particular object). Positive location feature scores and negative location feature scores are determined for locations within images. A positive location feature score is based on a similarity between the image filter and feature values for a positive image. A negative location feature score is determined based on a similarity between the image filter and feature values for a negative image. A distinctive location is identified based on the positive and negative location feature scores, and distinguishing feature values for identifying the particular object are identified for the distinctive location.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting objects in images. One of the methods includes receiving an input image. A full object mask is generated by providing the input image to a first deep neural network object detector that produces a full object mask for an object of a particular object type depicted in the input image. A partial object mask is generated by providing the input image to a second deep neural network object detector that produces a partial object mask for a portion of the object of the particular object type depicted in the input image. A bounding box is determined for the object in the image using the full object mask and the partial object mask.
Abstract:
Implementations relate to techniques for classifying images. Some techniques utilize weights associated with local descriptors to classify images. Some techniques utilize visual phrase matching to classify images. The resulting image classifications can be used in part to assist in internet searches.