Abstract:
A computer-implemented method for detecting objects by using subcategory-aware convolutional neural networks (CNNs) is presented. The method includes generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information, and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs object category classification, subcategory classification, and bounding box regression. The image is an image pyramid used as input to the RPN and the ODN. The RPN and the ODN each include a feature extrapolating layer to detect object categories with scale variations among the objects.
Abstract:
Systems and methods for object detection by receiving an image; segmenting the image and identifying candidate bounding boxes which may contain an object; for each candidate bounding box, dividing the box into overlapped small patches, and extracting dense features from the patches; during a training phase, applying a learning process to learn one or more discriminative classification models to classify negative boxes and positive boxes; and during an operational phase, for a new box generated from the image, applying the learned classification model to classify whether the box contains an object.
Abstract:
Systems and methods are disclosed for deep learning and classifying images of objects by receiving images of objects for training or classification of the objects; producing fine-grained labels of the objects; providing object images to a multi-class convolutional neural network (CNN) having a softmax layer and a final fully connected layer to explicitly model bipartite-graph labels (BGLs); and optimizing the CNN with global back-propagation.
Abstract:
Systems and methods are disclosed for detecting an object in an image by determining convolutional neural network responses on the image; mapping the responses back to their spatial locations in the image; and constructing features densely extract shift invariant activations of a convolutional neural network to produce dense features for the image.
Abstract:
Systems and methods are disclosed for object detection by receiving an image and extracting features therefrom; applying a learning process to determine sub-regions and select predetermined pooling regions; and performing selective max-pooling to choose one or more feature regions without noises.
Abstract:
Systems and methods are disclosed for object detection by receiving an image; segmenting the image; extracting features from the image; and performing a dimension-wise spatial layout selection to pick up dimensions inside a discriminative spatial region for classification.
Abstract:
An image retrieval method includes learning multiple object category classifiers with a processor offline and generating classifications scores of images as the semantic attributes; performing vocabulary tree based image retrieval using local features with semantic-aware co-indexing to jointly embed two distinct cues offline for near-duplicate image retrieval; and identifying top similar or dissimilar images using multiple semantic attributes.
Abstract:
Systems and methods are disclosed for object detection by receiving an image and extracting features therefrom; applying a learning process to determine sub-regions and select predetermined pooling regions; and performing selective max-pooling to choose one or more feature regions without noises.
Abstract:
A computer-implemented method for training a convolutional neural network (CNN) is presented. The method includes receiving regions of interest from an image, generating one or more convolutional layers from the image, each of the one or more convolutional layers having at least one convolutional feature within a region of interest, applying at least one cascaded rejection classifier to the regions of interest to generate a subset of the regions of interest, and applying scale dependent pooling to convolutional features within the subset to determine a likelihood of an object category.
Abstract:
An object detector includes a bottom-up object hypotheses generation unit; a top-down object search with supervised descent unit; and an object re-localization unit with a localization model.