摘要:
An in-place learning algorithm is provided for a multi-layer developmental network. The algorithm includes: defining a sample space as a plurality of cells fully connected to a common input; dividing the sample space into mutually non-overlapping regions, where each region is a represented by a neuron having a single feature vector; and estimating a feature vector of a given neuron by an amnesic average of an input vector weighted by a response of the given neuron, where amnesic is a recursive computation of the input vector weighted by the response such that the direction of the feature vector and the variance of signal in the region projected onto the feature vector are both recursively estimated with plasticity scheduling.
摘要:
Image features are generated by performing wavelet transformations at sample points on images stored in electronic form. Multiple wavelet transformations at a point are combined to form an image feature vector. A prototypical set of feature vectors, or atoms, is derived from the set of feature vectors to form an “atomic vocabulary.” The prototypical feature vectors are derived using a vector quantization method, e.g., using neural network self-organization techniques, in which a vector quantization network is also generated. The atomic vocabulary is used to define new images. Meaning is established between atoms in the atomic vocabulary. High-dimensional context vectors are assigned to each atom. The context vectors are then trained as a function of the proximity and co-occurrence of each atom to other atoms in the image. After training, the context vectors associated with the atoms that comprise an image are combined to form a summary vector for the image. Images are retrieved using a number of query methods, e.g., images, image portions, vocabulary atoms, index terms. The user's query is converted into a query context vector. A dot product is calculated between the query vector and the summary vectors to locate images having the closest meaning. The invention is also applicable to video or temporally related images, and can also be used in conjunction with other context vector data domains such as text or audio, thereby linking images to such data domains.
摘要:
The model is a third generation neurosimulator. It has a plurality of areas whose functions can be identified with the functions of the areas of the dorsal and ventral path of the visual cortex of the human brain. Feedback is provided between different areas during processing. There is additionally provided competition for attention between different features and/or different spatial regions. The model is very flexibly suitable for image processing. It simulates natural human image processing and explains many experimentally observed phenomena.
摘要:
A method for determining whether a fuzzy symbol matches a predetermined reference pattern by generating membership functions that collectively represent a reference pattern having identifying features; sampling the fuzzy symbol to generate an input pattern representative of the fuzzy symbol; transforming the input pattern to generate an inverted input pattern; comparing the input pattern with a first membership function to determine a first quantity of identifying features of the reference pattern that are present in the fuzzy symbol; comparing the inverted input pattern with a second membership function to determine a second quantity of identifying features of the reference pattern that are present in the fuzzy symbol; and determining that the fuzzy symbol matches the reference pattern if the first and second quantities are sufficiently high.
摘要:
Methods and apparatus for tracking and discerning objects using their saliency. In one embodiment of the present disclosure, the tracking of objects is based on a combination of object saliency and additional sources of signal about object identity. Under certain simplifying assumptions, the present disclosure allows for robust tracking of simple objects with limited processing resources. In one or more variants, efficient implementation of the methods described allow sensors (e.g., cameras) to be used on board a robot (or autonomous vehicle) on a mobile determining platform, such as to capture images to determine the presence and/or identity of salient objects. Such determination of salient objects allow for e.g., adjustments to vehicle or other moving object trajectory.
摘要:
Disclosed systems and methods relate to remote sensing, deep learning, and object detection. Some embodiments relate to machine learning for object detection, which includes, for example, identifying a class of pixel in a target image and generating a label image based on a parameter set. Other embodiments relate to machine learning for geometry extraction, which includes, for example, determining heights of one or more regions in a target image and determining a geometric object property in a target image. Yet other embodiments relate to machine learning for alignment, which includes, for example, aligning images via direct or indirect estimation of transformation parameters.
摘要:
An inference system performs inference, such as object recognition, based on sensory inputs generated by sensors and control information associated with the sensory inputs. The sensory inputs describe one or more features of the objects. The control information describes movement of the sensors or known locations of the sensors relative to a reference point. For a particular object, an inference system learns a set of object-location representations of the object. An object-location representation is a unique characterization of an object-centric location relative to the particular object. The inference system also learns a set of feature-location representations associated with the object-location representation that indicate presence of features at the corresponding object-location pair. The inference system can perform inference on an unknown object by identifying candidate object-location representations consistent with feature-location representations observed from the sensory input data and control information.
摘要:
Apparatus and methods for detecting and utilizing saliency in digital images. In one implementation, salient objects may be detected based on analysis of pixel characteristics. Least frequently occurring pixel values may be deemed as salient. Pixel values in an image may be compared to a reference. Color distance may be determined based on a difference between reference color and pixel color. Individual image channels may be scaled when determining saliency in a multi-channel image. Areas of high saliency may be analyzed to determine object position, shape, and/or color. Multiple saliency maps may be additively or multiplicative combined in order to improve detection performance (e.g., reduce number of false positives). Methodologies described herein may enable robust tracking of objects utilizing fewer determination resources. Efficient implementation of the methods described below may allow them to be used for example on board a robot (or autonomous vehicle) or a mobile determining platform.
摘要:
A method of monitoring a blind spot of a monitoring vehicle by using a blind spot monitor is provided. The method includes steps of: the blind spot monitor (a) acquiring a feature map from rear video images, on condition that video images with reference vehicles in the blind spot are acquired, reference boxes for the reference vehicles are created, and the reference boxes are set as proposal boxes; (b) acquiring feature vectors for the proposal boxes on the feature map by pooling, inputting the feature vectors into a fully connected layer, acquiring classification and regression information; and (c) selecting proposal boxes by referring to the classification information, acquiring bounding boxes for the proposal boxes by using the regression information, determining the pose of the monitored vehicle corresponding to each of the bounding boxes, and determining whether a haphazard vehicle is located in the blind spot of the monitoring vehicle.
摘要:
Apparatus and methods for detecting and utilizing saliency in digital images. In one implementation, salient objects may be detected based on analysis of pixel characteristics. Least frequently occurring pixel values may be deemed as salient. Pixel values in an image may be compared to a reference. Color distance may be determined based on a difference between reference color and pixel color. Individual image channels may be scaled when determining saliency in a multi-channel image. Areas of high saliency may be analyzed to determine object position, shape, and/or color. Multiple saliency maps may be additively or multiplicative combined in order to improve detection performance (e.g., reduce number of false positives). Methodologies described herein may enable robust tracking of objects utilizing fewer determination resources. Efficient implementation of the methods described below may allow them to be used for example on board a robot (or autonomous vehicle) or a mobile determining platform.