摘要:
A method for representing an input image, the method including the steps of applying a trained neural network (NN) on the input image, selecting a plurality of feature maps, determining a location of each of the feature maps in an image space of the input image, defining a plurality of interest points of the input image, representing the input image as a graph according to the interest points and geometric relations between the interest points, and employing the graph for performing a visual task, the graph including a plurality of vertices and edges, and maintaining the data respective of the geometric relations, the feature maps being selected of an output of at least one selected layer of the trained NN according to values attributed to the feature maps by the trained NN, the interest points of the input image being defined based on the locations corresponding to the feature maps.
摘要:
A method for representing an input image includes the steps of applying a trained neural network on the input image, selecting a plurality of feature maps, determining a location of each of the plurality of feature maps in an image space of the input image, defining a plurality of interest points of the input image, and employing the plurality of interest points for representing the input image for performing a visual task. The plurality of feature maps are selected of an output of at least one selected layer of the trained neural network according to values attributed to the plurality of feature maps by the trained neural network. The plurality of interest points of the input image are defined based on the locations corresponding to the plurality of feature maps.
摘要:
A method for producing a graph representation of an input image, the method including the procedures of applying convolutional layers of a trained convolutional neural network on the input image, defining a receptive field of a last convolutional layer of the trained convolutional neural network as a vertex of the graph representation, defining a vector of a three dimensional output matrix of the last convolutional layer that is mapped to the receptive field as a descriptor for the vertex and determining an edge between a pair of vertices of the graph representation by applying an operator on a pair of descriptors respective of the pair of vertices.
摘要:
A method for determining image similarity as a function of weighted descriptor similarities, including the procedures of feeding a query image to a network including a plurality of layers and defining an output of each of the layers as a descriptor of the query image, feeding a reference image to the network and defining an output of each of the layers as a descriptor of the reference image, determining a descriptor similarity score for respective descriptors that were produced by the same layer of the network fed the query image and the reference image, assigning a respective weight to each descriptor similarity score and defining an image similarity between the query image and the reference image as a function of the weighted descriptor similarity scores.