摘要:
A technique for image retrieval is proposed based on a probabilistic non-parametric geometric verification which includes checking the intersection property within keypoint quadruplets of images. Additional checks may be performed for convexity and alignment of keypoints and inlier ratio. The technique yields better performance than standard techniques based on geometric models for cases of non-planar deformations as in 3D objects with larges changes in venture point, such as buildings, or lenses with large geometrical deformations, such as fisheyes. The technique may also be used to detect outliers which may be discarded, prior to the calculation of a geometric model, in order to improve the performance of such model.
摘要:
A method for creating a menu (T) for audio content, e.g. music tracks, uses means (CL) for classifying the audio content into clusters (C1,...,C3) of similar tracks, the similarity referring to physical, perceptual and psychological features of the tracks. The method comprises a means (R) for automatic representative selection for clusters (C1,...,C3), and a means (X) for generating thumbnail representations of audio tracks. Said audio thumbnails are associated to the menu (T). Advantageously, no graphical or textual display is required for navigation, since the user may listen to an audio thumbnail and then enter a command, e.g. by pressing an appropriate button, for either listening to the related track or a similar track belonging to the same cluster, or listening to another type of music by selecting another thumbnail representing another cluster.
摘要:
The method involves classifying an audio track into clusters, where classification is performed based on a characteristic parameter of the track. The track that is being a representative for the cluster is selected, where selection is performed based on the parameter of track and of the other tracks of cluster. A reproducible audio extract is generated from the representative track and the extract is associated to a menu list (T). An independent claim is also included for an apparatus for creating or accessing a menu for audio content stored on a storage unit.
摘要:
The invention relates to a method for determining identifiers associated with segments of a document. Each segment consists of a series of individual elements such as images or sound sequences. Each segment of the document is subdivided into a determined number of portions comprising the same number of individual elements. An individual element is extracted from the most central portion of each segment and associated with the segment as identifier. The invention also relates to the receiver capable of implementing the method.
摘要:
Two of the primaries G1', G2' are distributed in the green part of the visible spectrum, one G1' mainly distributed between 500 and 540 nm, the other one G2' mainly distributed between 540 nm and 585 nm. Thank to the specific spectral definition of the primaries B', G1', G2', R1' and R2', colors can be easily displayed that are metameric for most of the viewers, allowing notably anti-camcorder defeat in movie theatres without prejudice for the quality of display for most of the viewers.
摘要:
E.g. in multimedia asset management systems, classification algorithms are used to classify multimedia data into different data classes, for instance such as indoor, landscape and city scenes, to enhance database (MDB) organisation. A problem is the confidence in classification performance. According to the invention, confidence measure values (c) about data classification results are calculated (MMDCL) in addition and can be included in user interaction. The confidence measure values are combined from a first part value ( c 1 ) that is related to the confidence in classification and from a second part value ( c 2 ) that is related to the confidence in said class models.
摘要:
The invention relates to a method for bounding an object in a video sequence F x,y,t . The method comprises obtaining a subset of pixels located in the object to annotate, in each frame of the video sequence. Spatio-temporal slicing is performed on the video sequence F x,y,t , centered on the obtained subsets of pixels, resulting in a first image F y,t obtained by an horizontal concatenation of first slices, comprising the obtained subsets of pixels, and resulting in a second image F x,t obtained by a vertical concatenation of second slices. A trajectory of the obtained subsets of pixels is displayed on both the first F y,t and second F x,t image. By using contour detection methods, a first and a second boundary are obtained on both the first F y,t and second F x,t image , around the trajectory of the obtained subsets of pixels. A bounding form around the object to annotate is obtained out of four points in each frame of the video sequence, wherein the coordinates of the four points of a frame t are obtained from the coordinates of the points located in the first and second boundary of the first and second image for that frame t. Advantageously, the bounding form is a rectangle drawn out of the four points, or an ellipse inscribed in that rectangle, or an ellipse comprising the four points.
摘要:
A method for pairwise matching features between two pieces of content is disclosed. It builds upon the idea that the best backward matching pairs generally belong to the k best forward matching pairs. Consequently, the pairwise matching method between a first set of features in a first piece of content and a second set of features in a second piece of content comprises a first step of forward matching resulting in forward matched pairs, wherein for each feature of the first set of features, a plurality of candidate forward matched pairs are selected, and a second step of backward matching applied in said plurality of candidate forward matched pairs for rejecting the forward matched pairs that do not belong to backward matched pairs.