摘要:
A system for determining a key frame of an image sequence wherein the key frame includes the clearest image of the face of a person from the image sequence, the system included an image input means for receiving the image sequence of the person and a processing means for identifying the face of the person in each frame of the image sequence and then determining which frame is the clearest image of the persons face.
摘要:
An ImageWiki architecture is used to generate an image-based web page for an image on the Web. An ImageWiki page may be created automatically or individually, by a user of the Web. Additionally, a user may revise existing ImageWiki pages to update a particular page or correct an incorrect or misleading previous entry. The ImageWiki application indexes images located on the Web. Once the images are indexed, the information related to the images is mined and extracted from various sources of web data. Finally, an ImageWiki page or web page is generated for each image. The resulting ImageWiki page contains the image as well as the aggregated information relating to the image.
摘要:
A method and system for generating an entirely well-focused image of a three-dimensional scene. The method comprises the steps of a) learning a prediction model including at least a focal depth probability density function (PDF), h(k), for all depth values k, from historical tiles of the scene; b) predicting the possible focal surfaces in subsequent tiles of the scene by applying the prediction model; c) for each value of k, examining h(k) such that if h(k) is below a first threshold, no image is acquired at the depth k′ for said one tile; and if h(k) is above or equal to a first threshold, one or more images are acquired in a depth range around said value of k for said one tile; and d) processing the acquired images to generate a pixel focus map for said one tile.
摘要:
A method of annotating footage that includes a structured text broadcast stream, a video stream and an audio stream, the method includes the steps of: extracting directly or indirectly one or more keywords and/or features from at least said structured text broadcast streams, temporally annotating said footage with said keywords and/or features analysing temporally adjacent annotated keywords and/or features to determine information about one or more events within said footage. Also provided are: a data store for storing video footage, a method of generation of a personalised video summary, a system for annotating footage and a system for generation of a personalised video summary.
摘要:
An ImageWiki architecture is used to generate an image-based web page for an image on the Web. An ImageWiki page may be created automatically or individually, by a user of the Web. Additionally, a user may revise existing ImageWiki pages to update a particular page or correct an incorrect or misleading previous entry. The ImageWiki application indexes images located on the Web. Once the images are indexed, the information related to the images is mined and extracted from various sources of web data. Finally, an ImageWiki page or web page is generated for each image. The resulting ImageWiki page contains the image as well as the aggregated information relating to the image.
摘要:
Adaptive image retrieval image allows retrieval of images that are more likely to reflect a current trend of user preferences and/or interests, and therefore can provide relevant results to an image search. Adaptive image retrieval includes receiving image query log data from one or more clients, and updating a codebook of features based on the received query log data. The image query log data includes images that have been queried by the one or more clients within a predetermined period of time.
摘要:
A method for use in indexing video footage, the video footage comprising an image signal and a corresponding audio signal relating to the image signals, the method comprising extracting audio features from the audio signal of the video footage and visual features from the image signal of the video footage; comparing the extracted audio and visual features with predetermined audio and visual keywords; identifying the audio and visual keywords associated with the video footage based on the comparison of the extracted video and visual features with the predetermine audio and visual keywords; and determining the presence of events in the video footage based on the audio and visual keywords associated with the video footage.
摘要:
A hierarchical sparse codebook allows efficient search and comparison of images in image retrieval. The hierarchical sparse codebook includes multiple levels and allows a gradual determination/classification of an image feature of an image into one or more groups or nodes by traversing the image feature through one or more paths to the one or more groups or nodes of the codebook. The image feature is compared with a subset of nodes at each level of the codebook, thereby reducing processing time.
摘要:
A method for visualizing multimedia objects assigns a feature vector to each multimedia object. The feature vector of each multimedia object is reduced to a location vector having a dimensionality of a display device. A cost function is evaluated to determine an optimal location vector for each multimedia object, and each multimedia object is displayed on a display device according to the optimal location vector. The reducing can use principle component analysis. In addition, a relevance score can be determined for each displayed multimedia object, and the multimedia objects can than be visually enhanced according to the relevance score.
摘要:
A method and System for identifying repeat clip instances in video data. The method comprises partitioning the video data into ordered video units utilising content-based keyframe sampling, wherein each video unit comprises a sequence interval between two consecutive keyframes; creating a fingerprint for each video unit; grouping at least two consecutive video units into one time-indexed video segment; and identifying the repeat clip instances based on correlation of the video segments. The method can be used for both discovering unknown repeat video clips and identifying instances of known repeat video clips automatically. The method can be used to identify short repeat video clips from less than a second long to a few minutes, such as tv station logos, program logos, tv commercials which are widely used in news video and other daily broadcasting programs.