摘要:
In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.
摘要:
A system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. The documents are represented as a document term matrix M containing one or more document term-weight vectors d, which may be term-frequency (tf) vectors or term-frequency inverse-document-frequency (tf-idf) vectors. This matrix is subjected to a truncated singular value decomposition. The resulting transform matrix U can be used to project a query term-weight vector q into the reduced N-dimensional space, followed by its expansion back into the full vector space using the inverse of U.To perform a search, the similarity of qexpanded is measured relative to each candidate document vector in this space. Exemplary similarity functions are dot product and cosine similarity. Keywords are selected with the highest values in qexpanded that are also comprised in at least one document. Matching keywords from the query may be highlighted in the search results.
摘要:
In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.
摘要:
Embodiments of the present invention provide the ability to navigate, view, and manipulate a collection of digital images utilizing a GUI that has the familiar context of a calendar. Graphical objects representative of digital images are displayed within a particular day displayed in a calendar-based GUI. A user may group digital images into groups, modify the date with which a digital image is associated and perform various other manipulations using embodiments of a calendar-based GUI.
摘要:
The invention displays video search results in a form that makes it easy for users to determine which results are truly relevant. Each story returned as a search result is visualized as a collage of keyframes from the story's shots. The selected keyframes and their sizes depend on the corresponding shots' respective relevance. Shot relevance depends on the search retrieval score of the shot and, in some embodiments, also depends on the search retrieval score of the shot's parent story. Once areas have been determined, the keyframes are scaled and/or cropped to fit into the area. In one embodiment, users can mark one or more shots as being relevant to the search. In one embodiment, a timeline is created and displayed with one or more neighbor stories that are each part of the video and which are closest in time of creation to the selected story.
摘要:
The invention segments detector input according to the time and the level of activity in different geographic regions of a locality. In one embodiment of the invention the detector input is comprised of video stream from one or more cameras to identify activity in the video. In one embodiment of the invention the detector input is comprised of sensor outputs such as RFID, pressure plates, etc. Various embodiments of the invention include identifying boundaries based on the level of activity. In embodiments of the invention, the boundaries can be used to select time dimensions. In one embodiment, by recognizing time dimensions with distinctive activity patterns, systems can better present overviews of activity over time.
摘要:
A computer-based method is provided for enabling navigation of video using a keyframe-based video browser on a display device with a limited screen size, for a video segmented into video shots. The video shots are clustered by similarity, while temporal order of the video shots is maintained. A hierarchically organized navigation tree is produced for the clusters of video shots, while the path lengths of the tree are minimized.
摘要:
An interface and display of video from multiple fixed-position cameras is provided. A main video stream captured by a camera is selected to be the main video stream and is displayed to the interface. Video streams captured by the set of cameras and the main camera that are temporally related to the displayed main video stream are selected, including playback positions from one or more of a first segment of time in each of their respective video streams at the time of the main video stream, a second segment of time in each of their respective video streams prior to the time of the main video stream, and a third segment of time in each of their respective video streams after the time of the main video stream. The selected video streams are displayed to the interface in temporal relation to the display of the main video stream.
摘要:
A hypervideo summary comprised of multiple levels of related content and appropriate navigational links can be automatically generated from a media file such as a linear video. A number of algorithms and selection criteria can be used to modify how such a summary is generated. Viewers of an automatically-generated hypervideo summary can interactively select the amount of detail displayed for each portion of the summary. This selection can be done by following explicit navigational links, or by changing between media channels that are mapped to the various levels of related content.This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
A hypervideo summary comprised of multiple levels of related content and appropriate navigational links can be automatically generated from a media file such as a linear video. A number of algorithms and selection criteria can be used to modify how such a summary is generated. Viewers of an automatically-generated hypervideo summary can interactively select the amount of detail displayed for each portion of the summary. This selection can be done by following explicit navigational links, or by changing between media channels that are mapped to the various levels of related content.