摘要:
Embodiments of the present invention describe a collaborative framework for mining of surveillance videos to detect abnormal events, which introduces a two-stage training process to alleviate the high false alarm problem. In the first stage, unsupervised clustering is performed on the segments of the video streams and a set of abnormal events are combined with user feedback to generate a clean training set. In the second stage, the clean training set is used to train a more precise model for the analysis of normal events and the motion detection results from multiple cameras can be cross validated and combined. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention introduce a novel technique to analyze and monitor video streams captured from multiple cameras. It highlights the foreground region of the video streams via local alpha blending and displays the videos in an immersive 3-D environment. The spatial arrangement of the displays can be generated by multi-dimensional scaling of the amount of simultaneous motion across different video streams. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention introduce a user navigation interface that allows a user to monitor/navigate video streams captured from multiple cameras. It integrates video streams from multiple cameras with the semantic layout into a 3-D immersive environment and renders the video streams in multiple displays on a user navigation interface. It conveys the spatial distribution of the cameras as well as their fields of view and allows a user to navigate freely or switch among preset views. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention describe a collaborative framework for mining of surveillance videos to detect abnormal events, which introduces a two-stage training process to alleviate the high false alarm problem. In the first stage, unsupervised clustering is performed on the segments of the video streams and a set of abnormal events are combined with user feedback to generate a clean training set. In the second stage, the clean training set is used to train a more precise model for the analysis of normal events and the motion detection results from multiple cameras can be cross validated and combined. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention introduce a user navigation interface that allows a user to monitor/navigate video streams captured from multiple cameras. It integrates video streams from multiple cameras with the semantic layout into a 3-D immersive environment and renders the video streams in multiple displays on a user navigation interface. It conveys the spatial distribution of the cameras as well as their fields of view and allows a user to navigate freely or switch among preset views. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention introduce a novel technique to analyze and monitor video streams captured from multiple cameras. It highlights the foreground region of the video streams via local alpha blending and displays the videos in an immersive 3-D environment. The spatial arrangement of the displays can be generated by multi-dimensional scaling of the amount of simultaneous motion across different video streams. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Techniques for generating a storyboard are disclosed. In one embodiment of the invention the storyboard is comprised of videos from one or more cameras based on the identification of activity in the video. Various embodiments of the invention include an assessment of the importance of the activity, the creation of a storyboard presentation based on importance and interaction techniques for seeing more details or alternate views of the video. In one embodiment, motion detection is used to determine activity in one or more synchronized video streams. Periods of activity are recognized and assigned importance assessments based on the activity, important locations in the video streams, and events from other sensors. In different embodiments, the interface consists of a storyboard and a map.
摘要:
Techniques for generating timelines and event logs from one or more fixed-position cameras based on the identification of activity in the video are presented. Various embodiments of the invention include an assessment of the importance of the activity, the creation of a timeline identifying events of interest, and interaction techniques for seeing more details of an event or alternate views of the video. In one embodiment, motion detection is used to determine activity in one or more synchronized video streams. In another embodiment, events are determined based on periods of activity and assigned importance assessments based on the activity, important locations in the video streams, and events from other sensors. In different embodiments, the interface consists of a timeline, event log, and map.
摘要:
Embedded Media Markers (EMM) are optical-filter-like overlaid marks that can be printed on a part of a paper document that signify the existence of media associated with that part of the document. Users take a picture of an EMM-signified document patch using a cell phone camera, and the media associated with the EMM-signified document location is retrieved and displayed on the phone. Unlike bar codes, EMMs are nearly transparent and thus do not interfere with the document contents. Retrieval of media associated with an EMM is based on image local features of the captured EMM-signified document patch. A semi-automatically technique places an EMM at a location in a document, in such a way that the EMM encompasses sufficient identification features with minimal disturbance to the original document.
摘要:
An adaptive, interactive visual workspace for viewing groups of files based on their relationships. Relationships of files are visualized using iterative refinement of categories through a direct-manipulation graph-based layout. The visual workspace starts with a fully connected graph linking thumbnail images of related files that is then partitioned into neighborhoods in response to a user creating file stacks corresponding to different categories. Normalized spring lengths improve the overall quality of the layout. Different modes for membership in neighborhoods avoid confusing motion of files and help a user to manually organize the workspace. Additionally, retrieved files can be added without having to significantly move the previous files. Different visualization techniques indicate which files are related to each other. Different zoom rates are used for file location, and surrogate sizes allow users to increase the separation between files while still increasing the surrogate sizes.