Abstract:
A system for automatically acquiring high-resolution images by steering a pan-tilt-zoom camera at targets detected in a fixed camera view is provided. The system uses automatic or manual calibration between multiple cameras. Using automatic calibration, the homography between the cameras in a home position is estimated together with the effects of pan and tilt controls and the expected height of a person in the image. These calibrations are chained together to steer a slave camera. The manual calibration scheme steers a camera to the desired region of interest and calculates the pan, tilt and zoom parameters accordingly.
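The chaining of calibrations described above might be sketched as follows. This is a minimal illustration, not the patented method: the function names, the linear degrees-per-pixel pan/tilt model, and the choice to aim at the middle of the person are all assumptions made for the example.

```python
from typing import Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]


def apply_homography(H: Matrix3, x: float, y: float) -> Tuple[float, float]:
    """Map a pixel (x, y) through a 3x3 homography using homogeneous coordinates."""
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this homography")
    return u / w, v / w


def pan_tilt_for_target(
    H: Matrix3,
    foot_xy: Tuple[float, float],
    person_height_px: float,
    home_center: Tuple[float, float],
    deg_per_px: Tuple[float, float],
) -> Tuple[float, float]:
    """Chain the calibrations: height offset, then homography, then pan/tilt."""
    x, y = foot_xy
    # Aim at the middle of the person; image y grows downward (assumption).
    target_y = y - person_height_px / 2.0
    u, v = apply_homography(H, x, target_y)
    # Assumed linear model: degrees of pan/tilt per pixel of offset
    # from the center of the slave camera's home view.
    pan = (u - home_center[0]) * deg_per_px[0]
    tilt = (v - home_center[1]) * deg_per_px[1]
    return pan, tilt
```

A real pan-tilt unit is unlikely to be exactly linear in pixel offset, so the last step stands in for whatever pan/tilt response the calibration actually estimates.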
Abstract:
The present invention includes a method, system, and program product for detecting an event that includes receiving at least one data input stream from one or more sensors, selecting a data input stream from one of the one or more sensors, recording the data input stream on a recordable medium, specifying a rule comprising an event in the data input stream, and detecting at least one event in the data input stream based upon the rule.
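As a rough illustration of the steps listed above (select a stream, record it, specify a rule, detect events), consider the sketch below. The `Rule` class, the numeric samples, and the list used as the "recordable medium" are hypothetical stand-ins chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class Rule:
    """A named rule; the predicate decides whether a sample is an event."""
    name: str
    predicate: Callable[[float], bool]


def detect_events(
    stream: Iterable[float], rule: Rule, recording: List[float]
) -> List[Tuple[int, str]]:
    """Record every sample and return (index, rule name) for each match."""
    events = []
    for index, value in enumerate(stream):
        recording.append(value)  # stand-in for writing to a recordable medium
        if rule.predicate(value):
            events.append((index, rule.name))
    return events


# Usage: flag samples above a hypothetical motion-energy threshold.
streams = {"camera_1": [0.1, 0.9, 0.2, 0.8], "door_sensor": [0.0, 0.0]}
recorded: List[float] = []
motion = Rule("motion", lambda v: v > 0.5)
found = detect_events(streams["camera_1"], motion, recorded)
```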
Abstract:
One aspect of the invention is directed to a system and method for video cataloging. The video is cataloged according to predefined or user definable metadata. The metadata is used to index and then retrieve encoded video. Video feature extractors produce metadata tracks from the video information, and each metadata track indexes the stored video information. A feature extractor registration interface registers the video feature extractors, providing for registration with the video engine of new video feature extractors with new metadata tracks.
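One way to picture the registration interface and the per-extractor metadata tracks is the sketch below. The `VideoEngine` class, its method names, and the use of a single brightness number per frame are assumptions made for illustration only.

```python
from typing import Callable, Dict, List

# A feature extractor maps a list of frames to one metadata value per frame.
# Frames are modeled as mean-brightness integers purely for illustration.
Extractor = Callable[[List[int]], List[str]]


class VideoEngine:
    """Hypothetical engine that holds registered feature extractors."""

    def __init__(self) -> None:
        self._extractors: Dict[str, Extractor] = {}

    def register(self, track_name: str, extractor: Extractor) -> None:
        """Register a new extractor under the metadata track it produces."""
        if track_name in self._extractors:
            raise ValueError(f"track already registered: {track_name}")
        self._extractors[track_name] = extractor

    def catalog(self, frames: List[int]) -> Dict[str, List[str]]:
        """Run every registered extractor, yielding one metadata track each."""
        return {name: ex(frames) for name, ex in self._extractors.items()}


def find_frames(tracks: Dict[str, List[str]], track_name: str, value: str) -> List[int]:
    """Use a metadata track as an index: return frame numbers matching value."""
    return [i for i, v in enumerate(tracks[track_name]) if v == value]


engine = VideoEngine()
engine.register("lighting", lambda fs: ["dark" if f < 50 else "bright" for f in fs])
tracks = engine.catalog([10, 200, 30, 90])
dark_frames = find_frames(tracks, "lighting", "dark")
```

New extractors can be added by further `register` calls without changing the engine, which is the property the abstract attributes to the registration interface.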
Abstract:
A system and method for improving the retrieval performance of a query engine in a visual information retrieval (VIR) system by encoding domain-specific knowledge into the VIR system through a visual dictionary or "victionary". The victionary is a dictionary-like information-mapping module that is used to retrieve visual information at a "semantic" level. A VIR system that performs generic image processing is enhanced by adding a query transformation unit and a query expansion unit, i.e., the victionary. With these additional components, a user may present a query either as a text term (such as a keyword or phrase), or as an image (with weights) and execute a "semantic query". During semantic query processing, the victionary-enhanced system transforms the user's original term (or image query) to a set of equivalent queries, and internally executes all the equivalent queries before presenting the results to the user. The victionary unit is responsible for taking the term (or image query) and finding the equivalent feature vectors (and weights). A result processor accumulates the score sheets of each equivalent query and presents a composite ranking that reflects a faithful representation of each equivalent query to the user. The architecture of the victionary-enhanced system is open and extensible, so that one or more domain-specific victionary modules can be plugged into the system. The plug-in architecture of the victionary module is effected through an application programming interface (API).
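The query expansion and score accumulation described above could look roughly like this. The victionary layout (a term mapped to weighted feature vectors), the similarity measure, and the weighted-sum accumulation are assumptions for the sketch; the abstract does not specify them.

```python
from typing import Dict, List, Tuple

Vector = Tuple[float, ...]
# Hypothetical victionary layout: term -> list of (feature vector, weight).
Victionary = Dict[str, List[Tuple[Vector, float]]]


def similarity(a: Vector, b: Vector) -> float:
    """Negative squared Euclidean distance, so closer vectors score higher."""
    return -sum((x - y) ** 2 for x, y in zip(a, b))


def semantic_query(
    term: str, victionary: Victionary, images: Dict[str, Vector]
) -> List[str]:
    """Expand a term into equivalent queries and return a composite ranking."""
    equivalents = victionary.get(term, [])
    if not equivalents:
        return []
    totals = {name: 0.0 for name in images}
    for vector, weight in equivalents:
        # Each equivalent query produces a score sheet over all images;
        # the result processor is modeled here as a weighted sum.
        for name, features in images.items():
            totals[name] += weight * similarity(vector, features)
    return sorted(totals, key=lambda name: totals[name], reverse=True)


victionary: Victionary = {"sunset": [((1.0, 0.0), 0.7), ((0.8, 0.2), 0.3)]}
images = {"a": (1.0, 0.0), "b": (0.0, 1.0), "c": (0.9, 0.1)}
ranking = semantic_query("sunset", victionary, images)
```

Here image "c" ranks first because it is close to both equivalent queries, while "a" matches only the first one exactly.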
Abstract:
An approach for infrastructure asset management is provided. This approach comprises an end-to-end analytics driven maintenance approach that can take data about physical assets and additional external data, and apply advanced analytics to the data to generate business insight, foresight and planning information. Specifically, this approach uses a maintenance analysis tool, which is configured to: receive data about a set of physical assets of an infrastructure, and analyze the data about the set of physical assets to predict maintenance requirements for each of the set of physical assets. The maintenance analysis tool further comprises an output component configured to generate a maintenance plan based on the predicted maintenance requirements for each of the set of physical assets.
Abstract:
Multiple event types are monitored for events, and surveillance data is stored for each event. Surveillance data for a primary event of one event type can be presented to a user, and surveillance data for a set of related events corresponding to another event type can be presented based on a set of relatedness criteria and the surveillance data for the primary event. A user can adjust the relatedness criteria to filter/adjust the surveillance data presented for the related event(s). A user interface can enable the user to simultaneously view the surveillance data for both events and adjust the relatedness criteria. In an illustrative application, the invention is utilized to detect fraudulent merchandise returns in a retail store.
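A simple reading of "relatedness criteria" is a filter over stored events that the user can tighten or loosen. The sketch below assumes two illustrative criteria, a time window and a location match; the event fields and the criteria themselves are hypothetical.

```python
from typing import Dict, List, Union

Event = Dict[str, Union[str, float]]


def related_events(
    primary: Event,
    candidates: List[Event],
    max_gap_s: float,
    same_location: bool = True,
) -> List[Event]:
    """Return candidate events that satisfy the relatedness criteria."""
    related = []
    for event in candidates:
        close_in_time = abs(event["time"] - primary["time"]) <= max_gap_s
        location_ok = (not same_location) or event["location"] == primary["location"]
        if close_in_time and location_ok:
            related.append(event)
    return related


# Usage: a return transaction (primary) and nearby video events (related).
primary = {"type": "return", "time": 100.0, "location": "desk_1"}
video = [
    {"type": "person_at_desk", "time": 95.0, "location": "desk_1"},
    {"type": "person_at_desk", "time": 400.0, "location": "desk_1"},
    {"type": "person_at_desk", "time": 101.0, "location": "desk_2"},
]
strict = related_events(primary, video, max_gap_s=30.0)
loose = related_events(primary, video, max_gap_s=30.0, same_location=False)
```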
Abstract:
A system, method and computer program product for mining a rule including spatial information and non-spatial information by using a SAR (Spatial Association Rule) mining tool. The computing system is configured to construct an expanded spatial predicate transaction table for reference spatial objects and a generalized taxonomy for task-relevant spatial objects. The computing system is configured to run the SAR mining tool with the constructed expanded spatial predicate transaction table and the generalized taxonomy. The computing system outputs, from the SAR mining tool, a set of generalized spatial association rules for the reference spatial objects. Each generalized spatial association rule includes the spatial information and non-spatial information associated with both the reference spatial objects and the task-relevant spatial objects.
Abstract:
An approach that allows for model-based people counting is provided. In one embodiment, there is a generating tool configured to generate a set of person-shape models based on results of a cumulative training process; a detecting tool configured to detect persons in a camera field-of-view by using the set of person-shape models; and a counting tool configured to track detected persons upon their crossing of a previously established virtual boundary.
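The boundary-crossing part of the counting step can be illustrated with a small geometric test. This sketch assumes that tracks are already available as lists of image positions and treats the virtual boundary as an infinite line through two points; both simplifications are assumptions, and person detection itself is not shown.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def side_of_line(p: Point, a: Point, b: Point) -> float:
    """Cross-product sign: positive on one side of line a->b, negative on the other."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])


def count_crossings(tracks: Dict[str, List[Point]], a: Point, b: Point) -> int:
    """Count how many times tracked positions move from one side to the other."""
    count = 0
    for positions in tracks.values():
        for prev, cur in zip(positions, positions[1:]):
            # A strict sign change means the step crossed the boundary line.
            if side_of_line(prev, a, b) * side_of_line(cur, a, b) < 0:
                count += 1
    return count


# Usage: a horizontal boundary at y = 5 and three hypothetical tracks.
boundary_a, boundary_b = (0.0, 5.0), (10.0, 5.0)
tracks = {
    "p1": [(2.0, 3.0), (2.0, 4.0), (2.0, 6.0)],  # crosses once
    "p2": [(5.0, 1.0), (5.0, 2.0)],              # never crosses
    "p3": [(7.0, 8.0), (7.0, 4.0), (7.0, 9.0)],  # crosses twice
}
total = count_crossings(tracks, boundary_a, boundary_b)
```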
Abstract:
Techniques for classifying one or more objects in at least one video, wherein the at least one video comprises a plurality of frames, are provided. One or more objects in the plurality of frames are tracked. A level of deformation is computed for each of the one or more tracked objects in accordance with at least one change in a plurality of histograms of oriented gradients for a corresponding tracked object. Each of the one or more tracked objects is classified in accordance with the computed level of deformation.
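To make the idea of a deformation level concrete, the sketch below measures how much an object's orientation histogram changes from frame to frame. The specific measure (mean L1 distance between consecutive normalized histograms), the threshold, and the "rigid"/"deformable" labels are assumptions for illustration; the abstract does not define them, and computing the histograms from pixels is not shown.

```python
from typing import List, Sequence


def normalize(hist: Sequence[float]) -> List[float]:
    """Scale a histogram to sum to 1 so object size does not affect the result."""
    total = sum(hist)
    return [h / total for h in hist] if total else list(hist)


def deformation_level(histograms: Sequence[Sequence[float]]) -> float:
    """Mean L1 change between consecutive normalized orientation histograms."""
    if len(histograms) < 2:
        return 0.0
    normed = [normalize(h) for h in histograms]
    changes = [
        sum(abs(x - y) for x, y in zip(prev, cur))
        for prev, cur in zip(normed, normed[1:])
    ]
    return sum(changes) / len(changes)


def classify(histograms: Sequence[Sequence[float]], threshold: float = 0.2) -> str:
    """Label a tracked object by comparing its deformation level to a threshold."""
    return "deformable" if deformation_level(histograms) > threshold else "rigid"


# Usage: one histogram per frame for each tracked object (hypothetical values).
steady = [[4, 2, 2], [4, 2, 2], [8, 4, 4]]   # same shape, different scale
changing = [[4, 0, 0], [0, 4, 0], [0, 0, 4]]  # gradient directions keep shifting
```

In this toy data, `steady` yields a deformation level of zero because normalization removes the scale change, while `changing` yields the maximum per-step change of 2.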
Abstract:
A method for real time processing of a sequence of video frames. The video frames are received in synchronization with a recording of the video frames in real time for triggering an alert. The method is implemented by execution of program code on a processor of a computer system. Each frame includes a two-dimensional array of pixels and a frame-dependent color intensity at each pixel. An algorithm determines whether a static object in a current frame of the video frames is an abandoned object or a removed object. The determined status, the current frame time, the static region, and the static object are stored in a data storage medium of the computer system. An alarm is triggered in response to satisfaction of requirements that include a persistence requirement, a non-persistence duration requirement, and a persistence duration requirement.