Abstract:
The invention provides an improved method for detecting semantic attributes of the human body in computer vision. The invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding the most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments then undergo geometric and resolution context analysis, which applies the physical structure principles of a human body and analyzes increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and, when available, resolution context features on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.
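One possible reading of the resolution context score can be sketched as follows. This is a minimal illustration, not the patented method: the function names, the equal default weights, and the choice to pass a single score from each higher resolution level down to the next lower one are all assumptions.

```python
def weighted_average_score(appearance, geometric, resolution_context=None,
                           weights=(1.0, 1.0, 1.0)):
    """Weighted average of the feature scores available at one resolution.

    The resolution context term is included only when it is available,
    i.e. when a higher resolution version has already been scored.
    The equal default weights are an assumption for illustration.
    """
    scores = [appearance, geometric]
    used_weights = [weights[0], weights[1]]
    if resolution_context is not None:
        scores.append(resolution_context)
        used_weights.append(weights[2])
    total = sum(s * w for s, w in zip(scores, used_weights))
    return total / sum(used_weights)


def score_resolution_pyramid(levels):
    """Score each level of a pyramid ordered from lowest to highest resolution.

    `levels` is a list of (appearance, geometric) score pairs. The weighted
    average computed at a higher resolution becomes the resolution context
    score of the next lower resolution.
    """
    results = [0.0] * len(levels)
    context = None  # the highest resolution has no resolution context
    for i in range(len(levels) - 1, -1, -1):
        appearance, geometric = levels[i]
        results[i] = weighted_average_score(appearance, geometric, context)
        context = results[i]
    return results


# Two levels: low resolution first, high resolution second.
pyramid_scores = score_resolution_pyramid([(0.6, 0.8), (0.9, 0.7)])
```

Here the high resolution level is scored from appearance and geometry alone, and that result then contributes as the third term when the low resolution level is scored.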
Abstract:
Techniques for detecting an attribute in video surveillance include: generating training sets of multispectral images; generating a group of multispectral box features, which comprises receiving input of a detector size of a width and height, a number of spectral bands in the multispectral images, and integer values representing a minimum and maximum width and height of multispectral box features, fixing a feature width and height, generating feature building blocks with the fixed width and height, placing a feature building block at a same location for each spectral band level, and enumerating combinations of the feature building blocks through each spectral level until all sizes within the integer values have been covered, wherein each combination determines a multispectral box feature; using the training sets to select multispectral box features to generate a multispectral attribute detector; and using the multispectral attribute detector to identify a location of an attribute in video surveillance.
Abstract:
An invention for generating a coding schema for identifying a spatial location of an event within video image data is provided. In one embodiment, there is a spatial representation tool, including a compression component configured to receive trajectory data of an event within video image data, generate a lossless compressed contour-coded blob to encode the trajectory data of the event within video image data, and generate a lossy searchable code to enable searching of a relational database based on the trajectory data of the event within the video image data.
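The pairing of a lossless code with a lossy searchable code can be illustrated with a small sketch. The abstract does not specify the encodings, so everything here is an assumption: the lossless part is shown as a Freeman-style 8-direction chain code over a trajectory whose consecutive points are neighbouring pixels, and the lossy part as the set of coarse grid cells the trajectory visits, which could serve as a key for a database lookup.

```python
# Direction codes for unit steps between 8-connected pixels (assumed scheme).
DIRECTIONS = {
    (1, 0): 0, (1, 1): 1, (0, 1): 2, (-1, 1): 3,
    (-1, 0): 4, (-1, -1): 5, (0, -1): 6, (1, -1): 7,
}
INVERSE_DIRECTIONS = {code: step for step, code in DIRECTIONS.items()}


def chain_encode(points):
    """Lossless code: the start point plus one direction code per step.

    Assumes consecutive points are 8-connected neighbours; any other
    step raises KeyError.
    """
    codes = [
        DIRECTIONS[(x1 - x0, y1 - y0)]
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    ]
    return points[0], codes


def chain_decode(start, codes):
    """Rebuild the exact trajectory from the start point and chain code."""
    points = [start]
    for code in codes:
        dx, dy = INVERSE_DIRECTIONS[code]
        x, y = points[-1]
        points.append((x + dx, y + dy))
    return points


def grid_code(points, width, height, cells=4):
    """Lossy code: sorted indices of the coarse grid cells the path visits."""
    visited = set()
    for x, y in points:
        col = min(x * cells // width, cells - 1)
        row = min(y * cells // height, cells - 1)
        visited.add(row * cells + col)
    return sorted(visited)


trajectory = [(0, 0), (1, 0), (2, 1), (2, 2)]
start, codes = chain_encode(trajectory)
restored = chain_decode(start, codes)
search_key = grid_code(trajectory, width=8, height=8, cells=4)
```

The chain code round-trips to the original points, while the grid code discards fine detail in exchange for a compact value that is cheap to index and compare.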
Abstract:
An approach that evaluates an effectiveness of a monitoring system is provided. In one embodiment, there is a performance evaluation tool, including an evaluation component configured to receive visual media at a monitoring system monitored by a set of users, and evaluate a monitoring performance of the set of users in response to a simulated alert within the visual media. The performance evaluation tool further comprises an analysis component configured to predict an effectiveness of the monitoring system based on the performance of the set of users in response to the simulated alert within the visual media.
Abstract:
An approach that allows for model-based people counting is provided. In one embodiment, there is a generating tool configured to generate a set of person-shape models based on results of a cumulative training process; a detecting tool configured to detect persons in a camera field-of-view by using the set of person-shape models; and a counting tool configured to track detected persons as they cross a previously established virtual boundary.
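The counting step can be sketched as a side-of-line test on tracked positions. This is a simplified illustration under stated assumptions: the detector and tracker are taken as given, each track is a list of (x, y) positions, the virtual boundary is treated as an infinite line through two points, and the function names are hypothetical.

```python
def side_of_line(a, b, p):
    """Signed area test: positive on one side of line a-b, negative on the other."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])


def count_crossings(tracks, a, b):
    """Count how many times tracked persons cross the line through a and b.

    Positions lying exactly on the line are ignored, so a crossing is
    registered only when a track moves from one side to the other.
    """
    total = 0
    for track in tracks:
        sides = [side_of_line(a, b, p) for p in track]
        signs = [s > 0 for s in sides if s != 0]
        total += sum(1 for s0, s1 in zip(signs, signs[1:]) if s0 != s1)
    return total


boundary_start, boundary_end = (0, 5), (10, 5)  # horizontal line y = 5
tracks = [
    [(3, 2), (3, 4), (3, 6), (3, 8)],  # crosses once
    [(7, 1), (8, 2), (9, 3)],          # never crosses
    [(1, 9), (1, 4), (1, 7)],          # crosses twice
]
crossings = count_crossings(tracks, boundary_start, boundary_end)
```

A directional count (entries versus exits) would follow from recording which way the sign changes instead of only that it changed.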
Abstract:
Systems and methods for foreground analysis in real-time video include background subtraction and foreground detection, shadow removal, quick lighting change adaptation, static foreground region detection, foreground fragment reduction, and frame level change detection. Processes include background image extraction and foreground detection; integrating texture information of the background image and a current frame to remove false positive foreground areas resulting from lighting changes; and integrating pixel intensity information, by determining a cross-correlation of intensities between a current frame and the background image for each pixel in a foreground mask, to remove image shadows. Static foreground region detection and fragment reduction are also included.
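The shadow removal step can be illustrated with a small pure-Python sketch. It rests on assumptions the abstract does not state: images are grayscale lists of rows, the cross-correlation is a normalized cross-correlation over a small window around each pixel, and a foreground pixel is treated as shadow when it is darker than the background and its correlation exceeds a threshold. The function names, window radius, and the 0.95 threshold are illustrative.

```python
import math


def normalized_cross_correlation(frame, background, row, col, radius=1):
    """NCC of frame and background intensities in a window around (row, col)."""
    height, width = len(frame), len(frame[0])
    frame_vals, background_vals = [], []
    for r in range(max(0, row - radius), min(height, row + radius + 1)):
        for c in range(max(0, col - radius), min(width, col + radius + 1)):
            frame_vals.append(frame[r][c])
            background_vals.append(background[r][c])
    dot = sum(f * b for f, b in zip(frame_vals, background_vals))
    norm = math.sqrt(sum(f * f for f in frame_vals) *
                     sum(b * b for b in background_vals))
    return dot / norm if norm > 0 else 0.0


def remove_shadows(frame, background, mask, threshold=0.95, radius=1):
    """Return a copy of the foreground mask with shadow pixels cleared.

    A shadow darkens the background while roughly preserving its local
    pattern, so it shows up as a darker pixel with a high correlation.
    """
    cleaned = [row[:] for row in mask]
    for r, mask_row in enumerate(mask):
        for c, is_foreground in enumerate(mask_row):
            if not is_foreground or frame[r][c] >= background[r][c]:
                continue
            if normalized_cross_correlation(frame, background, r, c, radius) >= threshold:
                cleaned[r][c] = 0
    return cleaned


background = [[100, 120, 100], [120, 140, 120], [100, 120, 100]]
full_mask = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]

# A shadow: the background pattern at half brightness.
shadow_frame = [[v // 2 for v in row] for row in background]
# A real object: a pattern unrelated to the background.
object_frame = [[200, 10, 200], [10, 200, 10], [200, 10, 200]]

shadow_result = remove_shadows(shadow_frame, background, full_mask)
object_result = remove_shadows(object_frame, background, full_mask)
```

In this toy input the uniformly darkened frame correlates perfectly with the background and is cleared from the mask, while the unrelated pattern stays in the foreground.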
Abstract:
A surveillance system and method includes a plurality of sensors configured to monitor an environment. A plurality of analytic engines is associated with each of the plurality of sensors. The plurality of analytic engines employs different technologies and is configured to analyze input from the sensors to determine whether an event has occurred in a respective technology. A unifying data model is configured to cross correlate detected events from the different technologies to gain integrated situation awareness across the different technologies.
Abstract:
A system and method for generating a viewable video index for low bandwidth applications are provided. The exemplary aspects of the present invention solve the problems with the prior art systems by incorporating information for generating a viewable representation of the video data into the index, thus generating a viewable video index. The viewable video index contains information for generating a visual representation of moving objects in the video data, a visual representation of the background of the video capture area, i.e. the scene, a representation of the object trajectory, a representation of the object attributes, and a representation of detected events. The result is that the viewable video index may be transmitted to a low bandwidth application on a client device and may be used along with associated object and background models to generate a representation of the actual video data without requiring that the original video data itself be streamed to the client device.
Abstract:
A method, system and computer program product for detecting presence of an object in an image are disclosed. According to an embodiment, a method for detecting a presence of an object in an image comprises: receiving multiple training image samples; determining a set of adaptive features for each training image sample, the set of adaptive features matching the local structure of each training image sample; integrating the sets of adaptive features of the multiple training image samples to generate an adaptive feature pool; determining a general feature based on the adaptive feature pool; and examining the image using a classifier determined based on the general feature to detect the presence of the object.
Abstract:
A system and method for monitoring events includes defining alert events based on primitive event types in an environment monitored by monitoring devices. Alert parameters are set for each alert event, and alert events are combined to create a composite alert event based on a plurality of alert events. A plurality of engines is monitored to determine if multiple events comprising the composite alert event have occurred.
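The relationship between primitive events, alert events, and a composite alert event can be sketched as below. This is a hypothetical data model, not the one claimed: the `AlertEvent` class, the single `min_count` alert parameter, the engine and event names, and the rule that a composite fires when all of its alert events have occurred are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlertEvent:
    """An alert defined on one primitive event type, with one parameter."""
    name: str
    primitive_type: str
    min_count: int = 1  # alert parameter: occurrences needed to trigger


def triggered_alerts(alerts, engine_reports):
    """Return the names of alerts satisfied by the events the engines reported.

    `engine_reports` maps an engine name to the list of primitive event
    types that engine has detected.
    """
    counts = {}
    for events in engine_reports.values():
        for event_type in events:
            counts[event_type] = counts.get(event_type, 0) + 1
    return {
        alert.name
        for alert in alerts
        if counts.get(alert.primitive_type, 0) >= alert.min_count
    }


def composite_occurred(composite, alerts, engine_reports):
    """A composite alert fires when every alert event it names has occurred."""
    return set(composite) <= triggered_alerts(alerts, engine_reports)


alerts = [
    AlertEvent("door_forced", "door_open", min_count=1),
    AlertEvent("loitering", "motion", min_count=2),
]
composite = ["door_forced", "loitering"]

reports_full = {"badge_reader": ["door_open"], "video": ["motion", "motion"]}
reports_partial = {"badge_reader": ["door_open"], "video": ["motion"]}

fired = composite_occurred(composite, alerts, reports_full)
not_fired = composite_occurred(composite, alerts, reports_partial)
```

Polling several engines and merging their reports before evaluation mirrors the idea of monitoring a plurality of engines for the events that make up the composite.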