Abstract:
Embodiments of the disclosure provide for systems and methods for searching video data for events and/or behaviors. An inference engine can be used to aid in the searching. In some embodiments, a user can specify various search criteria, for example, a video source(s), an event(s) or behavior(s) to search, and an action(s) to perform in the event of a successful search. The search can be performed by analyzing an object(s) found within scenes of the video data. An object can be identified by a number of attributes specified by the user. Once the search criteria have been received from the user, the video data can be received (or extracted from storage), the data analyzed for the specified events (or behaviors), and the specified action performed in the event a successful search occurs.
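The search flow described above (criteria in, matching objects out, action on success) can be illustrated with a minimal sketch. All names here (`DetectedObject`, `SearchCriteria`, `run_search`) are hypothetical and not taken from the disclosure, and plain attribute matching stands in for the inference engine, whose workings the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class DetectedObject:
    """Hypothetical record for an object found in a scene."""
    source: str                 # video source the object was seen on
    timestamp: float            # seconds from the start of the video
    attributes: Dict[str, str]  # e.g. {"type": "car", "color": "red"}


@dataclass
class SearchCriteria:
    """Hypothetical container for the user-specified search criteria."""
    sources: List[str]
    required_attributes: Dict[str, str]
    action: Callable[[DetectedObject], None]


def matches(obj: DetectedObject, criteria: SearchCriteria) -> bool:
    """True if the object comes from a selected source and has every required attribute."""
    if obj.source not in criteria.sources:
        return False
    return all(obj.attributes.get(key) == value
               for key, value in criteria.required_attributes.items())


def run_search(objects: List[DetectedObject], criteria: SearchCriteria) -> List[DetectedObject]:
    """Return matching objects and perform the specified action on each one."""
    hits = [obj for obj in objects if matches(obj, criteria)]
    for hit in hits:
        criteria.action(hit)
    return hits
```

A caller might pass `action=alerts.append` to collect hits in a list, or a function that sends a notification.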
Abstract:
An image capture system includes: an image capture unit configured to capture a first image frame comprising a set of pixels; and a processor coupled to the image capture unit and configured to: determine a normalized distance of a pixel characteristic between the first image frame and a second image frame for each pixel in the first image frame; compare the normalized distance for each pixel in the first image frame against a pixel sensitivity value for that pixel; determine that a particular pixel of the first image frame is a foreground or background pixel based on the normalized distance of the particular pixel relative to the pixel sensitivity value for the particular pixel; and adapt the pixel sensitivity value for each pixel over a range of allowable pixel sensitivity values.
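The per-pixel test described above can be sketched as follows. This is only an illustration under stated assumptions: the pixel characteristic is taken to be an 8-bit intensity, the distance is normalized by 255, and the adaptation rule (nudge the sensitivity up where a pixel triggers, down where it does not, then clamp to the allowable range) is a placeholder, since the abstract does not say how the sensitivity is adapted.

```python
def normalized_distance(a, b, max_value=255.0):
    """Absolute difference of two pixel values scaled to [0, 1] (assumed normalization)."""
    return abs(a - b) / max_value


def classify_and_adapt(frame, reference, sensitivity,
                       step=0.01, s_min=0.05, s_max=0.5):
    """Classify each pixel and return (foreground_mask, adapted_sensitivities).

    frame, reference: flat sequences of pixel values of equal length.
    sensitivity: per-pixel thresholds, one per pixel.
    step, s_min, s_max: hypothetical adaptation step and allowable range.
    """
    mask = []
    adapted = []
    for value, ref, s in zip(frame, reference, sensitivity):
        is_foreground = normalized_distance(value, ref) > s
        mask.append(is_foreground)
        # Assumed rule: raise the threshold where the pixel triggers,
        # lower it where the pixel is stable.
        s = s + step if is_foreground else s - step
        # Keep the sensitivity inside the range of allowable values.
        adapted.append(min(s_max, max(s_min, s)))
    return mask, adapted
```

Keeping one sensitivity value per pixel lets noisy regions of the image settle on a higher threshold than quiet regions.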
Abstract:
Systems and methods are described herein that provide a three-tier intelligent video surveillance management system. An example of a system described herein includes a gateway configured to obtain video content and metadata relating to the video content from a plurality of network devices, a metadata processing module communicatively coupled to the gateway and configured to filter the metadata according to one or more criteria to obtain a filtered set of metadata, a video processing module communicatively coupled to the gateway and the metadata processing module and configured to isolate video portions, of the video content, associated with respective first portions of the filtered set of metadata, and a cloud services interface communicatively coupled to the gateway, the metadata processing module and the video processing module and configured to provide at least some of the filtered set of metadata or the isolated video portions to a cloud computing service.
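The two processing steps at the gateway tier, filtering metadata and isolating the video portions tied to it, might look like the sketch below. The record layout (`time`, `confidence`), the confidence criterion, and the padding around each event are assumptions made for illustration. The abstract only says that metadata is filtered "according to one or more criteria".

```python
def filter_metadata(records, min_confidence):
    """Keep metadata records that meet an assumed confidence criterion."""
    return [r for r in records if r["confidence"] >= min_confidence]


def isolate_portions(filtered, padding=2.0):
    """Turn filtered records into merged (start, end) time spans in seconds.

    Each record contributes a window of `padding` seconds on either side of
    its timestamp, and overlapping windows are merged into one portion.
    """
    intervals = sorted((max(0.0, r["time"] - padding), r["time"] + padding)
                       for r in filtered)
    merged = []
    for start, end in intervals:
        if merged and start <= merged[-1][1]:
            # Overlaps the previous portion: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged
```

Only the filtered records and the resulting spans would then need to be forwarded to the cloud service, rather than the full video stream.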
Abstract:
A method of video coding includes receiving a video image having a plurality of pixels, selecting a plurality of Gaussian models for each pixel, classifying each pixel as a background or foreground pixel, determining if an object is always moving locally by detecting if the object is always moving within a predetermined distance and frequently changing its direction of movement for a second predetermined period of time, removing an object from classification as an object and ignoring motion of the object if the object is always moving locally, marking the area that the object moves within as a noisy area on a noise mask buffer, updating background models based on motion detection and noise filtering, updating current and evolving background images, and coding the current background image, background update blocks and objects for transmission.
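The "always moving locally" test can be sketched as below. This is a simplified illustration, not the claimed method: the object is assumed to be tracked as a list of centroid positions sampled over the observation period, "within a predetermined distance" is measured from the first position, and a direction change is counted whenever two consecutive displacement vectors have a negative dot product. The parameter names are hypothetical.

```python
import math


def is_moving_locally(track, max_radius, min_direction_changes):
    """Return True if the track stays near its start and keeps reversing direction.

    track: list of (x, y) centroid positions over the observation period.
    max_radius: assumed stand-in for the predetermined distance.
    min_direction_changes: assumed threshold for "frequently" changing direction.
    """
    if len(track) < 3:
        return False
    x0, y0 = track[0]
    # The object must never leave the allowed radius around its first position.
    if any(math.hypot(x - x0, y - y0) > max_radius for x, y in track):
        return False
    changes = 0
    for (ax, ay), (bx, by), (cx, cy) in zip(track, track[1:], track[2:]):
        v1 = (bx - ax, by - ay)
        v2 = (cx - bx, cy - by)
        # A negative dot product means the motion turned by more than 90 degrees.
        if v1[0] * v2[0] + v1[1] * v2[1] < 0:
            changes += 1
    return changes >= min_direction_changes
```

An object that passes this test, such as a swaying branch, would be dropped from the object list and its area marked on the noise mask.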
Abstract:
Techniques are discussed herein for providing mechanisms for coding and transmitting high definition video, e.g., over low bandwidth connections. In particular, foreground-objects are identified as distinct from the background of a scene represented by a plurality of video frames. In identifying foreground-objects, semantically significant and semantically insignificant movement (e.g., non-repetitive versus repetitive movement) is differentiated. For example, the swaying motion of a tree's leaves, being minor and repetitive, can be determined to be semantically insignificant and to belong in a scene's background. Processing of the foreground-objects and background proceeds at different update rates or frequencies. For example, foreground-objects can be updated 30 or 60 times per second. By contrast, a background is updated less frequently, e.g., once every 10 seconds. In some implementations, if no foreground-objects are identified, no live video is transmitted (e.g., if no motion is detected, static images are not repeatedly sent). Techniques described herein take advantage of the realization that, in the area of surveillance and wireless communications, updating video of semantically significant movement at a high frame rate is sufficient.
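The two update rates can be illustrated with a small scheduling sketch. The function name and parameters are hypothetical. The defaults use the example figures from the abstract (30 foreground updates per second, one background update every 10 seconds), and the sketch assumes the background refresh is tied to the frame counter.

```python
def updates_for_frame(frame_index, fps=30, bg_period_s=10, has_foreground=True):
    """Return which update types to send for a given frame.

    Foreground updates go out on every frame that contains a foreground
    object. A background update goes out once every `bg_period_s` seconds.
    """
    updates = []
    if has_foreground:
        updates.append("foreground")
    if frame_index % (fps * bg_period_s) == 0:
        updates.append("background")
    return updates
```

With the defaults, a frame with no foreground object that does not fall on a background refresh yields an empty list, so nothing is transmitted for it.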
Abstract:
An example of a method for identifying objects in video content according to the disclosure includes receiving video content of a scene captured by a video camera, detecting an object in the video content, identifying a track that the object follows over a series of frames of the video content, extracting object features for the object from the video content, and classifying the object based on the object features. Classifying the object further comprises: determining a track-level classification for the object using spatially invariant object features, determining a global-clustering classification for the object using spatially variant features, and determining an object type for the object based on the track-level classification and the global-clustering classification for the object.
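The final step, choosing an object type from the two classifications, can be sketched as follows. The abstract does not say how the track-level and global-clustering results are combined, so this sketch assumes each classifier yields a score per label and fuses them by multiplying the scores. The function name and the `"unknown"` fallback are likewise assumptions.

```python
def fuse_classifications(track_scores, cluster_scores):
    """Pick the label with the highest product of the two classifiers' scores.

    track_scores: label -> score from the track-level classifier
                  (spatially invariant features).
    cluster_scores: label -> score from the global-clustering classifier
                    (spatially variant features).
    Returns "unknown" when the classifiers share no label with a positive score.
    """
    labels = set(track_scores) | set(cluster_scores)
    if not labels:
        return "unknown"
    combined = {label: track_scores.get(label, 0.0) * cluster_scores.get(label, 0.0)
                for label in labels}
    # Sort first so that ties are broken deterministically by label name.
    best = max(sorted(combined), key=combined.get)
    return best if combined[best] > 0 else "unknown"
```

Multiplying the scores means a label must be supported by both classifiers to win. A weighted sum would be a more forgiving alternative.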
Abstract:
Disclosed are methods, systems, computer readable media and other implementations, including a method to calibrate a camera that includes capturing by the camera a frame of a scene, identifying features appearing in the captured frame, the features associated with pre-determined values representative of physical attributes of one or more objects, and determining parameters of the camera based on the identified features appearing in the captured frame and the pre-determined values associated with the identified features.