Abstract:
The method facilitates efficient motion estimation for video sequences captured with a stationary camera with respect to an object. For video captured with this type of camera, a main cause of changes between adjacent frames corresponds to object motion. In this setting the output from the motion compensation stage is the block matching algorithm describing the way pixel blocks move between adjacent frames. For video captured with cameras mounted on moving vehicles (e.g. school buses, public transportation vehicles and police cars), the motion of the vehicle itself is the largest source of apparent motion in the captured video. In both cases, the encoded set of motion vectors is a good descriptor of apparent motion of objects within the field of view of the camera.
Abstract:
A mobile electronic device processes a sequence of images to identify and re-identify an object of interest in the sequence. An image sensor of the device, receives a sequence of images. The device detects an object in a first image as well as positional parameters of the device that correspond to the object in the first image. The device determines a range of positional parameters within which the object may appear in a field of view of the device. When the device detects that the object of interest exited the field of view it subsequently uses motion sensor data to determine that the object of interest has likely re-entered the field of view, it will analyze the current frame to confirm that the object of interest has re-entered the field of view.
Abstract:
What is disclosed is a system and method for identifying a patient's breathing pattern for respiratory function assessment without contact and with a depth-capable imaging system. In one embodiment, a time-varying sequence of depth maps are received of a target region of a subject of interest over a period of inspiration and expiration. Once received, the depth maps are processed to obtain a breathing signal for the subject. The subject's breathing signal comprises a temporal sequence of instantaneous volumes. One or more segments of the subject's breathing signal are then compared against one or more reference breathing signals each associated with a known pattern of breathing. As a result of the comparison, a breathing pattern for the subject is identified. The identified breathing pattern is then used to assess the subject's respiratory function. The teachings hereof find their uses in an array of diverse medical applications. Various embodiments are disclosed.
Abstract:
A method, non-transitory computer readable medium, and apparatus for directing a vehicle in a side-by-side drive-thru are disclosed. For example, the method receives one or more video images of a side-by-side drive-thru comprising two or more lanes, detects a vehicle approaching an entrance of the side-by-side drive-thru, calculating an estimated order time for the vehicle and directs the vehicle to one of the two or more lanes based on the estimated order time for the vehicle or a previously estimated order time of each one of the a plurality of vehicles already in the first lane and the second lane of the drive-thru.
Abstract:
In a system for detecting location of an object inside of a building, an image capture device of a mobile electronic device captures an image of a boundary of a room in which the portable electronic device is positioned. The system extracts features of a boundary (ceiling, wall or floor) in the image to determine whether the mobile device is in a known location. When the system identifies a known location, it will take an action that provides the portable electronic device with additional functionality at the identified known location. Such functionality may include connecting to a wireless network or communicating with a stationary device at the known location.
Abstract:
A method and system for identifying content relevance comprises acquiring video data, mapping the acquired video data to a feature space to obtain a feature representation of the video data, assigning the acquired video data to at least one action class based on the feature representation of the video data, and determining a relevance of the acquired video data.
Abstract:
Block-based motion estimation of video compression estimates the direction and magnitude of motion of objects in the scene in a computationally efficient manner and accurately predicts the optimal search direction/neighborhood location for motion vectors. A system can include a motion detection module that detects apparent motion in the scene, a motion direction and magnitude prediction module that estimates the direction and magnitude of motion of the objects detected to be in motion by the motion detection module, and a block-based motion estimation module that performs searches in reduced neighborhoods of the target block according to the estimated motion by the motion direction and magnitude prediction module and only for the blocks determined to be in motion by the motion detection module. The Invention is particularly well suited for stationary traffic cameras that monitor roads and highways for traffic law enforcement purposes.
Abstract:
A method, computer readable medium and apparatus for verifying an identity of an individual based upon facial expressions as exhibited in a query video of the individual are disclosed. The method includes receiving a reference video for each one of a plurality of different individuals, wherein a plurality of facial gesture encoders is extracted from at least one frame of the reference video describing one or more facial expressions of each one of the plurality of different individuals, receiving the query video, calculating a similarity score for the reference video for the each one of the plurality of different individuals based on an analysis that compares the plurality of facial gesture encoders of the at least one frame of the reference video for the each one of the plurality of different individuals to a plurality of facial gesture encoders extracted from at least one frame of the query video.
Abstract:
What is disclosed is a system and method for enhancing a spatio-temporal resolution of a depth data stream. In one embodiment, time-sequential reflectance frames and time-sequential depth frames of a scene are received. If a temporal resolution of the reflectance frames is greater than the depth frames then a new depth frame is generated based on correlations determined between motion patterns in the sequence of reflectance frames and the sequence of depth frames. The new depth frame is inserted into the sequence of depth frames at a selected time point. If a spatial resolution of the reflectance frames is greater than the depth frames then the spatial resolution of a selected depth frame is enhanced by generating new pixel depth values which are added to the selected depth frame. The spatially enhanced depth frame is then inserted back into the sequence of depth frames.
Abstract:
What is disclosed is a system and method for adaptively reconstructing a depth map of a scene. In one embodiment, upon receiving a mask identifying a region of interest (ROI), a processor changes either a spatial attribute of a pattern of source light projected on the scene by a light modulator which projects an undistorted pattern of light with known spatio-temporal attributes on the scene, or changes an operative resolution of a depth map reconstruction module. A sensing device detects the reflected pattern of light. A depth map of the scene is generated by the depth map reconstruction module by establishing correspondences between spatial attributes in the detected pattern and spatial attributes of the projected undistorted pattern and triangulating the correspondences to characterize differences therebetween. The depth map is such that a spatial resolution in the ROI is higher relative to a spatial resolution of locations not within the ROI.