Abstract:
Described herein are systems and methods for multimodal recurrent network processing. In an embodiment, a system for evaluating multimodal data comprising a multimodal data input and a multimodal processing module is described. The multimodal data input may comprise the multimodal data, the multimodal data may comprise a first modality and a second modality. The multimodal processing module may be configured to receive the multimodal data comprising the first modality and the second modality; evaluate the first modality using a first recursive neural network comprising a first transformation matrix; evaluate the second modality using a second recursive neural network comprising the first transformation matrix; and determine an output based, at least in part, on evaluating the first modality and the second modality.
Abstract:
A method and apparatus for tracking an object across a plurality of sequential images, where certain of the images contain motion blur. A plurality of normal templates of a clear target object image and a plurality of blur templates of the target object are generated. In the next subsequent image frame, a plurality of bounding boxes are generated of potential object tracking positions about the target object location in the preceding image frame. For each bounding box image frame, a reconstruction error is generated that one bounding box has a maximum probability that it is the object tracking result in the subsequent image frame.
Abstract:
Described herein are systems and methods for multimodal recurrent network processing. In an embodiment, a system for evaluating multimodal data comprising a multimodal data input and a multimodal processing module is described. The multimodal data input may comprise the multimodal data, the multimodal data may comprise a first modality and a second modality. The multimodal processing module may be configured to receive the multimodal data comprising the first modality and the second modality; evaluate the first modality using a first recursive neural network comprising a first transformation matrix; evaluate the second modality using a second recursive neural network comprising the first transformation matrix; and determine an output based, at least in part, on evaluating the first modality and the second modality.
Abstract:
A method and apparatus for tracking an object across a plurality of sequential images, where certain of the images contain motion blur. A plurality of normal templates of a clear target object image and a plurality of blur templates of the target object are generated. In the next subsequent image frame, a plurality of bounding boxes are generated of potential object tracking positions about the target object location in the preceding image frame. For each bounding box image frame, a reconstruction error is generated that one bounding box has a maximum probability that it is the object tracking result in the subsequent image frame.