Abstract:
Disclosed herein is a gesture recognition device that includes an input interface configured to receive a sequence of images, each image showing a body part with which a gesture is performed from a viewpoint of a camera. The gesture recognition device also generates a sequence of motion-compensated images from the sequence comprising generating a motion-compensated image for an image of the sequence by compensating the movement of the camera viewpoint from a reference camera viewpoint to the viewpoint from which the image shows the body part based on the image and a motion-compensated image of the sequence generated for a preceding image of the sequence which precedes the image in the sequence and estimate the gesture from the sequence of motion-compensated images.
Abstract:
Apparatuses, methods and storage medium associated with animating and rendering an avatar are disclosed herein. In embodiments, an apparatus may include a facial mesh tracker to receive a plurality of image frames, detect facial action movements of a face and head pose gestures of a head within the plurality of image frames, and output a plurality of facial motion parameters and head pose parameters that depict facial action movements and head pose gestures detected, all in real time, for animation and rendering of an avatar. The facial action movements and head pose gestures may be detected through inter-frame differences for a mouth and an eye, or the head, based on pixel sampling of the image frames. The facial action movements may include opening or closing of a mouth, and blinking of an eye. The head pose gestures may include head rotation such as pitch, yaw, roll, and head movement along horizontal and vertical direction, and the head comes closer or goes farther from the camera. Other embodiments may be described and/or claimed.
Abstract:
A long-term object tracker employs a continuous learning framework to overcome drift in the tracking position of a tracked object. The continuous learning framework consists of a continuous learning module that accumulates samples of the tracked object to improve the accuracy of object tracking over extended periods of time. The continuous learning module can include a sample pre-processor to refine a location of a candidate object found during object tracking, and a cropper to crop a portion of a frame containing a tracked object as a sample and to insert the sample into a continuous learning database to support future tracking.
Abstract:
Disclosed herein are systems, methods, and devices for using adaptive learning to identify objects. An object-identifying device performs a first object identification based on one or more features of a first modality of an object retrieved from an image frame including the object and a first database including first modality identification features. A second object identification is performed based on one or more features of a second modality of the object retrieved from the image frame and a second database including second modality identification features. The second database is updated by adaptively learning a new second modality identification feature according to a first identification result of the first object identification. The second object identification is trained with the updated second database and determines a final identification result by integrating a first identification result of the first object identification and a second identification result of the second object identification.
Abstract:
A system, article, and method of neural network object recognition for image processing includes customizing a training database and adapting an instance segmentation neural network used to perform the customization.
Abstract:
Methods and apparatus to match images using semantic features are disclosed. An example apparatus includes a semantic labeler to determine a semantic label for each of a first set of points of a first image and each of a second set of points of a second image; a binary robust independent element features (BRIEF) determiner to determine semantic BRIEF descriptors for a first subset of the first set of points and a second subset of the second set of points based on the semantic labels; and a point matcher to match first points of the first subset of points to second points of the second subset of points based on the semantic BRIEF descriptors.