摘要:
An electronic device having a memory, a display, control apparatus and a data processor. When the data processor executes a program stored in the memory of the electronic device, the electronic device receives an image containing a person, the image further containing a face component, the face component containing the face of the person; displays the image on the display; locates the face component within the image; emphasizes the face component; receives image annotation information concerning the face component; and saves the image annotation information to the memory of the electronic device. The image annotation information may take the form of contact information for use in a contact database. A method uses the face component to locate other images in the database that also contain the person, and adds the annotation information to those images.
摘要:
An example apparatus is caused to receive a video sequence of a plurality of frames, and perform a number of operations as each of at least some of the frames is received but before all of the frames are received. The apparatus is caused to calculate a score for the frame, and compare the score for the frame to a predefined threshold. The apparatus is caused to cause output of the frame as a key frame in an instance in which the frame is received within a specified period of time and the score for the frame is above the predefined threshold. Otherwise, in an instance in which none of the scores for frames received within the specified period of time is above the predefined threshold, the apparatus is caused to cause output of one of the frames received within the specified period of time as a key frame.
摘要:
The invention concerns a gesture recognition method for gesture-based interaction at an apparatus. The method comprises receiving one or more images of an object; creating feature images for the received one or more images; determining binary values for pixels in corresponding locations of said feature images and concatenating the binary values to form a binary string for said pixel; repeating the previous step for each corresponding pixel of said feature image to form a feature map and forming a histogram representation of the feature map. The invention also concerns an apparatus and a computer program.
摘要:
A method, apparatus and computer program product are provided in order to augment an index image generated by a near eye display in order to more clearly present at least a portion of the index image. In the context of a method, a position of the mobile terminal relative to an index image generated by a near eye display is determined. The method also determines an image to be presented by the mobile terminal based upon the index image and the position of the mobile terminal relative to the index image. The method also causes the image to be presented by the mobile terminal. A corresponding apparatus and a computer program product are also provided.
摘要:
An example apparatus is caused to receive a video sequence of a plurality of frames, and perform a number of operations as each of at least some of the frames is received but before all of the frames are received. The apparatus is caused to calculate a score for the frame, and compare the score for the frame to a predefined threshold. The apparatus is caused to cause output of the frame as a key frame in an instance in which the frame is received within a specified period of time and the score for the frame is above the predefined threshold. Otherwise, in an instance in which none of the scores for frames received within the specified period of time is above the predefined threshold, the apparatus is caused to cause output of one of the frames received within the specified period of time as a key frame.
摘要:
An example apparatus is caused to receive a video sequence of a plurality of frames, and activate one of a plurality of available decoding processes based on a comparison of a size of the frames to a predefined threshold. The apparatus is also caused to select some but not all of the frames of the video sequence as potential key frames of the video sequence. The selected frames are located at or close to predefined positions along a length of the video sequence. The apparatus is also caused to decode the potential key frames according to the activated decoding process, and cause output of at least some of the potential key frames as key frames of the video sequence. The apparatus may be caused to discard from the potential key frames, one or more plain frames and/or a frame identified as being similar to other potential key frames.
摘要:
An approach is provided for providing collaborative recognition using media segments. The recognition platform causes, at least in part, a generation of a request to determine recognition information for one or more media items associated with a device, one or more segments of the one or more media items, or a combination thereof. Next, the recognition platform determines to transmit the request to one or more other devices based, at least in part, on one or more device selection criteria. Then, the recognition platform receives the recognition information in response to the request. Further, the recognition platform processes and/or facilitates a processing of the recognition information to determine one or more identities of one or more users, one or more objects, or a combination thereof represented in the one or more media items.
摘要:
Various methods for local binary pattern based facial feature localization. One example method includes determining an eye state classification of an input image. The example method may also include selecting a texture model for a global shape and an associated mean shape based on eye center positions and the eye state classification, and adjusting locations of feature points defined by the mean shape based on the texture model for the global shape and an associated global shape model. Similar and related example methods and example apparatuses are also provided.
摘要:
A system and method for using images captured from a digital camera to control navigation through a three-dimensional user interface. The sequence of images may be examined to identify feature points to be tracked through successive frames of the images captured by the camera. A plurality of classifiers may be used to discern shift from rotation gestures, based on expected behavior of feature points in the image when the camera is shifted or rotated in position. The various classifiers may generate voting values for shift and rotation gestures, and the system can use historical gesture information to assist in categorizing a current gesture.
摘要:
A method for providing face pose estimation for face detection may include utilizing a selected portion of classifiers in detectors to determine coarse pose information for a candidate face in an image, determining fine pose information for the candidate face based at least in part on the determined coarse pose information, and employing another portion of the classifiers in the detectors to perform face detection based at least in part on the fine pose information to determine whether the candidate face corresponds to a face. An apparatus and computer program product corresponding to the method are also provided.