Abstract:
By understanding a website author's intention through an analysis of the function of a website, website content can be adapted for presentation or rendering in a manner that more closely appreciates and respects the function behind the website. Various inventive systems and methods analyze a website's function so that its content can be adapted to different client environments, e.g. devices, network conditions, or user preferences. A novel function-based object model automatically identifies objects associated with a website, and analyzes those objects in terms of their functions. The function-based object model permits consistent, informed decisions to be made in the adaptation process, so that web content is displayed not only in an organized manner, but in a manner that reflects the author's intention.
Abstract:
A method and system for generating a classifier to classify sub-objects of an object based on relationships between the sub-objects is provided. The classification system provides training sub-objects along with the actual classification of each training sub-object. The classification system may iteratively train sub-classifiers based on feature vectors representing the features of each sub-object, the actual classification of the sub-object, and a weight associated with the sub-object. After a sub-classifier is trained, the classification system classifies the training sub-objects using the trained sub-classifier. The classification system then adjusts the classifications based on relationships between training sub-objects. The classification system assigns a weight for the sub-classifier and a weight for each sub-object based on the accuracy of the adjusted classifications.
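The training loop described above can be sketched as an AdaBoost-style procedure in which the error is measured on relationship-adjusted classifications. This is only an illustration under stated assumptions: the abstract does not name the sub-classifier type, the relationship rule, or the weighting formula, so the threshold stump, the neighbour-agreement rule in `adjust`, and the exponential weight update are all hypothetical choices, as are the names `train_stump`, `adjust`, and `train`.

```python
import math

def train_stump(xs, ys, weights):
    """Pick the threshold and polarity with the lowest weighted error (assumed sub-classifier)."""
    best = None
    for thr in sorted(set(xs)):
        for polarity in (1, -1):
            preds = [polarity if x >= thr else -polarity for x in xs]
            err = sum(w for p, y, w in zip(preds, ys, weights) if p != y)
            if best is None or err < best[0]:
                best = (err, thr, polarity)
    _, thr, polarity = best
    return lambda x: polarity if x >= thr else -polarity

def adjust(preds):
    """Hypothetical relationship rule: a sub-object flanked by two agreeing
    neighbours takes their label."""
    out = list(preds)
    for i in range(1, len(preds) - 1):
        if preds[i - 1] == preds[i + 1]:
            out[i] = preds[i - 1]
    return out

def train(xs, ys, rounds=3):
    """Return (ensemble, final sub-object weights); labels are +1 or -1."""
    n = len(xs)
    weights = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        stump = train_stump(xs, ys, weights)
        adjusted = adjust([stump(x) for x in xs])
        # Accuracy is judged on the adjusted classifications, not the raw ones.
        err = sum(w for p, y, w in zip(adjusted, ys, weights) if p != y)
        err = min(max(err, 1e-9), 1 - 1e-9)  # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)  # sub-classifier weight
        ensemble.append((alpha, stump))
        # Raise the weight of sub-objects that are still misclassified.
        weights = [w * math.exp(-alpha * y * p) for w, y, p in zip(weights, ys, adjusted)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble, weights
```

For example, with `xs = [0.1, 0.2, 0.9, 0.3, 0.2, 0.8, 0.9]` and `ys = [-1, -1, -1, -1, -1, 1, 1]`, the stump alone mislabels the third sub-object, and the neighbour rule corrects it before the weights are computed.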
Abstract:
A method and system for generating 3D images of faces from 2D images, for generating 2D images of the faces at different image conditions from the 3D images, and for recognizing a 2D image of a target face based on the generated 2D images is provided. The recognition system provides a 3D model of a face that includes a 3D image of a standard face under a standard image condition and parameters indicating variations of an individual face from the standard face. To generate the 3D image of a face, the recognition system inputs a 2D image of the face under a standard image condition. The recognition system then calculates parameters that map the points of the 2D image to the corresponding points of a 2D image of the standard face. The recognition system uses these parameters with the 3D model to generate 3D images of the face at different image conditions.
Abstract:
A process for comparing two digital images is described. The process includes comparing texture moment data for the two images to provide a similarity index, combining the similarity index with other data to provide a similarity value and determining that the two images match when the similarity value exceeds a first threshold value.
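The comparison steps above can be sketched as follows. The abstract does not say how the similarity index is computed, what the "other data" is, or how the two are combined, so the distance-based index, the weighted average, and the names `similarity_index`, `images_match`, `weight`, and `threshold` are assumptions made purely for illustration.

```python
import math

def similarity_index(moments_a, moments_b):
    """Map the Euclidean distance between two texture-moment vectors into (0, 1].
    Identical vectors give 1.0 (assumed formula)."""
    return 1.0 / (1.0 + math.dist(moments_a, moments_b))

def images_match(moments_a, moments_b, other_similarity, weight=0.5, threshold=0.7):
    """Combine the texture index with another similarity score in [0, 1]
    and report a match when the combined value exceeds the threshold."""
    index = similarity_index(moments_a, moments_b)
    value = weight * index + (1.0 - weight) * other_similarity
    return value > threshold, value
```

Here `other_similarity` stands in for whatever additional measure is available, for instance a color-based score.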
Abstract:
A system and process for video characterization facilitates video classification and retrieval, as well as motion detection applications. It characterizes a video sequence with a gray scale image whose pixel levels reflect the intensity of motion associated with the corresponding region in the sequence of video frames. The intensity of motion is defined using any of three characterizing processes: a perceived motion energy spectrum (PMES) characterizing process, which represents object-based motion intensity over the sequence of frames; a spatio-temporal entropy (STE) characterizing process, which represents the intensity of motion based on color variation at each pixel location; and a motion vector angle entropy (MVAE) characterizing process, which represents the intensity of motion based on the variation of motion vector angles.
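As a rough illustration of the STE idea, the sketch below computes, for each pixel location, the Shannon entropy of its quantized values across the frames and scales the result to a gray level. It assumes single-channel frames with values in 0..255 given as nested lists; the quantization into `bins`, the normalization, and the names `pixel_entropy` and `ste_image` are illustrative choices, not details taken from the abstract.

```python
import math
from collections import Counter

def pixel_entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def ste_image(frames, bins=8, max_level=255):
    """Return a 2D list of gray levels; brighter means more temporal variation.
    frames: list of equally sized 2D lists with values in 0..255."""
    height, width = len(frames[0]), len(frames[0][0])
    bin_width = 256 / bins
    # Largest entropy reachable with this many bins and frames.
    max_entropy = math.log2(min(bins, len(frames)))
    image = []
    for y in range(height):
        row = []
        for x in range(width):
            quantized = [min(int(f[y][x] / bin_width), bins - 1) for f in frames]
            h = pixel_entropy(quantized)
            row.append(round(max_level * h / max_entropy) if max_entropy > 0 else 0)
        image.append(row)
    return image
```

A pixel whose value never changes maps to 0, while a pixel that lands in a different bin in every frame maps to `max_level`.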
Abstract:
The described arrangements and procedures provide an intelligent media agent to autonomously collect semantic multimedia data text descriptions on behalf of a user whenever and wherever the user accesses media content. The media agent analyzes these semantic multimedia data text descriptions in view of user behavior patterns and actions to assist the user in identifying multimedia content and related information that is appropriate to the context within which the user is operating or working. For instance, the media agent detects insertion of text and analyzes the inserted text. Based on the analysis, the agent predicts whether a user intends to access media content. If so, the agent retrieves information corresponding to media content from a media content source and presents the information to a user as a suggestion.
Abstract:
A video skim is assembled by identifying one or more key frames from a video shot. Certain lengths of frames to the left and right of the key frame are measured for visual content variety. Depending upon the measured visual content variety to the left and right of the key frame, the video skim is assembled with L frames to the left of the key frame and R frames to the right of the key frame. Measuring the visual content variety to the left and right of the key frame provides a video skim that incorporates the more salient features of a shot.
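One way to picture the choice of L and R is sketched below. The abstract does not specify how variety is measured or how it maps to L and R, so this sketch assumes each frame is summarized by one scalar feature, measures variety as the summed absolute change between consecutive frames, and splits a fixed skim length in proportion to the variety on each side. The names `content_variety`, `skim_bounds`, `window`, and `skim_length` are hypothetical.

```python
def content_variety(features, start, end):
    """Summed absolute change between consecutive frame features in [start, end)."""
    return sum(abs(features[i + 1] - features[i]) for i in range(start, end - 1))

def skim_bounds(features, key, window, skim_length):
    """Return (first_frame, last_frame) of a skim around the key frame index.
    The side with more visual variety receives more frames (assumed rule)."""
    left_start = max(0, key - window)
    right_end = min(len(features), key + window + 1)
    left = content_variety(features, left_start, key + 1)
    right = content_variety(features, key, right_end)
    total = left + right
    share = 0.5 if total == 0 else left / total
    # Clamp L and R so the skim stays inside the shot.
    L = min(round(skim_length * share), key)
    R = min(skim_length - L, len(features) - 1 - key)
    return key - L, key + R
```

With a shot that is static before the key frame and changes quickly after it, the whole budget goes to the right side.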
Abstract:
A face model having outer and inner facial features is matched to that of first and second models. Each facial feature of the first and second models is represented by a plurality of points that are adjusted for each matching outer and inner facial feature of the first and second models using (1) the corresponding epipolar constraint for the inner features of the first and second models and (2) the local grey-level structure of both outer and inner features of the first and second models. The matching and the adjusting are repeated, for each of the first and second models, until the points for each of the outer and inner facial features on the respective first and second models that are found to match those of the face model have a relative offset therebetween of not greater than a predetermined convergence tolerance. The inner facial features can include a pair of eyes, a nose, and a mouth. The outer facial features can include a pair of eyebrows and a silhouette of the jaw, chin, and cheeks.
Abstract:
Improved methods and apparatuses are provided for use in face detection. The methods and apparatuses significantly reduce the number of candidate windows within a digital image that need to be processed using more complex and/or time-consuming face detection algorithms. The improved methods and apparatuses include a skin color filter and an adaptive non-face skipping scheme.
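A minimal sketch of how a skin color filter might prune candidate windows follows. The abstract gives no color model or thresholds, so the RGB rule in `is_skin` is a commonly used heuristic substituted here for illustration, and `candidate_windows`, `stride`, and `min_skin_ratio` are hypothetical names. The adaptive non-face skipping scheme is not modeled.

```python
def is_skin(pixel):
    """Heuristic RGB skin test (an assumed rule, not the one in the source)."""
    r, g, b = pixel
    return (r > 95 and g > 40 and b > 20
            and max(pixel) - min(pixel) > 15
            and abs(r - g) > 15 and r > g and r > b)

def candidate_windows(image, size, stride=1, min_skin_ratio=0.4):
    """Return (top, left) of every size x size window with enough skin pixels.
    image: 2D list of (r, g, b) tuples. Only the returned windows would be
    passed on to a slower face detector."""
    height, width = len(image), len(image[0])
    kept = []
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            skin = sum(
                is_skin(image[y][x])
                for y in range(top, top + size)
                for x in range(left, left + size)
            )
            if skin / (size * size) >= min_skin_ratio:
                kept.append((top, left))
    return kept
```

Windows that contain too few skin-colored pixels are dropped before any expensive classifier runs, which is the reduction in candidate windows the abstract refers to.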
Abstract:
A system analyzes music to detect musical beats and to rectify beats that are out of sync with the actual beat phase of the music. The music analysis includes onset detection, tempo/meter estimation, and beat analysis, which includes the rectification of out-of-sync beats.
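The rectification step could look something like the sketch below, which assumes that tempo estimation has already produced a beat `period` and `phase` (both in seconds) and simply snaps any detected beat that strays from the resulting grid by more than `tolerance`. The abstract does not describe the actual rectification rule, so this grid-snapping approach and the name `rectify_beats` are assumptions.

```python
def rectify_beats(beats, period, phase, tolerance):
    """Snap out-of-sync beats to the nearest position on the beat grid.
    beats: detected beat times in seconds.
    A beat is left untouched when it is within `tolerance` of the grid."""
    fixed = []
    for t in beats:
        nearest = phase + round((t - phase) / period) * period
        fixed.append(nearest if abs(t - nearest) > tolerance else t)
    return fixed
```

For instance, `rectify_beats([0.0, 0.5, 1.2, 1.5], period=0.5, phase=0.0, tolerance=0.05)` moves the beat at 1.2 s onto the grid position at 1.0 s and leaves the others alone.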