摘要:
Provided is a method of tagging media. The method identifies at least one region of interest in a media based on a user input and assigns a higher weighted tag to an object identified in at least one region of interest compared to an object present in another region of the media.
摘要:
System and method for using information extracted from intuitive multimodal interactions in the context of media for media tagging are disclosed. In one embodiment, multimodal information related to media is captured during multimodal interactions of a plurality of users. The multimodal information includes speech information and gesture information. Further, the multimodal information is analyzed to identify speech portions of interest. Furthermore, relevant tags for tagging the media are extracted from the speech portions of interest.
摘要:
A method for enabling organization of a plurality of media objects is disclosed. The method comprises playing a digital media object to a user; capturing the interaction of the user with the played digital media object; and tagging the played digital media object based on said interaction. A software program product implementing this method, a system comprising the software program product and a digital media object tagged in accordance with this method are also disclosed.
摘要:
System and method for using information extracted from intuitive multimodal interactions in the context of media for media tagging are disclosed. In one embodiment, multimodal information related to media is captured during multimodal interactions of a plurality of users. The multimodal information includes speech information and gesture information. Further, the multimodal information is analyzed to identify speech portions of interest. Furthermore, relevant tags for tagging the media are extracted from the speech portions of interest.
摘要:
Provided is a method of assigning user interaction controls. The method assigns, in a scenario where multiple co-present users are simultaneously providing user inputs to a computing device, a first level of user interaction controls related to an object on the computing device to a single user and a second level of user interaction controls related to the object to all co-present simultaneous users of the computing device.
摘要:
In one example, a method for multimodal human-machine interaction includes sensing a body posture of a participant using a camera (605) and evaluating the body posture to determine a posture-based probability of communication modalities from the participant (610). The method further includes detecting control input through a communication modality from the participant to the multimedia device (615) and weighting the control input by the posture-based probability (620).
摘要:
Creating a multimodal object of a user response to a media object can include capturing a multimodal user response to the media object, mapping the multimodal user response to a file of the media object, and creating a multimodal object including the mapped multimodal user response and the media object.
摘要:
Presented is method and system for processing a gesture performed by a user of a first input device. The method comprises detecting the gesture and detecting a user-provided parameter for disambiguating the gesture. A user command is then determined based on the detected gesture and the detected parameter.
摘要:
Disclosed is a method and computer program product of determining the relevance of at least a part of an electronic document comprising a plurality of terms distributed over a plurality of regions of said document, comprising displaying the electronic document to a user; determining the gaze characteristics of the person on a region of the electronic document; assigning a relevance score to an individual term in said region based on said characteristics; and generating a term relevance label for said electronic document, said term relevance label comprising relevance scores for the respective individual terms in said document The relevance scores may also be used to define a user profile for the user that can aid in retrieving future documents of relevance to the user.
摘要:
A method and system of distinguishing multimodal HCI from ambient human interactions using wake up commands is disclosed. In one embodiment, in a method of distinguishing multimodal HCI from ambient human interactions, a wake up command is detected by a computing system. The computing system is then woken up to receive a valid user command from a user upon detecting the wake up command. A countdown timer is substantially simultaneously turned on upon waking up the computing system to receive valid user commands. The countdown timer is set based on application usage parameters such as semantics of the valid user command and context of an application associated with the valid user command.