摘要:
A method for head pose estimation may include receiving block motion vectors for a frame of video from a block motion estimator, selecting at least one block for analysis, determining an average motion vector for the at least one selected block, combining the average motion vectors over time (all past frames of video) to determine an accumulated average motion vector, estimating the orientation of a user's head in the video frame based on the accumulated average motion vector, and outputting at least one parameter indicative of the estimated orientation.
摘要:
Disclosed are methods and apparatus for generating or selecting bookmarks for a multimedia presentation. These bookmarks may be encoded with the multimedia presentation into a multimedia container, which may then be transmitted to an end-user device. Also, these bookmarks may be made available to an end-user device over the Internet. Each bookmark may demarcate a content event and may comprise semantic information for that content event. Bookmarks may be generated automatically (e.g., by performing a media-analysis process) reviewed by a human. The end-user device may use the bookmarks to perform a trick play (e.g., semantic trick play) on the multimedia presentation.
摘要:
A method (100) and apparatus (300) for displaying operational information about an electronic device, that determines a change of an operational status of the electronic device, maps the operational status to at least one of an appearance characteristic and an action of an avatar (205, 210, 215, 220) related to the operational status changes the at least one of the appearance characteristic and action of the avatar in a manner related to the change of the operational status, and presents the avatar on a display (345) of the electronic device.
摘要:
A method and apparatus for providing communication between a sending terminal and one or more receiving terminals in a communication network. The media content of a signal transmitted by the sending terminal is detected and one or more of a voice stream, an avatar control parameter stream and a video stream are generated from the media content. At least one of the voice stream, the avatar control parameter stream and the video stream are selected as an output to be transmitted to the receiving terminal. The selection may be based on user preference, channel capacity, terminal capabilities or the load status of a network server performing the selection. The network server may be operable to generate synthetic video from the voice input, a natural video input and/or incoming avatar control parameters.
摘要:
Systems and methods are provided for presenting content to a user. An exemplary method involves establishing a relationship between a first device and the user, wherein, based on the relationship, one or more instances of secondary content are automatically excluded from display by the first device while primary content is displayed by the first device. The method continues by presenting an instance of secondary content to the user in a manner that is influenced by the relationship.
摘要:
A method and system for collaborative communications is described. In one embodiment, a central virtual reality communications environment is created. A plurality of client communication devices are connected to the central virtual reality communications environment. Each one of the connected plurality of client communication devices are represented as an avatar present in the central virtual reality communications environment. An uploaded data object is received from any one of the connected plurality of client communication devices. Finally, the data object is displayed in the central virtual reality communications environment to the connected plurality of client communication devices.
摘要:
Disclosed is a method of associating, at a secondary device, secondary media content with primary media content being output at a primary device. The method includes receiving, at the secondary device, first information based upon the primary content being output at the primary device, wherein the first information includes at least one of an audio and a visual signal, determining at the secondary device second information corresponding to the first information, receiving at the secondary device one or more portions of secondary media content that have been made available by a third device, determining at the secondary device whether one or more of the portions of the secondary media content match one or more portions of the second information, and taking at least one further action upon determining that there is a match.
摘要:
Disclosed are techniques that allow the user of a mobile device to select an avatar within a virtual world presented on the display screen of the mobile device. In some embodiments, a user manipulates a thumbwheel. As the thumbwheel is turned, the avatars on the display screen are highlighted one after another. The user then presses a thumbwheel button to select a desired avatar. Some embodiments allow the user to select more than one avatar at a time. Several highlighting techniques are available. In some embodiments, the user uses speech commands instead of a thumbwheel to highlight the avatars one by one. Speech input is also used to select one or more avatars. Some devices support a touch-screen interface. Embodiments for these devices allow the user to select an avatar by, for example, drawing an arc enclosing the avatar.
摘要:
A method and apparatus for collaborative design of a graphical structure is by users of a network. First, a description of the graphical structure is downloaded from a network server to client devices of the users. User modifications of the graphical structure are then uploaded to the network sever from the client devices. The modifications from multiple users are aggregated to produce an aggregated modification, which is then used to update the graphical structure. A description of the modifications may be a text-based description, in which case it is mapped to a numerical description of the modifications. Alternatively, the descriptions of the modifications may be numerical values. The modifications from a plurality of users (received during a specified time period) may be aggregated by calculating a statistical measure of numerical values corresponding to the modifications. The graphical structure may be an avatar, for example.
摘要:
A method for head pose estimation may include receiving block motion vectors for a frame of video from a block motion estimator, selecting at least one block for analysis, determining an average motion vector for the at least one selected block, combining the average motion vectors over time (all past frames of video) to determine an accumulated average motion vector, estimating the orientation of a user's head in the video frame based on the accumulated average motion vector, and outputting at least one parameter indicative of the estimated orientation.