摘要:
A method and apparatus are provided for video bit stream extension by video information annotation. In one embodiment, the invention may include gathering video data from a video source, gathering non-visual video information associated with the video data, maintaining a current state of the video information in storage, and annotating the video data with the current state of the video information.
摘要:
A method and apparatus are provided for annotating video and audio media with supplementary content for post video processing. In one embodiment, the invention may include maintaining a current state of auxiliary information regarding a sequence of video frames, the sequence of video frames being encoded as a video bit stream having video frame data for each respective video frame of the sequence of video frames. It may further include comparing the current state of auxiliary information with auxiliary information regarding a current video frame of the sequence of video frames to determine differential information, and annotating the differential information to the video bit stream as an annotation to the video frame data for the current video frame.
摘要:
A method and apparatus are provided for annotating video and audio media with supplementary content for post video processing. In one embodiment, the invention may include maintaining a current state of auxiliary information regarding a sequence of video frames, the sequence of video frames being encoded as a video bit stream having video frame data for each respective video frame of the sequence of video frames. It may further include comparing the current state of auxiliary information with auxiliary information regarding a current video frame of the sequence of video frames to determine differential information, and annotating the differential information to the video bit stream as an annotation to the video frame data for the current video frame.
摘要:
A method and apparatus are provided for annotating video and audio media with supplementary content for post video processing. The method includes the steps of accepting video data from a video source and storing video information associated with the video data as the video data is being accepted. Then, the video information may be appended to the video data for later use in the form of annotations, for example.
摘要:
A method for processing image data includes quantizing a region in a frame with an initial quantizer level. It is determined whether an amount of bits required for encoding the region after quantizing the region with the initial quantizer level is within a bit allocation budget. The region is re-quantized if the amount of bits is not within the bit allocation budget.
摘要:
Gesture recognition in which timing data is used to temporally segment video data into video clips. The timing data can be beat data extracted from an audio signal.
摘要:
A gesture recognition process includes tracking an object in two frames of video, determining differences between a location of the object in one frame of the video and a location of the object in another frame of the video, obtaining a direction of motion of the object based on the differences, and recognizing a gesture of the object based, at least in part, on the direction of motion of the object.
摘要:
A method and system for supporting personal computing in a public computing infrastructure. The system includes a plurality of computers to be used by patrons of the public computing infrastructure. The system includes a server coupled to the plurality of computers via a network connection. Each of the plurality of computers includes a virtual machine monitor, which includes a plurality of base virtual machine images. Each of the base virtual machine images is customized for a particular hardware and software configuration representing a specific computing environment. The virtual machine monitor launches one of the plurality of base virtual machine images, arbitrates access to system resources via the launched virtual machine image, stores the changes in the state of the virtual machine image when a user terminates a session, and returns a computer to an appropriate state to enable the user to resume the terminated session in subsequent sessions.
摘要:
A source model in combination with an interest structure is provided to generate a quantization value for use in encoding a video signal. The interest structure is generated from a region of interest manually identified by a user viewing the video on an interactive user display or automatically by a system which recognizes the regions of interest automatically. The region of interest in the video signal is encoded using a quantization value calculated from the interest structure in combination with the source model, and the region of interest is encoded at a higher resolution level than surrounding regions.
摘要:
Passively tracking a user's eye gaze while the user is browsing a web page and modifying the presentation of the web page to the user based on the tracked gaze. By combining historical information about a user's direction of gaze on individual cached web pages, a browser may be enabled to represent regions of a web page that have been previously glanced at by the user in a modified manner. For example, sections of a web page that a user has previously read or viewed may be represented in a changed form, such as in a different color, brightness, or contrast, for example. In one embodiment, the portions of the web page previously viewed by the user may be represented as “grayed out” so as to be unobtrusive.