Abstract:
A method and apparatus for modifying, or morphing, a video image. A video feed is received which include an image of a person's face. A number of facial features are detected from the video feed. One or more of those facial features are selected and modified, with the modification being applied to the video feed to produce a morphed image. For example, the morphing can produce fun effects such as oversized or too small eyes, lips, etc.
Abstract:
A low cost camera by implementing the major functions in host software is provided. This is accomplished by sending raw, digitized data from the camera directly to the host. The increased volume of raw data is handled by either an improved compression/decompression scheme using lossless compression, using lossy compression or using a shared bus with higher bandwidth. By moving such functions as color processing and scaling to the host, the pixel correction can also be moved to the host. This in turn allows the elimination of the frame buffer memory from the camera. Finally, the camera can use a low cost lens by implementing vignetting, distortion, gamma or aliasing correction with a correction value stored in a register of the camera for later access by the host to perform corrections.
Abstract:
The present invention provides a method, and corresponding apparatus, for use of emoticons in IM applications by using sensory information captured by a device. Such information can include video, still image, and/or audio information. In one embodiment, based on a trigger to the system, multimedia input is captured, and relevant features are extracted from it. The extracted information is interpreted, and the interpreted information is mapped onto one or more specific pre-existing emoticons. These specific emoticons are then inserted into the IM communication via an IM API. In another aspect of the present invention, new emoticons are created based on the multimedia information captured. This can include generation of realistic emoticons based on the expressions on the user's face. Animated emoticons can also be created.
Abstract:
The present invention provides a method and system for reliable and accurate presence/status management and identity detection in Instant Messaging (IM) applications by using video, still image, and/or audio information. In one embodiment, a device such as a camera captures still image, video, and/or audio data. Relevant information is then extracted from the captured data and analyzed. Known techniques such as face recognition, face tracking, and motion detection, can be sued for extracting and analyzing data. This information is then interpreted for the IM application, and provided to an Application Programs Interface (API) for the IM application. The API can use the information for various purposes, including updating the status of the user (e.g., available, busy, on the phone, away from desk, etc.) and updating the identity of the user.
Abstract:
A video camera includes a light collection array including a plurality of light collection cells configured to collect light from a scene and a processor coupled to the light collection array. The processor is configured determine a brightness of the scene based on the light collected by the light collection array from the scene. Based on the determined brightness, the processor is configured to determine if the brightness of the scene requires a light collection time of each cell to be less than a flicker on time of a light source lighting the scene so that the light collection array collects a sufficient amount of light so that a brightness of a video stream or a still image is substantially at a predetermined level. If the light collection time is determined to be less than the flicker on time to maintain brightness of the video stream or the still image at the predetermined level and if the brightness of the scene is less than a predetermined brightness, the processor is configured to set the light collection time at the flicker on time of the light source. If the light collection time is determined to be less than the flicker on time to maintain brightness of the video stream or the still image at the predetermined level and if the brightness of the scene is at, or greater than, the predetermined brightness, the processor is configured to set the light collection time less than the flicker on time of the light source.
Abstract:
A system and method for mechanically panning, tilting, and/or zooming a webcam to track a user's face. In one embodiment, such movement is controlled by kernel software in a host to which the webcam sends video data. In this way, a driver in the host kernel handles the face tracking, transparent to the application programs that would access the video. In an alternate embodiment, such movement is controlled by firmware in the webcam itself The video and control signals are sent over the same standard cable, such as a USB bus. In one embodiment, the video is supplied to an instant messaging application. The use of a standard bus (e.g., USB) and the offloading of the face tracking to the webcam and driver allows easy use by the instant messaging application.
Abstract:
A method and apparatus for modifying, or morphing, a video image. A video feed is received which include an image of a person's face. A number of facial features are detected from the video feed. One or more of those facial features are selected and modified, with the modification being applied to the video feed to produce a morphed image. For example, the morphing can produce fun effects such as oversized or too small eyes, lips, etc.
Abstract:
A single universal audio processing system intelligently and transparently processes audio streams in real-time. The system receives audio input from one or more sources, determines how the streams should be processed, and automatically processes them in real-time for delivery to an output system. The processing happens without any intervention from the output system, which is oblivious to this processing. A set of audio processing algorithms to accomplish acoustic echo cancellation (AEC), resampling, format conversion, channel mixing or any other desired audio processing function can be supported by a universal processing system, providing a universal solution to audio processing regardless of source or sink. In one embodiment, processing functionality is implemented in an upper filter driver created using a “framework” or software architecture that implements a conventional WDM filter and a dedicated environment for audio processing.
Abstract:
A system and method for mechanically panning, tilting, and/or zooming a webcam to track a user's face. In one embodiment, such movement is controlled by kernel software in a host to which the webcam sends video data. In this way, a driver in the host kernel handles the face tracking, transparent to the application programs that would access the video. In an alternate embodiment, such movement is controlled by firmware in the webcam itself. The video and control signals are sent over the same standard cable, such as a USB bus. In one embodiment, the video is supplied to an instant messaging application. The use of a standard bus (e.g., USB) and the offloading of the face tracking to the webcam and driver allows easy use by the instant messaging application.
Abstract:
A camera comprises a camera support; a camera housing rotatably disposed on the camera support for pan movement relative to the camera support; a camera lens rotatably disposed in the camera housing for tilt movement relative to the camera housing; a pan motor coupled with the camera housing to drive the camera housing in pan movement relative to the camera support; and a tilt motor coupled with the camera lens to drive the camera lens in tilt movement relative to the camera housing.