摘要:
An attention detection system detects a condition of shared attention of plural persons. The system includes plural body language detectors each associated with a different person for detecting body language of the associated person. An analyzer receives body language information from the body language detectors, analyzes the body language of the persons and determines when said body language information indicates shared attention between the persons. The attention detection system generates a signal that captures an image of the shared attention.
摘要:
Automatic view generation, such as rostrum view generation, may be used beneficially for viewing of still or video images on low resolution display devices such as televisions or mobile. However, the generation of good quality automatic presentations such as rostrum presentations presently requires skilled manual intervention. By recording important parts of the picture at the time of capture time based on conscious and subconscious user actions at the time of capture, extra information may be derived from the capturing process which helps to guide or determine a suitable automatic view generation for presentation of the captured image.
摘要:
Various embodiments provide a system and method for producing an edited video signal from a plurality of cameras. Briefly described, one embodiment is a method that produces an edited video signal from a plurality of cameras wherein at least the imaging lens of each of the camera being held or worn by a respective one of a plurality of participants simultaneously present at a site, comprising receiving contemporaneous audio streams from at least two participants, selectively deriving from the audio streams a single audio output signal according to a predetermined first set of one or more criteria, receiving contemporaneous video streams from at least two of the cameras, and deriving from the video streams a single video output signal according to a predetermined second set of one or more criteria.
摘要:
Embodiments provide a system and method for generating images of a document with interaction of a primary user with the document in an interaction session. Briefly described, one embodiment comprises an image capture means adapted to capture an initial image of the document without interaction by a user and to subsequently capture at least one additional image of the document during an interaction session including interaction from the user during that session, and a processing means adapted to generate a data set representing the interaction session from the initial image and the additional image, the data set containing at least the initial image along with information indicative of the interaction of the user during the session obtained from the additional image.
摘要:
Systems and methods of producing video data and/or audio-photos from a static digital image are disclosed. One such method, among others, comprises receiving input from a user indicating sequentially, in real time, a plurality of regions of the static digital image. The method also includes processing the user input to determine the visual content of each of a sequence of video frames and generating output data representative of the sequence of video frames. The sequence and composition of the video frames are determined such that the visual content of the video frames is taken from the static digital image. For each region of the static image indicated by the user, a video frame is composed such that the said region occupies a substantial part of the video frame. The sequence of video frames shows the regions indicated by the user in sequential correspondence with the sequence in which the user indicated the regions and substantially in pace with the time in which the user indicated the regions.
摘要:
An electronic image capture system for capturing a reduced noise image of a scene includes a detector array and an image processing apparatus. The detector array provides to the image processing apparatus data representing at least one image of a scene detected by the array. The image processing apparatus holds a noise model that characterizes the noise performance of the image capture system. Based on the image data and the noise model the image processing apparatus identifies one or more portions of the scene that are predicted to contribute disproportionately to visible noise in an image formed from said image data. Based on the identified portions of the scene and the noise model, the image processing apparatus determines an exposure pattern for the image capture system that is predicted to produce multiple exposures of the scene that are combinable to produce an image with a minimal predicted noise.
摘要:
One embodiment is a method for reviewing videos, comprising: deriving at least two video segments from unedited video footage based upon a previously determined unique saliency, each saliency associated with a corresponding one of the video segments; and displaying a display window for each of the derived video segments substantially concurrently.
摘要:
The present invention relates to an electronic image capture system for capturing an electronic image having reduced noise, and a method for capturing an electronic image having reduced noise, for example from a digital camera device. An electronic image capture system (1) for capturing a reduced noise image of a scene (2) comprises a detector array (8) and an image processing apparatus (10). The detector array (8) is arranged to provide to the image processing apparatus (10) data (18) representing at least one image (6) of a scene detected by the array (8), and the image processing apparatus holds a noise model that substantially characterises the noise performance of the image capture system (1). The image processing apparatus (10) is arranged to identify, using the image data (18) and the noise model, one or more portions of the scene (2) that would contribute disproportionately to visible noise in an image formed from said image data (18), and to select an exposure pattern on the basis that said selected exposure pattern will reduce the contribution to the visible noise when exposures from said selected exposure pattern are combined to form the reduced noise image.
摘要:
A method of generating an audio signal comprises receiving a plurality of input audio signals from a plurality of microphones forming a microphone array, the plurality of input audio signals being representative of a set of sound sources within the auditory field of view of the microphone array at a given instant in time; receiving a motion input signal from a motion sensor, the motion input signal being representative of the motion of the microphone array; and manipulating the received plurality of input audio signals in response to the received motion input signal to generate an audio output signal that is representative of a set of sound sources within the auditory field of view of a virtual microphone, the apparent motion of the virtual microphone being independent of the motion of the microphone array.
摘要:
An exemplary embodiment is a method of processing audio data comprising: characterising an audio data representative of a recorded sound scene into a set of sound sources occupying positions within a time and space reference frame; analysing the sound sources; and generating a modified audio data representing sound captured from at least one virtual microphone configured for moving about the recorded sound scene, wherein the virtual microphone is controlled in accordance with a result of the analysis of said audio data, to conduct a virtual tour of the recorded sound scene.