摘要:
A method and apparatus for tracking an object using one or more video cameras together with a plurality of microphones and/or geophones, whereby accurate tracking of the object can be obtained even when the object changes directions as a result of an impact. A tennis ball may be tracked during a tennis match. Prior to and after an impact, the ball is tracked with the use of video cameras, illustratively in accordance with the method disclosed in U.S. Pat. No. 6,233,007. Then, the position of the ball at racket or ground impact is accurately determined by measuring the acoustical waves generated by the ball hitting the racket and/or either the acoustic waves or the elastic (Rayleigh) waves generated by the ball hitting the ground, respectively. Tracking then advantageously continues with use of the video cameras.
摘要:
A performance data mining system combines detailed sensor analysis data with other data sources to discover interesting patterns/rules for performance and utilizes real time sensor analysis to dynamically derive mining results in real time during an event. The system automatically generates advice/strategy and predictions based on specified criteria.
摘要:
Techniques for intelligent and automatic control of sensors for capturing data associated with real time events. Preferably, the sensors are associated with a multimedia database system. For example, a technique for controlling one or more sensors used to capture data associated with an event comprises the following steps/operations. First, sensor data captured in accordance with the event is processed. Then, the one or more sensors are automatically controlled based on information pertaining to the continual activity of at least one of one or more objects and one or more persons associated with the event in real time obtained using at least a portion of the processed data captured in accordance with the event.
摘要:
Techniques for indexing multimedia data substantially concurrently or contemporaneously with its capture to convert a real world event into an accessible database in real time are provided. The present invention introduces a new paradigm of converting a real world event in real time into a rich multimedia database by processing data from multiple sensors observing the event. Real time analysis of the sensor data, coupled with domain knowledge, results in instant indexing of multimedia data at capture time itself. This yields the semantic information to answer complex queries about the content, and the ability to extract portions of data that correspond to complex actions performed in the real world.
摘要:
Techniques for retrieval of multimedia data through visual representations are provided. Such visual representations, preferably in the form of visual activity maps or spatio-temporal activity maps, serve as an efficient and intuitive graphical user interface for multimedia retrieval, particularly when the media streams are derived from multiple sensors observing a physical environment. An architecture for interactive media retrieval is also provided by combining such visual activity maps with domain specific event information. Visual activity maps are derived from the trajectories of motion of objects in the environment. The visual activity map based techniques significantly help users in quickly and effectively discovering interesting portions of the data, and randomly accessing and retrieving the corresponding portions of the media streams.
摘要:
Techniques for computing a globally consistent set of image feature correspondences across a wide range of viewpoints suitable for interactive walkthroughs and visualizations. The inventive approach takes advantage of the redundancy inherent in a dense set of images captured in a plane (or in higher dimensions, e.g., images captured in a volume, images captured over time, etc). The technique may detect features in a set of source images and track the features to neighboring images. When features track to the same position in the same image, they are flagged as potential correspondences. Among the potential correspondences, the technique selects the maximal set using a greedy graph-labeling algorithm (e.g., best-first order). Only correspondences that produce a globally consistent labeling are selected. After globalization is done, a set of features common to a group of images can be quickly found and used to warp and combine the images to produce an interpolated novel view of the environment.
摘要:
Techniques for computing error-bounded position and orientation (pose) of a panoramic camera in real-world environments. Such environments may include large interior spaces (e.g., buildings) A space may include multiple rooms. For example, a technique for capturing images associated with an environment includes the following steps/operations. First, respective placements of fiducials in the environment are determined so as to satisfy at least one constraint. Images are captured, with an image capture device (e.g., camera), associated with the environment with the fiducials placed therein. A pose estimation of the image capture device is then determined based on projections of the fiducials in the captured images. The pose estimation may be optimized so as to obtain an optimal pose per image. Also, the fiducial placements may be optimized so as to obtain optimal fiducial placements. Then at least one constraint may include a constraint associated with the number of visible fiducials, a constraint associated with a distance from a viewpoint to a fiducial, and/or a constraint associated with an angle subtended by pairs of fiducials.
摘要:
Techniques for compressing and decompressing images captured from viewpoints throughout an n-dimensional space. As opposed to conventional approaches, these techniques exploit image coherence over the entire n-dimensional space, providing quick access to images along arbitrary contiguous viewpoint paths during computer graphics applications. This is accomplished by combining a hierarchical data structure with a compression methodology. Such techniques provide for generating and arranging reference images and residual images into a spatial image hierarchy such as a binary tree structure. Images and residuals may be encoded using intra-image compression. A captured image is extracted via a sequence of image warping and addition operations.
摘要:
Techniques for creating interactive walkthroughs or visualizations of real-world environments. Such environments for which interactive walkthroughs may be created may comprise large interior spaces (e.g., buildings), wherein a space may comprise multiple rooms. The interactive walkthroughs may be photorealistic and rendered from arbitrary viewpoints at interactive rates. In one aspect of the invention, a technique for generating a visualization representative of an environment comprises the following steps. A dense set of image data is captured from the environment. The captured image data is compressed for storage. A portion of the compressed image data may then be decompressed so as to render a visualization representative of the environment from a selected novel viewpoint. Another portion of the image data may be prefetched and decompressed such that captured image data surrounding the selected novel viewpoint is available to be rendered. The visualization of the environment may then be rendered from the selected novel viewpoint.