Abstract:
Systems and methods for determining the location of a microphone by using sounds played from loudspeakers at known locations. Because they can use sounds that would naturally be played in the environment, the systems and methods may require only a minimal level of infrastructure. They may thereby allow devices such as smartphones, tablets, laptops, or portable microphones to determine their location in indoor settings, where the Global Positioning System (GPS) may not work reliably.
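The abstract does not say how the position is computed. As a minimal sketch of one possible approach, assume the time of flight from each loudspeaker to the microphone has already been measured (which presumes the playback times are known) and solve the resulting range equations by linear least squares in 2-D. The names `locate_microphone` and `SPEED_OF_SOUND` and the 2-D setting are illustrative assumptions, not taken from the source.

```python
SPEED_OF_SOUND = 343.0  # metres per second; assumed room-temperature value


def locate_microphone(speakers, delays):
    """Estimate a 2-D microphone position from times of flight.

    speakers: list of (x, y) loudspeaker positions in metres (at least 3,
              not all on one line).
    delays:   time of flight in seconds from each loudspeaker (assumed known).
    """
    dists = [SPEED_OF_SOUND * t for t in delays]
    (x0, y0), d0 = speakers[0], dists[0]

    # Subtracting the first range equation from each of the others cancels
    # the quadratic terms and leaves a linear system A [x, y]^T = b.
    rows, rhs = [], []
    for (xi, yi), di in zip(speakers[1:], dists[1:]):
        rows.append((2.0 * (xi - x0), 2.0 * (yi - y0)))
        rhs.append(d0**2 - di**2 + xi**2 + yi**2 - x0**2 - y0**2)

    # Least-squares solution via the 2x2 normal equations (A^T A) p = A^T b.
    a11 = sum(r[0] * r[0] for r in rows)
    a12 = sum(r[0] * r[1] for r in rows)
    a22 = sum(r[1] * r[1] for r in rows)
    b1 = sum(r[0] * v for r, v in zip(rows, rhs))
    b2 = sum(r[1] * v for r, v in zip(rows, rhs))
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        raise ValueError("loudspeaker geometry is degenerate (collinear?)")
    return ((a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det)
```

In practice the delays would have to be estimated from the recorded audio, and a system that does not know the playback times would need to work with time differences between loudspeakers instead; both steps are outside this sketch.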
Abstract:
A system is provided for detecting an intersection between two or more panoramic video sequences and for detecting the orientation of the sequences forming the intersection. Video images and corresponding location data are received. If required, the images and location data are processed to ensure that the images contain location data. An intersection between two paths is then derived from the video images by deriving a rough intersection between two images, determining a neighborhood for the two images, and dividing each image in the neighborhood into strips. An identifying value is derived from each strip to create a row of strip values, which is then converted to the frequency domain. A distance measure is taken between strips in the frequency domain, and the intersection is determined from the images having the smallest distance measure between them. The orientation between the two paths may also be determined in the frequency domain, either by using the phases of the signals representing the images in the Fourier domain or by performing a circular cross-correlation of two vectors representing the images.
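The strip-based comparison described above can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the identifying value is taken to be the mean intensity of each vertical strip, the distance is the Euclidean distance between Fourier magnitude spectra (which are unchanged when a panorama is rotated), and the orientation is the shift that maximizes a circular cross-correlation. All function names are hypothetical.

```python
import cmath


def strip_values(image, n_strips):
    """Mean intensity of each vertical strip; image is a list of pixel rows."""
    step = len(image[0]) // n_strips
    values = []
    for s in range(n_strips):
        cols = range(s * step, (s + 1) * step)
        total = sum(row[c] for row in image for c in cols)
        values.append(total / (len(image) * step))
    return values


def dft_magnitudes(values):
    """Magnitudes of the discrete Fourier transform (naive O(n^2) version)."""
    n = len(values)
    return [
        abs(sum(v * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, v in enumerate(values)))
        for k in range(n)
    ]


def spectral_distance(a, b):
    """Euclidean distance between magnitude spectra of two strip rows."""
    ma, mb = dft_magnitudes(a), dft_magnitudes(b)
    return sum((x - y) ** 2 for x, y in zip(ma, mb)) ** 0.5


def circular_shift(a, b):
    """Shift s that maximizes sum over t of a[t] * b[(t + s) % n]."""
    n = len(a)
    return max(range(n),
               key=lambda s: sum(a[t] * b[(t + s) % n] for t in range(n)))
```

Under these assumptions, the pair of images with the smallest `spectral_distance` marks the intersection, and `circular_shift(a, b) * 360 / len(a)` gives the relative orientation in degrees for a full 360-degree panorama.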
Abstract:
An audio device management system (ADMS) manages remote audio devices via user selections in video links. The system enhances audio acquisition quality by receiving and processing human suggestions, forming customized two-way audio links according to user requests, and learning audio pickup strategies and camera management strategies from user operations. The ADMS control interface for a remote user is a multi-window GUI with an overview window and a selection display window. The ADMS gives users more flexibility to enhance audio signals according to their needs and makes it more convenient to form customized two-way audio links without requiring users to remember a list of phone numbers. When users monitor a structured audio environment without explicitly indicating their focus of attention in the video window, the ADMS also automatically manages the available microphones for audio pickup based on microphone sound quality and the system's past experience.
Abstract:
Embodiments of the present invention introduce a user navigation interface that allows a user to monitor and navigate video streams captured from multiple cameras. The interface integrates the video streams from multiple cameras with the semantic layout into a 3-D immersive environment and renders the video streams in multiple displays. It conveys the spatial distribution of the cameras as well as their fields of view and allows a user to navigate freely or switch among preset views. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
Abstract:
Embodiments of the present invention introduce a novel technique to analyze and monitor video streams captured from multiple cameras. The technique highlights the foreground regions of the video streams via local alpha blending and displays the videos in an immersive 3-D environment. The spatial arrangement of the displays can be generated by multi-dimensional scaling of the amount of simultaneous motion across different video streams. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
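To illustrate the foreground highlighting step, here is a minimal sketch of alpha blending restricted to a foreground mask. The abstract does not define "local alpha blending" in detail, so the interpretation here (blend a highlight color into foreground pixels only) is an assumption, as are the function name `highlight_foreground` and its parameters.

```python
def highlight_foreground(frame, mask, color=(255, 0, 0), alpha=0.5):
    """Blend `color` into the pixels where `mask` is True.

    frame: list of rows of (r, g, b) tuples.
    mask:  list of rows of booleans with the same shape (assumed to come
           from a separate foreground-detection step).
    alpha: weight of the highlight color, between 0 and 1.
    """
    out = []
    for row, mask_row in zip(frame, mask):
        out_row = []
        for pixel, is_foreground in zip(row, mask_row):
            if is_foreground:
                # Standard alpha compositing: alpha * overlay + (1 - alpha) * base.
                pixel = tuple(
                    round(alpha * c + (1.0 - alpha) * p)
                    for c, p in zip(color, pixel)
                )
            out_row.append(pixel)
        out.append(out_row)
    return out
```

Background pixels pass through unchanged, so only regions flagged as foreground stand out in the rendered display.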
Abstract:
Systems and methods for repeat photography and difference extraction that help users take pictures from the same position and camera angle as earlier photos. The system automatically extracts differences between the photos. Camera poses are estimated, and indicators are then rendered to show the desired camera angle, guiding the user to the same camera angle for repeat photography. Using 3D rendering techniques, photos are virtually projected onto a 3D model to adjust them and improve the match between them, and the differences between the two photos are detected and highlighted. Highlighting the detected differences helps users notice them.
Abstract:
Video recording technology is utilized to enable business process investigation in an unobtrusive manner. Several cameras are situated, each having a defined field of view. For each camera, a region of interest (ROI) within the field of view is defined, and a background image is determined for each ROI. Motion within the ROI is detected by comparing each frame to the background image. The video recording can then be segmented and indexed according to the motion detection.
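The motion detection and segmentation steps above can be sketched as follows. This is an illustration only: the thresholds, the grayscale frame representation, and the function names `roi_motion` and `motion_segments` are assumptions that do not come from the abstract.

```python
def roi_motion(frame, background, roi, pixel_thresh=25, frac_thresh=0.05):
    """Return True if enough ROI pixels differ from the background image.

    frame, background: lists of rows of grayscale values (0-255).
    roi: (top, left, bottom, right) with bottom and right exclusive.
    """
    top, left, bottom, right = roi
    changed = 0
    total = (bottom - top) * (right - left)
    for r in range(top, bottom):
        for c in range(left, right):
            if abs(frame[r][c] - background[r][c]) > pixel_thresh:
                changed += 1
    return changed / total > frac_thresh


def motion_segments(flags):
    """Convert per-frame motion flags into (start, end) ranges, end exclusive."""
    segments, start = [], None
    for i, moving in enumerate(flags):
        if moving and start is None:
            start = i                      # a motion segment begins
        elif not moving and start is not None:
            segments.append((start, i))    # the segment ends before frame i
            start = None
    if start is not None:
        segments.append((start, len(flags)))
    return segments
```

Running `roi_motion` on every frame of one camera and passing the resulting flags to `motion_segments` yields frame ranges that could serve as an index into the recording.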
Abstract:
Embodiments of the present invention describe a collaborative framework for mining surveillance videos to detect abnormal events, which introduces a two-stage training process to alleviate the problem of high false-alarm rates. In the first stage, unsupervised clustering is performed on the segments of the video streams, and a set of abnormal events is combined with user feedback to generate a clean training set. In the second stage, the clean training set is used to train a more precise model for the analysis of normal events, and the motion detection results from multiple cameras can be cross-validated and combined. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
Abstract:
Systems and methods are provided for reporting the status of a teleconference by determining an approximate delay time and providing a status signal in view of that delay time. The approximate delay time is roughly the time that elapses between a first time, when an occurrence is captured into an occurrence signal by a source unit, and a second time, when the occurrence is experienced after the occurrence signal has been received by at least one receiving unit.