摘要:
Systems and methods for repeat photography and difference extraction that help users take pictures from the same position and camera angle as earlier photos. The system automatically extracts differences between the photos. Camera poses are estimated and then indicators are rendered to show the desired camera angle, which guide the user to the same camera angle for repeat photography. Using 3D rendering techniques, photos are virtually projected onto a 3D model to adjust them and improve the match between the photos, and the difference between the two photos are detected and highlighted. Highlighting the detected differences helps users to notice the differences.
摘要:
Systems and methods are described for creating virtual models, primarily through actions taken in actual 3D physical space. For many applications, such systems are more natural to users and may provide a greater sense of reality than can be achieved by editing a virtual model at a computer display, which requires the use of manipulations of a 2D display to effect 3D changes. Actions are taken (markup is drawn or laid out, etc.) in a physical workspace. Such physical workspaces may in fact be identical to the space being modeled, small physical scale models of the space, or even a whiteboard or set of papers or objects which get mapped onto the space being modeled.
摘要:
Systems and methods for repeat photography and difference extraction that help users take pictures from the same position and camera angle as earlier photos. The system automatically extracts differences between the photos. Camera poses are estimated and then indicators are rendered to show the desired camera angle, which guide the user to the same camera angle for repeat photography. Using 3D rendering techniques, photos are virtually projected onto a 3D model to adjust them and improve the match between the photos, and the difference between the two photos are detected and highlighted. Highlighting the detected differences helps users to notice the differences.
摘要:
A method and system for defining a model by analyzing images of a physical space. In some embodiments the images of a physical space contain distinctive visual features with associated semantic information, and the model is defined using image feature detection techniques to identify distinctive visual features and a rich marker-based markup language to give meaning to the distinctive visual features. In some embodiments the distinctive visual features are predefined markers, and the markup language specifies model aspects and rules for combining semantic information from a plurality of markers to define the model.
摘要:
A method and system for defining a model by analyzing images of a physical space. In some embodiments the images of a physical space contain distinctive visual features with associated semantic information, and the model is defined using image feature detection techniques to identify distinctive visual features and a rich marker-based markup language to give meaning to the distinctive visual features. In some embodiments the distinctive visual features are predefined markers, and the markup language specifies model aspects and rules for combining semantic information from a plurality of markers to define the model.
摘要:
Systems and methods provide for gesture-based creation of interactive hotspots in a real world environment. A gesture made by a user in a three-dimensional space in the real world environment is detected by a motion capture device such as a camera, and the gesture is then identified and interpreted to create a “hotspot,” which is a region in three-dimensional space through which a user interacts with a computer system. The gesture may indicate that the hotspot is anchored to the real world environment or anchored to an object in the real world environment. The functionality of the hotspot is defined in order to identify the type of gesture which will initiate the hotspot and associate the activation of the hotspot with an activity in the system, such as control of an application on a computer or an electronic device connected with the system.
摘要:
Systems and methods for determining the location of a microphone by using sounds played from loudspeakers at known locations. Systems and methods may thereby require a minimal level of infrastructure, using sounds that would naturally be played in the environment. Systems and methods may thereby allow devices such as smart-phones, tablets, laptops or portable microphones to determine their location in indoor settings, where Global Positioning Satellite (GPS) systems may not work reliably.
摘要:
Embodiments of the present invention introduce a user navigation interface that allows a user to monitor/navigate video streams captured from multiple cameras. It integrates video streams from multiple cameras with the semantic layout into a 3-D immersive environment and renders the video streams in multiple displays on a user navigation interface. It conveys the spatial distribution of the cameras as well as their fields of view and allows a user to navigate freely or switch among preset views. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Embodiments of the present invention introduce a novel technique to analyze and monitor video streams captured from multiple cameras. It highlights the foreground region of the video streams via local alpha blending and displays the videos in an immersive 3-D environment. The spatial arrangement of the displays can be generated by multi-dimensional scaling of the amount of simultaneous motion across different video streams. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
Video recording technology is utilized to enable business process investigation in an unobtrusive manner. Several cameras are situated, each having a defined field of view. For each camera, a region of interest (ROI) within the field of view is defined, and a background image is determined for each ROI. Motion within the ROI is detected by comparing each frame to the background image. The video recording can then be segmented and indexed according to the motion detection.