Abstract:
Systems and methods directed to fine-grained interaction with ordinary markerless paper documents and projectors at flexible poses in 3D space. The systems and methods use depth detection techniques to project an image or video properly onto non-flat, variable surfaces.
Abstract:
An audio device management system (ADMS) manages remote audio devices via user selections in video links. The system enhances audio acquisition quality by receiving and processing human suggestions, forming customized two-way audio links according to user requests, and learning audio pickup strategies and camera management strategies from user operations. The ADMS control interface for a remote user is a multi-window GUI with an overview window and a selection display window. The ADMS gives users more flexibility to enhance audio signals according to their needs and makes it more convenient to form customized two-way audio links without requiring users to remember a list of phone numbers. The ADMS also automatically manages available microphones for audio pickup based on microphone sound quality and the system's past experience when users monitor a structured audio environment without explicitly expressing their attention in the video window.
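The automatic microphone management described above combines current sound quality with past experience. The following is a minimal sketch of one way such a choice could be made, not the ADMS method itself: the function name `pick_microphone`, the `weight` parameter, the linear blend, and the assumption that quality is an estimate in [0, 1] are all hypothetical.

```python
def pick_microphone(quality, past_picks, weight=0.7):
    """Return the id of the microphone with the best blended score.

    quality:    mic id -> current sound-quality estimate in [0, 1] (assumed scale)
    past_picks: mic id -> how often users selected that mic before (assumed log)
    weight:     share of the score given to current quality (hypothetical value)
    """
    # Avoid division by zero when there is no usage history yet.
    total = sum(past_picks.values()) or 1

    def score(mic):
        # Past experience is modeled as the fraction of earlier user selections.
        prior = past_picks.get(mic, 0) / total
        return weight * quality[mic] + (1 - weight) * prior

    return max(quality, key=score)


quality = {"podium": 0.6, "ceiling": 0.7}
history = {"podium": 9, "ceiling": 1}
chosen_with_history = pick_microphone(quality, history)
chosen_without_history = pick_microphone(quality, {})
```

With the history shown, the frequently chosen podium microphone outranks the ceiling microphone despite its slightly lower quality estimate; with no history, the choice falls back to quality alone.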
Abstract:
Embedded Media Markers (EMM) are optical-filter-like overlaid marks that can be printed on a part of a paper document that signify the existence of media associated with that part of the document. Users take a picture of an EMM-signified document patch using a cell phone camera, and the media associated with the EMM-signified document location is retrieved and displayed on the phone. Unlike bar codes, EMMs are nearly transparent and thus do not interfere with the document contents. Retrieval of media associated with an EMM is based on image local features of the captured EMM-signified document patch. A semi-automatic technique places an EMM at a location in a document in such a way that the EMM encompasses sufficient identification features with minimal disturbance to the original document.
Abstract:
Embedded Media Barcode Links (EMBLs) are optimally blended iconic barcode marks printed on paper documents that signify the existence of media associated with that part of the document. EMBLs are used for media retrieval with a camera phone. Users take a picture of an EMBL-signified document patch using a cell phone, and the media associated with the EMBL-signified document location is displayed on the phone. Unlike a traditional barcode, which requires an exclusive space, EMBLs are optimally blended with contents and thus interfere less with the original document layout and can be moved closer to a media-associated location. Unlike Embedded Media Markers (EMMs), which require underlying document features for marker identification, EMBLs place no requirement on underlying features. Moreover, EMBLs can take advantage of users' familiarity with traditional barcodes. Retrieval of media associated with an EMBL is based on the barcode identification of the captured EMBL.
Abstract:
Systems and methods for addressing the problem of specific planar object search from a large-scale image database. Given a query image with a particular object and a planar surface, the systems and methods attempt to find, from a large image corpus, a set of representative images in which that object appears. According to the complexity of geometric transformations between a query image and its target images, the whole problem can be categorized into two classes: Rotation-Scale-Translation (RST)-transformed image search, and affine/homography-transformed image search.
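The two transformation classes differ mainly in their degrees of freedom. As a minimal sketch, assuming 2D points mapped through 3x3 matrices in homogeneous coordinates: an RST transform has four parameters (rotation angle, uniform scale, two translations), while a general homography has eight and can introduce perspective distortion. The helper names `rst_matrix` and `apply_homography` are illustrative and do not come from the described systems.

```python
import math


def rst_matrix(theta, scale, tx, ty):
    """Build a 3x3 rotation-scale-translation matrix (4 degrees of freedom)."""
    c = scale * math.cos(theta)
    s = scale * math.sin(theta)
    return [[c, -s, tx],
            [s, c, ty],
            [0.0, 0.0, 1.0]]


def apply_homography(H, point):
    """Map a 2D point through a 3x3 matrix, dividing by the projective term."""
    x, y = point
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return (u / w, v / w)


# RST case: rotate by 90 degrees, double the size, shift by (1, 0).
H_rst = rst_matrix(math.pi / 2, 2.0, 1.0, 0.0)
rst_point = apply_homography(H_rst, (1.0, 0.0))

# Homography case: a non-trivial bottom row makes the mapping depend on x,
# which an RST matrix (bottom row fixed at 0, 0, 1) cannot express.
H_persp = [[1.0, 0.0, 0.0],
           [0.0, 1.0, 0.0],
           [0.5, 0.0, 1.0]]
persp_point = apply_homography(H_persp, (2.0, 4.0))
```

Because the RST matrix is a special case of the homography with a fixed bottom row, the same mapping routine serves both classes; only the estimation problem becomes harder as the parameter count grows.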
Abstract:
A system and method for authoring a media presentation including a media presentation environment representation having a portion defined as a hot spot associated with a media presentation device. Various embodiments include a hyper-slide listing portion, a media presentation authoring portion, and/or a media presentation device listing portion. Various embodiments include an integrated presentation authoring preview environment. The method includes selecting a physical device for a presentation unit in the media presentation environment, manipulating a visual representation of the presentation unit, recording a display of the presentation unit, and previewing the presentation in an augmented reality environment, a virtual reality environment, or both. Various embodiments operate with a plurality of types of media presentation devices and a plurality of each type of device.
Abstract:
Systems and methods provide for determining a location and size of a visual link to digital media on physical media such as a paper document. An authoring tool for creating a link on a paper document, such as an Embedded Media Marker (EMM), identifies and scores other EMMs and related keypoints on the document to determine similarities between a newly-created EMM and other EMMs and keypoints on the paper document. The scores are visualized for a user on a display in order to position and size the newly-created EMM in a location on the paper document that will avoid confusion with other EMMs and related content. The location and size of the newly-created EMM may be automatically adjusted based on the scoring of the keypoints and related EMMs.
Abstract:
Systems and methods provide for gesture-based creation of interactive hotspots in a real world environment. A gesture made by a user in a three-dimensional space in the real world environment is detected by a motion capture device such as a camera, and the gesture is then identified and interpreted to create a “hotspot,” which is a region in three-dimensional space through which a user interacts with a computer system. The gesture may indicate that the hotspot is anchored to the real world environment or anchored to an object in the real world environment. The functionality of the hotspot is defined in order to identify the type of gesture which will initiate the hotspot and associate the activation of the hotspot with an activity in the system, such as control of an application on a computer or an electronic device connected with the system.
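A hotspot of this kind can be modeled, as one simple assumption, by an axis-aligned box whose center is stored relative to an anchor position: a fixed anchor for a hotspot tied to the environment, or a tracked object position for a hotspot tied to an object. The `Hotspot` class, its fields, and the box shape below are hypothetical; the abstract does not specify a representation.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class Hotspot:
    offset: Vec3                # box center relative to the anchor
    half_size: Vec3             # half extents along x, y, z
    action: Callable[[], str]   # activity associated with activation

    def contains(self, point: Vec3, anchor: Vec3 = (0.0, 0.0, 0.0)) -> bool:
        """Return True if the point lies inside the box placed at anchor + offset."""
        return all(
            abs(p - (a + o)) <= h
            for p, a, o, h in zip(point, anchor, self.offset, self.half_size)
        )

    def activate_if_touched(self, point: Vec3,
                            anchor: Vec3 = (0.0, 0.0, 0.0)) -> Optional[str]:
        """Run the associated action only when the tracked point is inside."""
        return self.action() if self.contains(point, anchor) else None


# Hotspot hovering 0.5 units above its anchor, 0.2 units wide on each axis.
spot = Hotspot(offset=(0.0, 0.0, 0.5),
               half_size=(0.1, 0.1, 0.1),
               action=lambda: "toggle_lamp")

hand = (0.05, 0.0, 0.55)
hit_at_origin = spot.activate_if_touched(hand)
# When the anchoring object moves, the hotspot moves with it.
hit_after_move = spot.activate_if_touched(hand, anchor=(1.0, 0.0, 0.0))
```

Storing the offset rather than an absolute position lets the same structure cover both anchoring modes; only the source of the anchor coordinates changes.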