Abstract:
A method for generating content links between a first digital file and a second digital file. The method detects a content feature of a first segment of the first digital file during playback of that segment, searches an index of content features for a plurality of segments, including a second segment of the second digital file, and dynamically generates a link between the first segment of the first digital file and the second segment of the second digital file when the content feature of the first segment is related to a content feature of the second segment.
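The lookup-and-link step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the `FeatureIndex` class, the `generate_links` function, the string-valued features, and the file names are all hypothetical, and "related" is simplified here to an exact feature match, which the abstract does not specify.

```python
from collections import defaultdict


class FeatureIndex:
    """Hypothetical inverted index: content feature -> segments that contain it."""

    def __init__(self):
        self._by_feature = defaultdict(set)

    def add(self, file_id, segment_id, features):
        # Register every feature of one segment of one file.
        for feature in features:
            self._by_feature[feature].add((file_id, segment_id))

    def lookup(self, feature):
        # Return all (file_id, segment_id) pairs indexed under this feature.
        return self._by_feature.get(feature, set())


def generate_links(index, playing_file, playing_segment, detected_features):
    """Link the segment being played to segments of other files.

    Assumption: two segments are "related" when they share an identical feature.
    """
    links = set()
    for feature in detected_features:
        for file_id, segment_id in index.lookup(feature):
            if file_id != playing_file:  # only link across different files
                links.add(((playing_file, playing_segment), (file_id, segment_id), feature))
    return sorted(links)


# Example with made-up file names and features.
index = FeatureIndex()
index.add("talk.mp4", 3, {"keyword:fourier"})
index.add("slides.pdf", 7, {"keyword:fourier", "keyword:fft"})
links = generate_links(index, "talk.mp4", 3, ["keyword:fourier"])
print(links)
```

A real system would likely replace the exact-match test with a similarity measure over the detected features; the inverted index is used here only because it keeps the search step a simple dictionary lookup.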
Abstract:
A system and method for interacting with a mobile device using an external display are disclosed, where a mobile device provides representations of an object and an action to an external display for a user to select and then input back to the mobile device. An application on the mobile device provides representation data of an object and an action to a host system driving the external display. The host system arranges the representation data into representations on the external display and allows the user to interact with them. The user then selects a representation and inputs the selected representation into the mobile device, where the application processes the representation to carry out the appropriate actions on the appropriate objects. The representation may be input to the mobile device using a photo of the representation taken by a mobile device camera, or by entering a code provided with each representation on the external display.
Abstract:
Systems and methods for addressing the problem of specific planar object search from a large-scale image database. Given a query image with a particular object and a planar surface, the systems and methods attempt to find, from a large image corpus, a set of representative images in which that object appears. According to the complexity of geometric transformations between a query image and its target images, the whole problem can be categorized into two classes: Rotation-Scale-Translation (RST)-transformed image search, and affine/homography-transformed image search.
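The difference between the two transformation classes can be illustrated with a short sketch. It assumes the standard 3x3 homogeneous-coordinate formulation, in which an RST transform has four free parameters (angle, scale, two translations) and a general homography has eight; the function names `rst_matrix` and `apply_homography` are illustrative and not taken from the source.

```python
import math


def rst_matrix(angle_rad, scale, tx, ty):
    """Build a 3x3 Rotation-Scale-Translation matrix (4 degrees of freedom)."""
    c = scale * math.cos(angle_rad)
    s = scale * math.sin(angle_rad)
    return [[c, -s, tx],
            [s, c, ty],
            [0.0, 0.0, 1.0]]


def apply_homography(h, point):
    """Apply a 3x3 planar transform to a 2D point, including the perspective divide.

    For RST and affine matrices the last row is [0, 0, 1], so w is always 1.
    """
    x, y = point
    u = h[0][0] * x + h[0][1] * y + h[0][2]
    v = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return (u / w, v / w)


# RST case: rotate by 90 degrees, scale by 2, shift by (1, 0).
rst = rst_matrix(math.pi / 2, 2.0, 1.0, 0.0)
rst_point = apply_homography(rst, (1.0, 0.0))

# Homography case: a non-trivial last row introduces perspective distortion.
perspective = [[1.0, 0.0, 0.0],
               [0.0, 1.0, 0.0],
               [0.5, 0.0, 1.0]]
persp_point = apply_homography(perspective, (2.0, 0.0))
print(rst_point, persp_point)
```

Because an RST transform preserves angles and length ratios, it constrains the match between query and target far more tightly than an affine or homography transform, which is one plausible reason to treat the two classes separately.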
Abstract:
Systems and methods provide for determining a location and size of a visual link to digital media on physical media such as a paper document. An authoring tool for creating a link on a paper document, such as an Embedded Media Marker (EMM), identifies and scores other EMMs and related keypoints on the document to determine similarities between a newly-created EMM and other EMMs and keypoints on the paper document. The scores are visualized for a user on a display in order to position and size the newly-created EMM in a location on the paper document that will avoid confusion with other EMMs and related content. The location and size of the newly-created EMM may be automatically adjusted based on the scoring of the keypoints and related EMMs.
Abstract:
Systems and methods provide for gesture-based creation of interactive hotspots in a real world environment. A gesture made by a user in a three-dimensional space in the real world environment is detected by a motion capture device such as a camera, and the gesture is then identified and interpreted to create a “hotspot,” which is a region in three-dimensional space through which a user interacts with a computer system. The gesture may indicate that the hotspot is anchored to the real world environment or anchored to an object in the real world environment. The functionality of the hotspot is defined in order to identify the type of gesture which will initiate the hotspot and associate the activation of the hotspot with an activity in the system, such as control of an application on a computer or an electronic device connected with the system.
Abstract:
A system for measuring pulse and respiratory rate from passive thermal video includes contour segmentation and tracking, clustering of informative pixels of interest, and robust estimation of the dominant frequency component. Contour segmentation is used to locate a blood vessel region to measure, after which all pixels in the nearby region are aligned across frames based on the segmentation's position and scale in each frame. Spatial filtering is then performed to remove noise not related to the heartbeat, and non-linear filtering is performed on the temporal signal corresponding to each aligned pixel. The signal spectrum of each pixel is then fed to a clustering algorithm for outlier removal. Pixels in the largest cluster are then used to vote for the dominant frequency, and the median of the dominant frequencies is output as the pulse rate.
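The final voting step can be sketched as below. This is a simplified illustration, not the disclosed system: it skips segmentation, alignment, filtering, and clustering, estimates each pixel's dominant frequency with a plain discrete Fourier transform, and takes the median across pixels. The function names, the 0.7 to 3.0 Hz search band, and the synthetic signals are all assumptions made for the example.

```python
import cmath
import math
from statistics import median


def dominant_frequency(signal, fps, f_min=0.7, f_max=3.0):
    """Return the frequency in Hz with the largest DFT magnitude inside [f_min, f_max].

    The band limits are assumed values for a plausible heart-rate range.
    """
    n = len(signal)
    mean = sum(signal) / n
    centered = [s - mean for s in signal]  # remove the DC component
    best_freq, best_mag = 0.0, -1.0
    for k in range(1, n // 2 + 1):
        freq = k * fps / n
        if not (f_min <= freq <= f_max):
            continue
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        if abs(coeff) > best_mag:
            best_freq, best_mag = freq, abs(coeff)
    return best_freq


def pulse_rate_bpm(pixel_signals, fps):
    """Median of the per-pixel dominant frequencies, converted to beats per minute."""
    freqs = [dominant_frequency(s, fps) for s in pixel_signals]
    return 60.0 * median(freqs)


# Synthetic data: three pixels oscillating at 1.2 Hz and one outlier at 2.0 Hz,
# sampled at 30 frames per second for 10 seconds.
fps = 30.0
n_frames = 300
signals = [[math.sin(2 * math.pi * f * t / fps) for t in range(n_frames)]
           for f in (1.2, 1.2, 1.2, 2.0)]
bpm = pulse_rate_bpm(signals, fps)
print(bpm)
```

In this toy input the majority of pixels oscillate at 1.2 Hz, which corresponds to 72 beats per minute, so the median is unaffected by the single outlier. The clustering stage described in the abstract would remove such outliers before the vote rather than relying on the median alone.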
Abstract:
A method for exchanging information in a shared interactive environment, comprising selecting a first physical device in a first live video image wherein the first physical device has information associated with it, causing the information to be transferred to a second physical device in a second live video image wherein the transfer is brought about by manipulating a visual representation of the information, wherein the manipulation includes interacting with the first live video image and the second live video image, wherein the first physical device and the second physical device are part of the shared interactive environment, and wherein the first physical device and the second physical device are not the same.
Abstract:
Embedded Media Markers (EMMs) are optical-filter-like overlaid marks that can be printed on part of a paper document to signify the existence of media associated with that part of the document. Users take a picture of an EMM-signified document patch using a cell phone camera, and the media associated with the EMM-signified document location is retrieved and displayed on the phone. Unlike bar codes, EMMs are nearly transparent and thus do not interfere with the document contents. Retrieval of media associated with an EMM is based on local image features of the captured EMM-signified document patch. A semi-automatic technique places an EMM at a location in a document in such a way that the EMM encompasses sufficient identification features with minimal disturbance to the original document.
Abstract:
Method, device, system and framework for enabling token- and point-level operations on language-independent paper documents through a camera phone interface. Image descriptors from snapshots of a document captured by the phone can be extracted by the phone itself and transmitted to a server. In another implementation, the descriptors are extracted by the receiving server. The server is connected to a database of high-quality images of the same document, and the matched high-quality patch is sent back to the phone for the user's viewing and manipulation. Modifications and annotations of the high-quality patch are transmitted to the database and stored. Motion detection is combined with image recognition to provide high-quality images of the regions of the document being viewed by sweeping the phone. Capabilities include web search, e-dictionary lookup, or keyword finding for words in paper documents, copy-paste operations, constructing photo collages from portions of printed photos, and playing dynamic contents of printed presentation slides on the display of the camera phone.