摘要:
Described is a technique for viewing a document page on a small display such as a mobile phone or PDA. The page can come from a scanned document (bitmap image) or an electronic document (text and graphics data plus metadata). The page with text and graphics is segmented into regions. For each region, a scale-distortion function is constructed based on image analysis. During interactive viewing of the document, as the user navigates by moving the viewport around the page, the zoom factor will be automatically adjusted by optimizing the scale-distortion functions of the regions in the viewport.
摘要:
An automated system and method for producing videos using expert video production rules. The automated video production system and method of the present invention is particularly well-suited for the online publishing of lectures. The system of the present invention includes a camera system for capturing the lecture, such as an audience-tracking camera and a non-intrusive lecturer-tracking camera. Moreover, each camera may be controlled by a cinematographer. Tracking of a subject is performed using a history-based, reduced-motion tracker that sets up a camera shot based on the subject's movement history and leaves the shot fixed until the camera is switched. The system also includes a virtual director module having a probabilistic finite state machine (FSM) module that uses probabilistic rules and the expert video production rules to determine a current camera view.
摘要:
Embodiments of the present invention introduce a user navigation interface that allows a user to monitor/navigate video streams captured from multiple cameras. It integrates video streams from multiple cameras with the semantic layout into a 3-D immersive environment and renders the video streams in multiple displays on a user navigation interface. It conveys the spatial distribution of the cameras as well as their fields of view and allows a user to navigate freely or switch among preset views. This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
A system and method for optimizing the visual fidelity of a presentation for a plurality of audience members and a plurality of display devices, comprising: modeling the quality of view available to the plurality of audience members based on: one or more properties of the display devices, a distribution of the display devices, a distribution of the plurality of audience members, and the visual presentation wherein the visual presentation comprises one or more h-slides; and determining an optimal mapping for the one or more h-slides to the plurality of display devices based on the modeling.
摘要:
An algorithm for finding regions of interest (ROI) in synthetic images based on an information driven approach in which sub-blocks of a set of synthetic image are analyzed for information content or compressibility based on textural and color features. A DCT may be used to analyze the textural features of a set of images and a color histogram may be used to analyze the color features of the set of images. Sub-blocks of low compressibility are grouped into ROIs using a type of morphological technique. Unlike other algorithms that are geared for highly specific types of ROI (e.g. OCR text detection), the method of the present invention is generally applicable to arbitrary synthetic images. The present invention can be used with several other image applications, including Stained-Glass collages presentations.
摘要:
Systems and methods in accordance with embodiments of the present invention can include a convertible podium having a compact and lightweight design that can provide multiple functionalities by converting its form. A system in accordance with one embodiment of the present invention can convert from an interactive podium to other presentation devices including (but not limited to) an imaging device, a remote avatar for a presenter, an interactive whiteboard, and an information board. The system includes one or more configurable controls for controlling one or both of a presentation and a presentation environment.
摘要:
Systems and methods directed to fine-grained interaction with ordinary markerless paper documents and projectors at flexible poses in 3D space. Systems and methods allow for the projection of images onto non-flat variable surfaces by utilizing depth detection techniques in order to project an image or video properly onto a variable surface.
摘要:
Systems and methods provide for determining a location and size of a visual link to digital media on physical media such as a paper document. An authoring tool for creating a link on a paper document, such as an Embedded Media Marker (EMM), identifies and scores other EMMs and related keypoints on the document to determine similarities between a newly-created EMM and other EMMs and keypoints on the paper document. The scores are visualized for a user on a display in order to position and size the newly-created EMM in a location on the paper document that will avoid confusion with other EMMs and related content. The location and size of the newly-created EMM may be automatically adjusted based on the scoring of the keypoints and related EMMs.
摘要:
Systems and methods directed to fine-grained interaction with ordinary markerless paper documents and projectors at flexible poses in 3D space. Systems and methods allow for the projection of images onto non-flat variable surfaces by utilizing depth detection techniques in order to project an image or video properly onto a variable surface.
摘要:
A peer-to-peer modular presentation system is disclosed for providing improved presentations. The system includes a plurality of displays. Each display is associated with an input device, processor and input/output port. The plurality of displays may include static or mobile wherein each display is aware of the displays immediately surrounding it or of the displays in a remote site during a teleconference. A propagation direction may be configured for at least two of the displays in the system. Propagation may be initiated for multiple displays by generating a gesture input at one display. The gesture input will cause content to be presented in the particular display, and initiate propagation of content along the configured direction in neighboring displays. Content may be fed to a particular display through a stack or other memory that is associated with the display.