摘要:
Systems and methods for interactive, user-driven detection, creation and completion of form fields in a digital document are provided. A document with form fields that require completion by a user is received, after which form fields are detected at the direction of the user. Once the user selects a possible form field, the system creates the appropriate fillable form field based on size, type, location, related text and other parameters of the form field and surrounding document. Additional levels of interaction include predictive text, pattern development and automatic completion of previously completed fields.
摘要:
A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.
摘要:
A method for providing real-time feedback of an estimated quality of a captured final image, the method including obtaining a preliminary image, calculating a quality score of the preliminary image, and in response to the quality score of the preliminary image exceeding a threshold quality value, taking a first action.
摘要:
A system helps filter and correct video captured and streamed from a mobile device. In particular, the system detects and streams content shown on screens, allowing anyone to stream screen content immediately without needing to develop hooks into external software (i.e. without installing a screen recorder software in the computer). The system can use a variety of user-selectable techniques to detect the screen, and utilizes the mobile device's touchscreen to allow users to manually override detected corners. However, some of these approaches could potentially be applied to other types of content, such as identifying TV screens, appliance LCD screens, other mobile devices' screens, multifunction devices. (e.g. a remote technician could help troubleshoot a malfunctioning MFD by having the end-user point his cellphone to the LCD screen of the MFD).
摘要:
A method for navigating instructional video presentations is disclosed. The method includes determining a pause mode of a video presentation, and playing the video presentation on a display device. The video presentation has one or more predetermined pause positions. The method also includes, while playing the video presentation, determining that the video presentation has reached one of the one or more pause positions. The method further includes, in accordance with a determination that the video presentation is in a first pause mode, pausing the video presentation at the one of the one or more pause positions and maintaining a display of a paused frame of the video presentation, and, in accordance with a determination that the video presentation is in a second pause mode distinct from the first pause mode, continuing to play the video presentation through the one of the one or more pause positions.
摘要:
A method for navigating instructional video presentations is disclosed. The method includes determining a pause mode of a video presentation, and playing the video presentation on a display device. The video presentation has one or more predetermined pause positions. The method also includes, while playing the video presentation, determining that the video presentation has reached one of the one or more pause positions. The method further includes, in accordance with a determination that the video presentation is in a first pause mode, pausing the video presentation at the one of the one or more pause positions and maintaining a display of a paused frame of the video presentation, and, in accordance with a determination that the video presentation is in a second pause mode distinct from the first pause mode, continuing to play the video presentation through the one of the one or more pause positions.
摘要:
A system and a method for sharing web pages. In some embodiments, the following operations are preformed at a client computer system having one or more processors that execute one or more programs stored in memory of the client computer system. A representation of a web page that is displayed in a window of a web browser in a user interface of the client computer system is generated. The representation of the web page is partitioned into a plurality of tiles based on a document object model of the web page. For each tile in the plurality of tiles, it is determined whether the tile has changed relative to a previous version of the tile. In response to determining that the tile has changed, the tile that has changed is sent to a server.
摘要:
A system and method for identifying key frames of a presentation video that include stationary informational content. A sequence of frames is obtained from a presentation video and differences of pixel values between consecutive frames of the sequence of frames are computed. Sets of consecutive frames that are stationary are identified, wherein consecutive frames that are stationary have a proportion of changed pixel values below a first predetermined threshold, and wherein pixel values are deemed to be changed when the difference between the pixel values for corresponding pixels in consecutive frames exceeds a second predetermined threshold. Next, a set of key frames that include stationary informational content is retained. The set of key frames that include stationary informational content is then displayed for user interaction.
摘要:
Blogs (and other information sources) are recommended to a user based history of user's online activities. The system: (1) processes the user's web history, (2) identifies blog posts (and web pages) that link to pages read by the user, (3) generates multiple relevance scores for each identified post/page, and (4) produces multiple rankings of the corresponding source blogs (and web sites) by aggregating individual relevance scores (or combinations of relevance scores), according to users' preferences. The system allows the discovery of information sources that are likely to be interesting to the user and allows sources lost in the “long tail” to be seamlessly discovered.