摘要:
According to one set of embodiments, techniques are provided for performing actions based upon physical locations of one or more paper documents. According to another set of embodiments, techniques are provided for tracking the physical locations of paper documents. According to another set of embodiments, techniques are provided for determining electronic document information for paper documents. According to another set of embodiments, techniques are provided for determining and tracking the contents of a container. According to another set of embodiments, a document security system is provided. According to another set of embodiments, techniques are provided for tracking documents in a workflow.
摘要:
The present invention relates to systems and methods for analyzing media material having a layout. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body text in the media material. The article composer determines which of the identified block segments belong to one or more articles in the media material. The article composer can determine whether candidate block segments belong to a same article based on language statistics information, layout transition information, or both language statistics information and layout transition information. A system for searching media material having a layout over a network is also provided.
摘要:
The present invention relates to systems and methods for analyzing media material having a layout. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body text in the media material. The article composer determines which of the identified block segments belong to one or more articles in the media material. The article composer can determine whether candidate block segments belong to a same article based on language statistics information, layout transition information, or both language statistics information and layout transition information. A system for searching media material having a layout over a network is also provided.
摘要:
Techniques for capturing information during multimedia presentations. According to an embodiment, the presentation recording appliance (PRA) receives multimedia presentation information comprising video information and/or audio information. The PRA may also receive information from external sources other than the first source. The audio and video information received by the PRA is then processed and stored in a format which facilitates subsequent retrieval.
摘要:
Document monitoring provides a measure of document security. Documents incorporating radio frequency identification (RFID) tags can be monitored by appropriate interrogation components for movement activity. A surface suitable for placement of documents is configured for monitoring RFID tagged documents. Such documents can be monitored in a document processing device to control access to the document processing functions.
摘要:
A workflow system and method include tracking the physical movement of documents. The information of the physical movement is incorporated with the flow graph of a workflow. A display of the workflow can then be enhanced by the information relating to the physical movement of the document.
摘要:
A method and an apparatus for simultaneous highlighting of a paper document is described. The device includes a highlighter. A scanner is used and configured to capture at least one highlighted mark place on a paper document. The scanner is coupled to a memory for storing electronic versions of documents. An electronic document is accessed when a portion of the electronic document matches a portion of the paper document.
摘要:
A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types (e.g., printed paper as a first medium and digital content and/or web link as a second medium). In one particular embodiment, the MMR system includes a content-based retrieval database configured with an index table to represent two-dimensional geometric relationships between objects extracted from a printed document in a way that allows look-up using a text-based index. A ranked set of document, page and location hypotheses can be computed given data from the index table. The techniques effectively transform features detected in an image patch into textual terms (or other searchable features) that represent both the features themselves and the geometric relationship between them. A storage facility can be used to store additional characteristics about each document image patch.
摘要:
Various aspects can be implemented for determining optical character recognition (OCR) parameters using an OCR engine. In general, one aspect can be a method that includes using an optical character recognition (OCR) engine in a base configuration to generate one or more OCR responses corresponding to one or more sample pages of a document. The method also includes identifying a dominant OCR parameter for the document based on the one or more generated OCR responses. Other implementations of this aspect include corresponding systems, apparatus, and computer program products.
摘要:
A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.