Abstract:
A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.
Abstract:
Systems and methods for performing task execution in a workflow are described. The system comprises at least one modular device (12, 42, 82, 91, 105, 112, 122, 142, 162, 212, 214, 222, 224, 232, 234, 362, 364, 372, 374, 382) comprising a sensor device (14, 44, 84, 92, 106, 114, 144, 164) that is interchangeably coupled to a processor device (72, 95, 103, 120, 150, 210, 220, 230, 360, 370, 380), a software service (20, 60, 78, 96, 96a, 130, 172, 216, 226, 236, 366, 376, 386) and an electronic workflow system (22, 62, 89, 98, 134, 174, 240, 352), where the sensor may correspond to at least one particular task of a workflow, and the software service may control operation of the modular device and generates metadata from task information received by the sensor of the modular devices for the electronic workflow system.
Abstract:
Example embodiments relate to keyword determination based on a weight of meaningfulness. In example embodiments, a computing device may determine a number of occurrences of a word in a particular document and may then determine a weight of meaningfulness for the word based on the number of occurrences. The computing device may then add the word to a set of keywords for the document based on the weight of meaningfulness.
Abstract:
Anti-counterfeit marking for a product, comprising: a tamper evident marker attached to or integrated with a product or packaging for a product and having a random mark; and an electronic memory element containing data about the random mark, the memory element being attached to or integrated with the product or packaging for the product in a tamper evident manner.
Abstract:
A method for enhancing security printing includes determining fields associated with print job variability. Physical security information is entered, and a physical security data stream is generated from the physical security information. The physical security data stream is mapped to a data stream that is used to provide settings for the fields for the print job variability. The fields for the print job variability are set based upon the mapping the physical security data stream.
Abstract:
Disclosed is a document analysis system and method. The document analysis system includes an interim analyzer configured to perform an interim document analysis to identify a number of interim regions on a digital document at an interim pixels-per-inch (PPI). The document analysis system also includes a complete analyzer configured to perform a complete analysis on at least one of the interim regions at a second PPI, thereby generating at least one complete region therefrom. The document analysis system and method provides significant flexibility to the user with a number of options relative to the analysis of the regions of information of interest in a digital document and to limit analysis to such preferred regions.
Abstract:
Example embodiments disclosed hereto relate So generating a regressive information object, information is encoded into an information object at states in a workflow. Information, is encoded such that information encoded in a last state in the workflow is readable by an information object reader and information, encoded in states prior to the last state is not readable by fee information object reader.
Abstract:
Methods for selecting metrics for substrate classification, and apparatus to perform such methods. The methods include determining a value of a metric from an image of a substrate sample for each substrate sample of a plurality of substrate samples, wherein the metric is indicative of a surface texture of each substrate sample and iteratively assigning substrate samples of the plurality of substrate samples to an aggregate of a particular number of aggregates in response to a value of the metric for each substrate sample until a convergence of clustering is deemed achieved, then determining an indication of cluster tightness of the particular number of aggregates. The methods further include selecting or ignoring the metric for substrate classification in response to the indication of cluster tightness of the particular number of aggregates.
Abstract:
A system can comprise a memory to store machine readable instructions and a processing unit to access the memory and execute the machine readable instructions. The machine readable instructions can comprise a feature set extractor to extract a feature set from each of a plurality of digital images of print samples. The feature set can be a filtered feature set that includes a feature set characterizing a printer that printed a given print sample of the print samples. The machine readable instructions can also comprise a cluster component to determine clusters of the print samples based on the feature set of each of the plurality of scanned images of the print samples. The machine readable instructions can further comprise a printer identifier to identify the printer of the print samples based on the clusters of the print samples.
Abstract:
A method for tracking customer loyalty information using an incremental information object (IIO) includes capturing an information object (IO), wherein the IO include a number of tiles, and wherein the number of tiles include a standard code. The method also includes analyzing the IO to identify the standard code and analyzing the IO to determine if the IO is an IIO containing a progressive code, wherein the progressive code does not interfere with the reading of the standard. The method further include confirming the progressive code and, if the progressive code is successfully confirmed, obtaining customer loyalty data from the progressive code within the IIO.