摘要:
A document processing system including a plurality of model images stored in a memory is provided. The model images are represented by a first information set with the first information set varying as a function of object-based coordinates. At least one of the plurality of model images includes a text containing region having index information intended for use in storing one or more document pages. Moreover, a query image represented as a second set of information varying as a function of object-based coordinates is provided. In practice, an image localization module corresponds the second set of information with a portion of the first set of information to obtain the text containing region, and a text extraction module extracts the index information from the text containing region to facilitate the storing of the one or more document pages.
摘要:
Providing means for people to input their observations can reduce the need for sensor deployments because humans have excellent sensing abilities. One thing people tend to observe carefully is parking availability. Parking meters and pay stations can request people to enter their observations of parking availability and other environmental factors. The observations, being numerical in nature, can be processed to determine reasonable parking fees, likelihood of violators in an area, and the statistical, observed, or estimated dispersion of available parking with a geographic region.
摘要:
Methods and systems for identifying device models or accounts exhibiting outlying behavior are disclosed. For a method of identifying a device model exhibiting outlying behavior, a processor may receive a color impression count, a monochrome impression count and either a device model for each of a plurality of devices. A proportion of color revenue may be determined for each device based on the color impression count and the monochrome impression count. The processor may determine, for each device model, a distribution of the proportion of color revenue for the one or more devices having the device model and may automatically identify one or more distributions of the proportion of color revenue exhibiting outlying behavior. Each distribution is associated with a device model.
摘要:
A method for analyzing documents is disclosed. The method compares concepts consisting of groups of terms for similarity within a corpus of document, clusters documents that contain certain concept term sets together. It may also rank the documents within each cluster according to the frequency of term co-occurrence within the concepts.
摘要:
A systems and methods for providing an image forming machine capable of monitoring the image quality of images that the image forming machine produces and detecting changes in the image quality. The monitoring system using statistical techniques to fit predetermined models to a measured image quality of time sequence of formed images. The predetermined models used to find current and predicted values of image quality and notifying a user or service provider when the image quality has changed.
摘要:
A system for detecting suspect meter reads in a print environment may include a computing device and a computer-readable storage medium in communication with the computing device. The computer-readable storage medium may include one or more programming instructions for receiving historical meter read values associated with a print-related service, selecting a model set including one or more of the historical meter read values, using a predictive model to determine an anticipated meter read value and a corresponding forecast error value from the model set, determining an updated forecast error value, determining a threshold value, identifying an actual meter read value, determining an average rate associated with the actual meter read value, and flagging the actual meter read value as suspect based on a comparison of the average rate and the threshold value.
摘要:
An inventory management system for forecasting demand in a print production environment may include a computing device and a computer-readable storage medium in communication with the computing device. The computer-readable storage medium may include programming instructions for updating a predictive model with intervention information comprising an anticipated demand value and a confidence value associated with the anticipated demand value. The predictive model may be associated with a demand distribution of a print-related service. The computer-readable storage medium may include programming instructions for generating a demand forecast associated with the print-related service by using the updated predictive model, using the generated demand forecast to compare a current inventory level associated with the print-related service to an anticipated inventory level associated with the demand forecast of the print-related service, and ordering additional inventory in response to the current inventory level being less than the anticipated inventory level.
摘要:
The present disclosure is directed to a method and apparatus for applying magnetic ink character recognition (MICR) technology to enable the embedding of coded information within text characters of a document.
摘要:
A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.
摘要:
A computer implemented system for segmenting data collected from a document production environment is provided. The system includes determining, with a computer implemented data processing platform, that a set of document production related data should be represented as a non-normal distribution. A first test is performed and it is determined that the non-normal distribution should not be analyzed pursuant to a first analytic category. A second test is performed and when it is determined that the non-normal distribution should be analyzed pursuant to a second analytic category, an output, indicating that the non-normal distribution should be analyzed pursuant to the second analytic category is provided.