摘要:
A system for proofreading a text document and automatically detecting and replacing text words in the document which exceed a predetermined understandability level for the documents intended audience. Text words and synonyms are stored in a dictionary which includes an understandability code for each word based statistically on textbook grade levels. The operator enters a grade level code into the system for the intended document audience. The system scans the document for words which exceed the desired grade level, highlights those words on the system display and prompts the operator with synonyms which can be used to replace the highlighted word. The operator may select a desired replacement synonym by placing the system cursor underneath the word and depressing and enter key from the system keyboard.
摘要:
A storage method and control system for storing and interactively accessing a large data base of related linguistic expressions such as synonyms and antonyms. The data base structure includes a stored ordered vocabulary of the linguistic expressions and a stored N.times.N binary matrix defining the relationship between the expressions in the vocabulary. Address indexes are associated with the vocabulary and binary matrix to enhance access times. The control system controls a programmable digital processor to receive an input linguistic expression and access the binary matrix to generate linkages to the related linguistic expressions in the vocabulary. The related linguistic expressions in the vocabulary are concatenated and displayed for operator review.
摘要:
A method and system for compacting text data to be transmitted over communications lines and thereby reduce the data volume and transmission time. Transmitting and receiving text processing systems are provided identical library memories containing text strings such as words commonly used in correspondence. Each word in a document to be communicated is compared to the transmitting system's word library and, if found in the library, only the library address is transmitted. If the word is not found in the library, then it is added to the transmitting system's library, sent, and added to the receiving system's library. The receiving system reconstructs the document by using the received addresses to access the appropriate words from its library and place them in the document. The system combines this word match encoding with character match encoding and facsimile run length encoding for communicating words not found in the system library.
摘要:
Spelling errors in a word processing system are detected and presented to the operator for correction at the end of a document page. A dictionary memory contains representations of the correct spellings for words most frequently used. As each word is typed, it is stored in a word queue where it is compared to the contents of the dictionary memory. If the compare is unequal, then the word and its location on the page are stored in an error memory. When an end of page indicator is set the printer automatically repositions the print head at the ending character of the first word in the error list. When the operator keys in the correct spelling, the printer is caused to remove the misspelled word from the page and type the correct spelling. The corresponding word in the error memory is also corrected. As each misspelled word in the error memory is corrected, the remainder of the memory is scanned and repetitions of the same spelling error are automatically corrected.
摘要:
A data processing system and method for the correction of address information on mail. The method makes use of a contextual predictive keying method for enabling an operator to read the image of an addressee mailing address and type in a minimum number of keystrokes necessary to sort the mail piece down to the final sorting level at the destination post office.
摘要:
A data processing system, method and program are disclosed to optimize mail piece sorting and the mapping of mail down to the carrier walk sequence using real time statistical data. The invention makes use of techniques such as fast OCR devices at a sending location or deferred processing of OCR scanned mail, to accumulate volume statistics indicating the number of mail pieces being routed particular addressees at a destination postal region on a given day. The information for mail volumes being directed to a particular postal region are collected over data communications links prior to the receipt of the actual mail pieces. The efficiency of sorting is maximized at the destination postal region by organizing the sorting apparatus to remove the highest volume addressee's mail first. This requires the compilation of the real time volume statistics from all of the sending postal regions sending mail to the destination postal location. In this manner, the maximum number of letters on every pass through the sorting apparatus can be achieved at the destination location. This minimizes the total number of reading operations required in order to achieve a desired level of mail sorting separation. Because the mail volume statistics are available at the destination location prior to sorting, at each stage of the sorting operations, bin allocation can be customized to yield the highest final patron or addressee sort. In this manner, the time for every subsequent pass through the sorting apparatus is reduced. This enables sorting directly to the addressee level and the distribution of the mail down to carrier walk sequence.
摘要:
A system and method are disclosed for enabling the technique of deferred processing of OCR scanned mail to be compatible with existing techniques for mechanical sortation of mail that use standard sort barcode formats which are common to a given destination postal system. This enables deferred OCR processed mail to be sorted on an unsegregated basis along with other types of mail which have not been processed by the deferred OCR technique. This allows the OCR encoded mail to be processed along with other types of encoded mail during standard sort barcode that has been imprinted using prior technology such as OCR or manual code desks.
摘要:
A data processing method and system are disclosed to provide active pigeon hole sorting for mail pieces in a postal system. The method is based upon the receipt of deferred optical character recognition statistics for mail pieces in transit to a destination postal region. An ordered list of addressees is compiled from the DOCR statistics. From this ordered list, the sorting case for sorting the mail is partitioned to eliminate pigeon holes for those postal recipients not receiving mail on that day. Still further, the pigeon holes in the sorting case are actively indicated with a prompting light to facilitate the operator physically sorting the mail piece down to delivery sequence. The assignment of delivery stops to pigeon holes is also developed so as to designate adjacent pigeon holes based on the carrier walk without regard to street number but rather to reflect geographic juxtaposition.
摘要:
A system for automatically proofreading a document for word use validation in a text processing system is provided by coupling a specialized dictionary of sets of homophones and confusable words to sets of di-gram and N-gram conditions whereby proper usage of the words can be statistically determined. A text document is reviewed word-by-word against a dictionary of homophones and confusable words. When a match occurs, the related list of syntactic rules is examined relative to the context of the subject homophone or confusable word. If the syntax in the immediate context of the homophone or confusable word conflicts with the prestored syntax rules, the homophone or confusable word is highlighted on the system display. The system then displays the definition of the highlighted word along with possible intended alternative forms and their respective definitions. The operator can examine the word used and the possible alternatives and make a determination as to whether an error has been made and if a correction of the text is required. If correction is required, the operator may cause the error word to be replaced by the desired word by positioning the display cursor under the desired word and depressing an appropriate key on the system keyboard.
摘要:
The presence of a non-text object is sensed in a mixed object document to be archived in an information retrieval system. In addition to text objects, a mixed object document can contain non-text objects such as image objects, graphics objects, formatted objects, font objects, voice objects, video objects and animation objects. This enables the creation of key words which characterize the non-text object, for incorporation in the inverted file index of the data base, thereby enabling the later retrieval of either the entire document or the independent retrieval of the non-text object through the use of such key words.