Automatic text grade level analyzer for a text processing system
    1.
    发明授权
    Automatic text grade level analyzer for a text processing system 失效
    用于文本处理系统的自动文本等级分析器

    公开(公告)号:US4456973A

    公开(公告)日:1984-06-26

    申请号:US373544

    申请日:1982-04-30

    IPC分类号: G06F17/27 G06F15/21

    CPC分类号: G06F17/2795

    摘要: A system for proofreading a text document and automatically detecting and replacing text words in the document which exceed a predetermined understandability level for the documents intended audience. Text words and synonyms are stored in a dictionary which includes an understandability code for each word based statistically on textbook grade levels. The operator enters a grade level code into the system for the intended document audience. The system scans the document for words which exceed the desired grade level, highlights those words on the system display and prompts the operator with synonyms which can be used to replace the highlighted word. The operator may select a desired replacement synonym by placing the system cursor underneath the word and depressing and enter key from the system keyboard.

    摘要翻译: 用于校对文本文档并自动检测和替换文档中的文本字的系统,该文本字符超出文档目标受众的预定可理解性级别。 文本词和同义词存储在字典中,其中包括基于教科书级别的统计学上每个单词的可理解性代码。 操作人员为期望的文档读者输入一个等级代码到系统中。 系统扫描文档超过所需年级的单词,突出显示系统显示上的这些单词,并提示操作员具有可用于替换突出显示的单词的同义词。 操作员可以通过将系统光标放在单词下面并按下并从系统键盘输入键来选择所需的替换同义词。

    Retrieval of related linked linguistic expressions including synonyms
and antonyms
    2.
    发明授权
    Retrieval of related linked linguistic expressions including synonyms and antonyms 失效
    检索相关联的语言表达,包括同义词和反义词

    公开(公告)号:US4384329A

    公开(公告)日:1983-05-17

    申请号:US218222

    申请日:1980-12-19

    摘要: A storage method and control system for storing and interactively accessing a large data base of related linguistic expressions such as synonyms and antonyms. The data base structure includes a stored ordered vocabulary of the linguistic expressions and a stored N.times.N binary matrix defining the relationship between the expressions in the vocabulary. Address indexes are associated with the vocabulary and binary matrix to enhance access times. The control system controls a programmable digital processor to receive an input linguistic expression and access the binary matrix to generate linkages to the related linguistic expressions in the vocabulary. The related linguistic expressions in the vocabulary are concatenated and displayed for operator review.

    摘要翻译: 一种用于存储和交互访问诸如同义词和反义词之类的相关语言表达的大数据库的存储方法和控制系统。 数据库结构包括存储的语言表达式的有序词汇和存储的NxN二进制矩阵,其定义词汇表达式之间的关系。 地址索引与词汇和二进制矩阵相关联,以增强访问时间。 控制系统控制可编程数字处理器以接收输入语言表达并访问二进制矩阵以产生与词汇中的相关语言表达式的联系。 词汇中的相关语言表达式被连接并显示用于操作者审查。

    Word autocorrelation redundancy match facsimile compression for text
processing systems
    3.
    发明授权
    Word autocorrelation redundancy match facsimile compression for text processing systems 失效
    字自相关冗余匹配文本处理系统的传真压缩

    公开(公告)号:US4494150A

    公开(公告)日:1985-01-15

    申请号:US397704

    申请日:1982-07-13

    CPC分类号: H04N1/4115 H03M7/42 H03M7/48

    摘要: A method and system for compacting text data to be transmitted over communications lines and thereby reduce the data volume and transmission time. Transmitting and receiving text processing systems are provided identical library memories containing text strings such as words commonly used in correspondence. Each word in a document to be communicated is compared to the transmitting system's word library and, if found in the library, only the library address is transmitted. If the word is not found in the library, then it is added to the transmitting system's library, sent, and added to the receiving system's library. The receiving system reconstructs the document by using the received addresses to access the appropriate words from its library and place them in the document. The system combines this word match encoding with character match encoding and facsimile run length encoding for communicating words not found in the system library.

    摘要翻译: 一种用于压缩通过通信线路发送的文本数据从而减少数据量和传输时间的方法和系统。 发送和接收文本处理系统提供了相同的库存储器,其包含文本串,例如通信中通常使用的单词。 将要传送的文档中的每个单词与发送系统的单词库进行比较,如果在库中找到,则仅传输库地址。 如果在库中找不到该字,则将其添加到发送系统的库中,发送并添加到接收系统的库中。 接收系统通过使用接收到的地址来重构文档,以从其库中访问适当的单词并将它们放在文档中。 该系统将该字匹配编码与字符匹配编码和传真运行长度编码相结合,用于传达在系统库中未发现的单词。

    System for automatically proofreading a document
    4.
    发明授权
    System for automatically proofreading a document 失效
    自动提交文件的系统

    公开(公告)号:US4136395A

    公开(公告)日:1979-01-23

    申请号:US755094

    申请日:1976-12-28

    摘要: Spelling errors in a word processing system are detected and presented to the operator for correction at the end of a document page. A dictionary memory contains representations of the correct spellings for words most frequently used. As each word is typed, it is stored in a word queue where it is compared to the contents of the dictionary memory. If the compare is unequal, then the word and its location on the page are stored in an error memory. When an end of page indicator is set the printer automatically repositions the print head at the ending character of the first word in the error list. When the operator keys in the correct spelling, the printer is caused to remove the misspelled word from the page and type the correct spelling. The corresponding word in the error memory is also corrected. As each misspelled word in the error memory is corrected, the remainder of the memory is scanned and repetitions of the same spelling error are automatically corrected.

    Data processing system for optimized mail piece sorting and mapping to
carrier walk sequence using real time statistical data
    6.
    发明授权
    Data processing system for optimized mail piece sorting and mapping to carrier walk sequence using real time statistical data 失效
    数据处理系统,用于优化邮件分类和映射到使用实时统计数据的载体步行序列

    公开(公告)号:US5287271A

    公开(公告)日:1994-02-15

    申请号:US748983

    申请日:1991-08-22

    IPC分类号: B07C3/00 G06Q99/00 G06F15/21

    摘要: A data processing system, method and program are disclosed to optimize mail piece sorting and the mapping of mail down to the carrier walk sequence using real time statistical data. The invention makes use of techniques such as fast OCR devices at a sending location or deferred processing of OCR scanned mail, to accumulate volume statistics indicating the number of mail pieces being routed particular addressees at a destination postal region on a given day. The information for mail volumes being directed to a particular postal region are collected over data communications links prior to the receipt of the actual mail pieces. The efficiency of sorting is maximized at the destination postal region by organizing the sorting apparatus to remove the highest volume addressee's mail first. This requires the compilation of the real time volume statistics from all of the sending postal regions sending mail to the destination postal location. In this manner, the maximum number of letters on every pass through the sorting apparatus can be achieved at the destination location. This minimizes the total number of reading operations required in order to achieve a desired level of mail sorting separation. Because the mail volume statistics are available at the destination location prior to sorting, at each stage of the sorting operations, bin allocation can be customized to yield the highest final patron or addressee sort. In this manner, the time for every subsequent pass through the sorting apparatus is reduced. This enables sorting directly to the addressee level and the distribution of the mail down to carrier walk sequence.

    摘要翻译: 公开了一种数据处理系统,方法和程序,以使用实时统计数据优化邮件分类和邮件向下映射到载体步行序列。 本发明利用诸如发送位置处的快速OCR设备或OCR扫描邮件的延迟处理的技术,以累积指示在给定日期的目的地邮政区域上特定地址的路由的邮件数量的卷统计量。 针对特定邮政区域的邮件卷的信息在收到实际邮件之前通过数据通信链路收集。 通过组织排序装置,首先删除最高容量的收件人的邮件,在目的地邮政区域最大化排序效率。 这需要汇编所有发送邮政区域的邮件到目的地邮政地点的实时统计数据。 以这种方式,可以在目的地位置实现每次通过分拣装置的最大字母数。 这样可以最大程度地减少为了达到所需级别的邮件分类分离所需的阅读操作总数。 因为邮件卷统计信息在排序之前在目的地位置可用,因此在排序操作的每个阶段都可以定制垃圾箱分配,以产生最高的最终顾客或收件人排序。 以这种方式,每次随后通过分拣装置的时间减少。 这使得能够直接将收件人级别分类并将邮件分发到载体步行序列。

    Deferred optical character recognition active pigeon hole sorting of
mail pieces
    8.
    发明授权
    Deferred optical character recognition active pigeon hole sorting of mail pieces 失效
    延迟光学字符识别活动鸽子孔分类邮件

    公开(公告)号:US5311597A

    公开(公告)日:1994-05-10

    申请号:US944559

    申请日:1992-09-11

    IPC分类号: B07C3/00 B07C7/00 G06K9/00

    CPC分类号: B07C7/005 B07C3/00 Y10S209/90

    摘要: A data processing method and system are disclosed to provide active pigeon hole sorting for mail pieces in a postal system. The method is based upon the receipt of deferred optical character recognition statistics for mail pieces in transit to a destination postal region. An ordered list of addressees is compiled from the DOCR statistics. From this ordered list, the sorting case for sorting the mail is partitioned to eliminate pigeon holes for those postal recipients not receiving mail on that day. Still further, the pigeon holes in the sorting case are actively indicated with a prompting light to facilitate the operator physically sorting the mail piece down to delivery sequence. The assignment of delivery stops to pigeon holes is also developed so as to designate adjacent pigeon holes based on the carrier walk without regard to street number but rather to reflect geographic juxtaposition.

    摘要翻译: 公开了一种数据处理方法和系统,用于为邮政系统中的邮件提供活动的鸽子洞分类。 该方法基于接收到运送到目的地邮政区域的邮件的延迟光学字符识别统计信息。 收件人的有序列表是从DOCR统计数据编制而成。 从这个有序的列表中,分类邮件的分类案例被分割,以消除当天不接收邮件的邮政接收者的空格。 此外,分类箱中的鸽孔通过提示灯积极地指示,以便操作者将邮件物理分类到递送顺序。 还开发了分配到鸽子洞的分配点,以便基于载体行走来指定相邻的鸽子孔,而不考虑街道数量,而是反映地理并置。

    System for detecting and correcting contextual errors in a text
processing system
    9.
    发明授权
    System for detecting and correcting contextual errors in a text processing system 失效
    用于检测和纠正文本处理系统中的上下文错误的系统

    公开(公告)号:US4674065A

    公开(公告)日:1987-06-16

    申请号:US807769

    申请日:1985-12-10

    IPC分类号: G06F17/27 G06K9/00 G06F7/28

    CPC分类号: G06F17/2795

    摘要: A system for automatically proofreading a document for word use validation in a text processing system is provided by coupling a specialized dictionary of sets of homophones and confusable words to sets of di-gram and N-gram conditions whereby proper usage of the words can be statistically determined. A text document is reviewed word-by-word against a dictionary of homophones and confusable words. When a match occurs, the related list of syntactic rules is examined relative to the context of the subject homophone or confusable word. If the syntax in the immediate context of the homophone or confusable word conflicts with the prestored syntax rules, the homophone or confusable word is highlighted on the system display. The system then displays the definition of the highlighted word along with possible intended alternative forms and their respective definitions. The operator can examine the word used and the possible alternatives and make a determination as to whether an error has been made and if a correction of the text is required. If correction is required, the operator may cause the error word to be replaced by the desired word by positioning the display cursor under the desired word and depressing an appropriate key on the system keyboard.

    摘要翻译: 在文本处理系统中自动校对一个用于文字使用验证的文档的系统是通过将同音词和混淆词集合的专门词典耦合到一组二进制和N-gram语言来提供的,从而正确使用单词可以是统计学的 决心。 一个文本文档逐字地反对同音词和混乱词的词典。 当匹配发生时,相对于主体同音或混淆词的上下文检查相关的句法规则列表。 如果同音字或混淆词的即时上下文中的语法与预先存储的语法规则冲突,则同音或混淆词在系统显示屏上突出显示。 系统然后显示突出显示的词的定义以及可能的预期替代形式及其各自的定义。 操作人员可以检查所使用的词和可能的替代方案,并确定是否发生错误以及是否需要更正文本。 如果需要校正,操作员可能会通过将显示光标放置在所需字下并按下系统键盘上的适当键,使错误字替换为所需字词。

    Non-text object storage and retrieval
    10.
    发明授权
    Non-text object storage and retrieval 失效
    非文本对象的存储和检索

    公开(公告)号:US5404435A

    公开(公告)日:1995-04-04

    申请号:US195043

    申请日:1994-02-14

    IPC分类号: G06F12/00 G06F17/30 G06F3/03

    摘要: The presence of a non-text object is sensed in a mixed object document to be archived in an information retrieval system. In addition to text objects, a mixed object document can contain non-text objects such as image objects, graphics objects, formatted objects, font objects, voice objects, video objects and animation objects. This enables the creation of key words which characterize the non-text object, for incorporation in the inverted file index of the data base, thereby enabling the later retrieval of either the entire document or the independent retrieval of the non-text object through the use of such key words.

    摘要翻译: 在混合对象文档中感测非文本对象的存在以存档在信息检索系统中。 除了文本对象之外,混合对象文档可以包含非文本对象,如图像对象,图形对象,格式化对象,字体对象,语音对象,视频对象和动画对象。 这使得能够创建表征非文本对象的关键词,用于并入数据库的反转文件索引中,从而使得通过使用能够稍后检索整个文档或非文本对象的独立检索 的关键词。