Office correspondence storage and retrieval system
    1.
    发明授权
    Office correspondence storage and retrieval system 失效
    办公通信存储和检索系统

    公开(公告)号:US4358824A

    公开(公告)日:1982-11-09

    申请号:US107994

    申请日:1979-12-28

    摘要: A system that intelligently abstracts and archives a document for storage and interprets a free form user retrieval query to recall the document from the storage file. The system includes a method for automatically selecting keywords from the document using a parts of a speech directory. A method is given for weighing the importance or centrality of each keyword with respect to the document of its origin. Using the same logic paths, a free form query that describes the document in the same manner that it would have to be described to a secretary to "find" it in a filing cabinet, the system automatically determines the key matching terms and finds the archived document(s) with the greatest affinity.

    摘要翻译: 一种智能抽象和归档文档以进行存储的系统,并解释一个自由表单用户检索查询,以从存储文件中调用该文档。 该系统包括用于使用语音目录的一部分从文档中自动选择关键字的方法。 给出了衡量每个关键词对其起源文件的重要性或中心性的方法。 使用相同的逻辑路径,一种自由格式查询,以与向秘书进行描述的方式相同的方式描述文档,以便在文件柜中“查找”文档,系统自动确定关键匹配项,并找到归档 具有最大亲和力的文件。

    Stem processing for data reduction in a dictionary storage file
    2.
    发明授权
    Stem processing for data reduction in a dictionary storage file 失效
    用于字典存储文件中的数据缩减的句柄处理

    公开(公告)号:US4342085A

    公开(公告)日:1982-07-27

    申请号:US1123

    申请日:1979-01-05

    CPC分类号: G06F17/273

    摘要: A system for reducing storage requirements and accessing times in a text processing machine for automatic spelling verification and hyphenation functions. The system includes a method for storing a word list file and accessing the word list file such that legal prefixes and suffixes are truncated and only the unique root element, or "stem", of a word is stored. A set of unique rules is provided for prefix/suffix removal during compilation of the word list file and subsequent accessing of the word list file. Spelling verification is accomplished by applying the rules to the words whose spelling is to be verified and application of the said rules provides, under most circumstances, a natural hyphenation break point at the prefix-stem and stem-suffix junctions.

    摘要翻译: 一种用于在文本处理机中减少存储要求和访问时间的系统,用于自动拼写检验和连字符功能。 该系统包括用于存储单词列表文件和访问单词列表文件的方法,使得合法前缀和后缀被截断,并且仅存储单词的唯一根元素或“词干”。 在汇编单词列表文件和随后访问单词列表文件时,提供了一组唯一的规则用于前缀/后缀删除。 拼写验证是通过将规则应用于要拼写验证的单词,并且在大多数情况下,前缀和词干后缀连接处的自然连字符断点在上述规则的应用程序中提供。

    Alpha content match prescan method for automatic spelling error
correction
    3.
    发明授权
    Alpha content match prescan method for automatic spelling error correction 失效
    Alpha内容匹配预扫描方法进行自动拼写纠错

    公开(公告)号:US4328561A

    公开(公告)日:1982-05-04

    申请号:US108000

    申请日:1979-12-28

    摘要: A system for reducing the computation required to match a misspelled word against various candidates from a dictionary to find one or more words that represent the best match to the misspelled word. The major facility offered is the ability to computationally discern the degree of apparent match that exists between words that do not perfectly match a given target word without requiring the computationally tedious procedure of character by character positional matching which necessitates shifting and realignment to accommodate for differences between the candidate and target words due to character differences or added and dropped syllables. The system includes a method for storing and retrieving words from the dictionary based on their likelihood of being the correct version of a misspelled word and then reviewing those words further using the Prescan Alpha Content Match to reduce the number of candidates that must then be examined in a high resolution positional match to find the candidate(s) which matches the mis-spelled word with the greatest character affinity. The Prescan Alpha Content Match reduces the number of candidates in contention so as to make a high resolution match computationally feasible on a real-time basis.

    摘要翻译: 一种用于减少匹配拼写错误的单词与来自词典的各种候选者所需的计算的系统,以找到表示与拼写错误的单词最佳匹配的一个或多个单词。 所提供的主要设施是能够计算地辨别不完全匹配给定目标词的单词之间存在的表观匹配程度,而不需要按字符位置匹配进行计算上繁琐的过程,这需要移位和重新排列以适应 候选人和目标词由于字符差异或添加和删除的音节。 该系统包括一种方法,用于根据其拼写错误的单词的正确版本的可能性来存储和检索词典中的单词,然后使用Prescan Alpha内容匹配来进一步检查这些单词,以减少必须接受检查的候选人数 高分辨率位置匹配,以找到匹配具有最大字符亲和力的拼写错误的单词的候选。 Prescan Alpha内容匹配减少了竞争中的候选人的数量,以使得在实时的基础上使计算上可行的高分辨率匹配。

    Instantaneous alpha content prescan method for automatic spelling error
correction
    4.
    发明授权
    Instantaneous alpha content prescan method for automatic spelling error correction 失效
    用于自动拼写错误纠正的瞬时alpha内容预扫描方法

    公开(公告)号:US4355371A

    公开(公告)日:1982-10-19

    申请号:US133707

    申请日:1980-03-25

    IPC分类号: G06K9/72 G06F17/27 G06F7/02

    CPC分类号: G06F17/273

    摘要: A system for reducing the computation required to match a misspelled word against various candidates from a dictionary to find one or more words that represent the best match to the misspelled word. The major facility offered is the ability to computationally discern the degree of apparent match that exists between words that do not perfectly match a given target word without requiring the computationally tedious procedure of character by character positional matching which necessitates shifting and realignment to accommodate for differences between the candidate and target words due to character differences or added and dropped syllables. The system includes a method for storing and retrieving words from the dictionary based on their likelihood of being the correct version of a misspelled word and then reviewing those words further to reduce the number of candidates that must then be examined in a high resolution positional match to find the candidate(s) which matches the misspelled word with the greatest character affinity. This technique reduces the number of candidates in contention so as to make a high resolution match computationally feasible on a real-time basis. The discriminant potential and the real-time computational burden associated with the technique are balanced in an optimal manner.

    摘要翻译: 一种用于减少匹配拼写错误的单词与来自词典的各种候选者所需的计算的系统,以找到表示与拼写错误的单词最佳匹配的一个或多个单词。 所提供的主要设施是能够计算地辨别不完全匹配给定目标词的单词之间存在的表观匹配程度,而不需要按字符位置匹配进行计算上繁琐的过程,这需要移位和重新排列以适应 候选人和目标词由于字符差异或添加和删除的音节。 该系统包括一种方法,用于根据其拼写错误的单词的正确版本的可能性来存储和检索词典中的单词,然后进一步检查这些单词以减少必须在高分辨率位置匹配中必须检查的候选人数 找到匹配拼写错误的单词具有最大字符亲和度的候选人。 这种技术减少了竞争中的候选人的数量,以使得在实时的基础上使计算上可行的高分辨率匹配。 与该技术相关的判别电位和实时计算负担以最佳方式进行平衡。

    Word autocorrelation redundancy match facsimile compression for text
processing systems
    5.
    发明授权
    Word autocorrelation redundancy match facsimile compression for text processing systems 失效
    字自相关冗余匹配文本处理系统的传真压缩

    公开(公告)号:US4494150A

    公开(公告)日:1985-01-15

    申请号:US397704

    申请日:1982-07-13

    CPC分类号: H04N1/4115 H03M7/42 H03M7/48

    摘要: A method and system for compacting text data to be transmitted over communications lines and thereby reduce the data volume and transmission time. Transmitting and receiving text processing systems are provided identical library memories containing text strings such as words commonly used in correspondence. Each word in a document to be communicated is compared to the transmitting system's word library and, if found in the library, only the library address is transmitted. If the word is not found in the library, then it is added to the transmitting system's library, sent, and added to the receiving system's library. The receiving system reconstructs the document by using the received addresses to access the appropriate words from its library and place them in the document. The system combines this word match encoding with character match encoding and facsimile run length encoding for communicating words not found in the system library.

    摘要翻译: 一种用于压缩通过通信线路发送的文本数据从而减少数据量和传输时间的方法和系统。 发送和接收文本处理系统提供了相同的库存储器,其包含文本串,例如通信中通常使用的单词。 将要传送的文档中的每个单词与发送系统的单词库进行比较,如果在库中找到,则仅传输库地址。 如果在库中找不到该字,则将其添加到发送系统的库中,发送并添加到接收系统的库中。 接收系统通过使用接收到的地址来重构文档,以从其库中访问适当的单词并将它们放在文档中。 该系统将该字匹配编码与字符匹配编码和传真运行长度编码相结合,用于传达在系统库中未发现的单词。

    System for automatically proofreading a document
    6.
    发明授权
    System for automatically proofreading a document 失效
    自动提交文件的系统

    公开(公告)号:US4136395A

    公开(公告)日:1979-01-23

    申请号:US755094

    申请日:1976-12-28

    摘要: Spelling errors in a word processing system are detected and presented to the operator for correction at the end of a document page. A dictionary memory contains representations of the correct spellings for words most frequently used. As each word is typed, it is stored in a word queue where it is compared to the contents of the dictionary memory. If the compare is unequal, then the word and its location on the page are stored in an error memory. When an end of page indicator is set the printer automatically repositions the print head at the ending character of the first word in the error list. When the operator keys in the correct spelling, the printer is caused to remove the misspelled word from the page and type the correct spelling. The corresponding word in the error memory is also corrected. As each misspelled word in the error memory is corrected, the remainder of the memory is scanned and repetitions of the same spelling error are automatically corrected.

    Data processing system for optimized mail piece sorting and mapping to
carrier walk sequence using real time statistical data
    8.
    发明授权
    Data processing system for optimized mail piece sorting and mapping to carrier walk sequence using real time statistical data 失效
    数据处理系统,用于优化邮件分类和映射到使用实时统计数据的载体步行序列

    公开(公告)号:US5287271A

    公开(公告)日:1994-02-15

    申请号:US748983

    申请日:1991-08-22

    IPC分类号: B07C3/00 G06Q99/00 G06F15/21

    摘要: A data processing system, method and program are disclosed to optimize mail piece sorting and the mapping of mail down to the carrier walk sequence using real time statistical data. The invention makes use of techniques such as fast OCR devices at a sending location or deferred processing of OCR scanned mail, to accumulate volume statistics indicating the number of mail pieces being routed particular addressees at a destination postal region on a given day. The information for mail volumes being directed to a particular postal region are collected over data communications links prior to the receipt of the actual mail pieces. The efficiency of sorting is maximized at the destination postal region by organizing the sorting apparatus to remove the highest volume addressee's mail first. This requires the compilation of the real time volume statistics from all of the sending postal regions sending mail to the destination postal location. In this manner, the maximum number of letters on every pass through the sorting apparatus can be achieved at the destination location. This minimizes the total number of reading operations required in order to achieve a desired level of mail sorting separation. Because the mail volume statistics are available at the destination location prior to sorting, at each stage of the sorting operations, bin allocation can be customized to yield the highest final patron or addressee sort. In this manner, the time for every subsequent pass through the sorting apparatus is reduced. This enables sorting directly to the addressee level and the distribution of the mail down to carrier walk sequence.

    摘要翻译: 公开了一种数据处理系统,方法和程序,以使用实时统计数据优化邮件分类和邮件向下映射到载体步行序列。 本发明利用诸如发送位置处的快速OCR设备或OCR扫描邮件的延迟处理的技术,以累积指示在给定日期的目的地邮政区域上特定地址的路由的邮件数量的卷统计量。 针对特定邮政区域的邮件卷的信息在收到实际邮件之前通过数据通信链路收集。 通过组织排序装置,首先删除最高容量的收件人的邮件,在目的地邮政区域最大化排序效率。 这需要汇编所有发送邮政区域的邮件到目的地邮政地点的实时统计数据。 以这种方式,可以在目的地位置实现每次通过分拣装置的最大字母数。 这样可以最大程度地减少为了达到所需级别的邮件分类分离所需的阅读操作总数。 因为邮件卷统计信息在排序之前在目的地位置可用,因此在排序操作的每个阶段都可以定制垃圾箱分配,以产生最高的最终顾客或收件人排序。 以这种方式,每次随后通过分拣装置的时间减少。 这使得能够直接将收件人级别分类并将邮件分发到载体步行序列。

    Deferred optical character recognition active pigeon hole sorting of
mail pieces
    10.
    发明授权
    Deferred optical character recognition active pigeon hole sorting of mail pieces 失效
    延迟光学字符识别活动鸽子孔分类邮件

    公开(公告)号:US5311597A

    公开(公告)日:1994-05-10

    申请号:US944559

    申请日:1992-09-11

    IPC分类号: B07C3/00 B07C7/00 G06K9/00

    CPC分类号: B07C7/005 B07C3/00 Y10S209/90

    摘要: A data processing method and system are disclosed to provide active pigeon hole sorting for mail pieces in a postal system. The method is based upon the receipt of deferred optical character recognition statistics for mail pieces in transit to a destination postal region. An ordered list of addressees is compiled from the DOCR statistics. From this ordered list, the sorting case for sorting the mail is partitioned to eliminate pigeon holes for those postal recipients not receiving mail on that day. Still further, the pigeon holes in the sorting case are actively indicated with a prompting light to facilitate the operator physically sorting the mail piece down to delivery sequence. The assignment of delivery stops to pigeon holes is also developed so as to designate adjacent pigeon holes based on the carrier walk without regard to street number but rather to reflect geographic juxtaposition.

    摘要翻译: 公开了一种数据处理方法和系统,用于为邮政系统中的邮件提供活动的鸽子洞分类。 该方法基于接收到运送到目的地邮政区域的邮件的延迟光学字符识别统计信息。 收件人的有序列表是从DOCR统计数据编制而成。 从这个有序的列表中,分类邮件的分类案例被分割,以消除当天不接收邮件的邮政接收者的空格。 此外,分类箱中的鸽孔通过提示灯积极地指示,以便操作者将邮件物理分类到递送顺序。 还开发了分配到鸽子洞的分配点,以便基于载体行走来指定相邻的鸽子孔,而不考虑街道数量,而是反映地理并置。