Editing scanned document images using simple interpretations
    1.
    发明授权
    Editing scanned document images using simple interpretations 失效
    使用简单的解释编辑扫描的文档图像

    公开(公告)号:US5734761A

    公开(公告)日:1998-03-31

    申请号:US268355

    申请日:1994-06-30

    申请人: Steven C. Bagley

    发明人: Steven C. Bagley

    CPC分类号: G06T11/60

    摘要: A method and apparatus for editing a scanned image. The present invention provides for editing of a scanned image in terms of interpretations of graphical objects contained therein. A graphical object can represent a letter, word, line of text, graphic or any other portion of the document image selected by the user. An interpretation embodies a predetermined relationship between graphical objects as well as editing operations that can be performed on the graphical objects. Interpretations belong to one of two classes. A first class, set interpretation, treats graphical objects as an unordered set lying within a document plane. Editing operations in a set interpretation allow a graphical object to be manipulated within in the document plane without disturbing the spatial orientation of other graphical objects. A second class, sequence interpretation, is like a set interpretation except that the set of graphical objects are ordered. An editing operation in a sequence interpretation will typically affect the spatial orientation of other graphical objects in the set of graphical objects. A particular type of sequence interpretation, called text interpretation, allows for manipulation of sets of graphical objects as if they were text.

    摘要翻译: 一种用于编辑扫描图像的方法和装置。 本发明提供了根据其中包含的图形对象的解释来编辑扫描图像。 图形对象可以表示由用户选择的文档图像的字母,字,行,图形或文档图像的任何其他部分。 解释体现了图形对象之间的预定关系以及可以对图形对象执行的编辑操作。 解释属于两类之一。 第一类设置解释将图形对象视为位于文档平面内的无序集。 集合解释中的编辑操作允许在文档平面内操纵图形对象,而不会影响其他图形对象的空间方向。 第二类,序列解释,就像集合解释,除了一组图形对象被排序。 顺序解释中的编辑操作通常会影响图形对象集合中其他图形对象的空间方向。 称为文本解释的特定类型的序列解释允许对图形对象的集合进行操纵,就像它们是文本一样。

    Editing text in an image
    4.
    发明授权
    Editing text in an image 失效
    编辑图像中的文字

    公开(公告)号:US5548700A

    公开(公告)日:1996-08-20

    申请号:US39553

    申请日:1993-03-29

    IPC分类号: G06T1/00 G06T11/60 G06F17/00

    CPC分类号: G06T11/60

    摘要: Character level text editing is performed on an image without recognizing characters, by operating on a character-size array obtained from a two-dimensional array defining an image region. A processor, in response to a request for a text editing operation, accesses an edit data structure that includes the image region array and performs the operation. The character-size array is obtained by dividing the image region array when necessary. An image region array that includes more than one line is divided along interline spaces. An image region array that includes one line is divided along intercharacter spaces. Character-size arrays are divided out of larger arrays by finding connected component bounding boxes, and then determining from the bounding boxes whether the connected components are likely to form a character. If so, the connected components are used to obtain the character-size array and spatial data about position, size, and shape of the character. Smaller arrays and spatial data can replace a larger array in the edit data structure. Smaller arrays are obtained only as necessary to perform a requested text editing operation, and if the edit data structure is not otherwise modified, obtaining a smaller array does not necessitate redrawing of the display. In addition to character level editing, a text editing operation can be performed on a sequence of arrays, such as a word, line, or a sequence that begins on one line and ends on another. The spatial data can be used to position arrays after insertion or deletion, to advance a cursor through the text, and to justify a line of arrays. A character-size array can be assigned to a keyboard key, and the key may then be used to insert that array into the text or to request a search for other arrays matching that array.

    摘要翻译: 通过对从定义图像区域的二维阵列获得的字符尺寸阵列进行操作,对图像执行字符级文本编辑,而不识别字符。 响应于文本编辑操作的请求,处理器访问包括图像区域阵列的编辑数据结构并执行该操作。 字符大小的阵列是必要时划分图像区域阵列获得的。 包含多条线的图像区域阵列沿着行间空间分割。 包括一条线的图像区域阵列沿着字符间隔被分割。 通过查找连接的组件边界框,从更大的数组中划分字符大小的数组,然后从边界框确定连接的组件是否可能形成一个字符。 如果是这样,连接的组件用于获取关于字符的位置,大小和形状的字符大小数组和空间数据。 较小的阵列和空间数据可以替代编辑数据结构中较大的阵列。 只有在执行所请求的文本编辑操作所需的时候才能获得较小的数组,并且如果编辑数据结构没有被修改,则获得较小的数组不需要重新绘制显示。 除了字符级编辑之外,还可以对一系列的数组执行文本编辑操作,例如字符,行或从一行开始并以另一行开头的序列。 空间数据可以用于在插入或删除之后对数组进行定位,以使光标在文本中前进,并对齐一行数组。 字符大小的数组可以分配给键盘键,然后可以使用键将该数组插入文本或请求搜索与该数组匹配的其他数组。

    Methods and apparatus for selecting semantically significant images in a
document image without decoding image content
    5.
    发明授权
    Methods and apparatus for selecting semantically significant images in a document image without decoding image content 失效
    在文件图像中选择语义有意义的图像而不对图像内容进行解码的方法和装置

    公开(公告)号:US5390259A

    公开(公告)日:1995-02-14

    申请号:US794191

    申请日:1991-11-19

    摘要: A method and apparatus for processing a document image, using a programmed general or special purpose computer, includes forming the image into image units, and at least one image unit classifier of at least one of the image units is determined, without decoding the content of the at least one of the image units. The classifier of the at least one of the image units is then compared with a classifier of another image unit. The classifier may be image unit length, width, location in the document, font, typeface, cross-section, the number of ascenders, the number of descenders, the average pixel density, the length of the top line contour, the length of the base contour, the location of image units with respect to neighboring image units, vertical position, horizontal inter-image unit spacing, and so forth. The classifier comparison can be a comparison with classifiers of image units of words in a reference table, or with classifiers of other image units in the document. Equivalent classes of image units can be generated, from which word frequency and significance can be determined. The image units can be determined by creating bounding boxes about identifiable segments or extractable units of the image, and can contain a word, a phrase, a letter, a number, a character, a glyph or the like.

    摘要翻译: 一种用于使用编程的通用或专用计算机处理文档图像的方法和装置,包括将图像形成为图像单元,并且确定至少一个图像单元的至少一个图像单元分类器,而不对 该至少一个图像单元。 然后将至少一个图像单元的分类器与另一图像单元的分类器进行比较。 分类器可以是图像单元长度,宽度,文档中的位置,字体,字体,横截面,上升数,下降数,平均像素密度,顶线轮廓的长度, 基本轮廓,图像单元相对于相邻图像单元的位置,垂直位置,水平图像间距等。 分类器比较可以是与参考表中的单词的图像单位的分类器或文档中的其他图像单元的分类器的比较。 可以生成等效的图像单位类别,从中可以确定字频率和重要性。 可以通过创建关于图像的可标识段或可提取单元的边界框来确定图像单元,并且可以包含单词,短语,字母,数字,字符,字形等。

    Changing characters in an image
    6.
    发明授权
    Changing characters in an image 失效
    改变图像中的特征

    公开(公告)号:US5167016A

    公开(公告)日:1992-11-24

    申请号:US459026

    申请日:1989-12-29

    CPC分类号: G06T11/60

    摘要: An existing character, in a text defined in image form by data such as a two-dimensional array, is copied to add a new character to the text. The existing character is found by performing character recognition on a two-dimensional data array defining an image that includes part of the text, such as a page. The array can be obtained from a scanner. A word that is recognized as including characters of the type needed is tested to determine whether it can be divided into the correct number of characters. The word is divided by finding connected components in the part of the array in which the word was found during recognition. The connected components are grouped into sets, each set being likely to be a character. If the word can be correctly divided, character-size arrays for its characters are obtained and saved. One of the arrays for the character type of the new character is selected and used to produce an array for the word in which it is included. The new word's array is then used to produce an array for a line in which the new word replaces an old word. The characters of the new word are spaced according to the spacing of the characters of the old word. The new character is positioned transverse to the line based on the transverse positioning of the existing character. The interword spaces of the line are adjusted. The line's array is then used to produce data defining a modified version of the text in image form.