Abstract:
In accordance with one aspect of the present invention, an image analysis and conversion method and system are disclosed in which digital ink images are converted to structured object representations of those images that can be edited by a structured text/graphics editor.
Abstract:
A graphical input and display system for creating and manipulating electronic images includes input devices permitting a user to manipulate elements of electronic images received from various image input sources. A processor, connected to the system, receives requests for various image editing operations and also accesses a memory structure. The system memory structure includes a user interaction module, which allows a user to enter new image material or select and modify existing image material to form primary image objects, as well as a grouping module, which maintains an unrestricted grouping structure, an output module, and data memory.
Abstract:
A method and apparatus for compressing a corpus of document images into a collective tokenized representation. Initially, documents in the corpus are individually compressed into a document tokenized format. A document image in the document tokenized format is represented using a symbol table and a table of positions. Each symbol in the symbol table is a shape in the original document image. The positions in the table of positions indicate where the symbols in the symbol table are placed to form the document image. Subsequently, the individual symbol tables of each document in the corpus are assembled to form clusters of similar shapes. These clusters are then analyzed to identify the degree of interrelationship between the symbols in the individual symbol tables. Individual document symbol tables with a large number of recurring symbols are grouped together. For each of the groups of symbol tables, a collective symbol table is computed. The collective symbol table improves the compression ratio of a corpus by eliminating redundant shapes appearing in the individual document symbol tables. The collective symbol table also identifies groupings of documents in the corpus that are related because a significant number of similar shapes are used in each of the documents.
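The tokenized format and the collective symbol table can be illustrated with a minimal sketch. It is not the method of the abstract: shapes are modelled as tuples of bitmap rows, the names `tokenize` and `merge_symbol_tables` are hypothetical, and exact equality of shapes stands in for the clustering of similar shapes that the abstract describes.

```python
from typing import Dict, List, Tuple

# Assumption: a shape is a tiny bitmap stored as a tuple of row strings.
Shape = Tuple[str, ...]


def tokenize(glyphs: List[Tuple[Shape, Tuple[int, int]]]):
    """Build a symbol table and a table of positions from (shape, (x, y)) pairs."""
    symbol_ids: Dict[Shape, int] = {}
    symbols: List[Shape] = []                    # the symbol table
    positions: List[Tuple[int, int, int]] = []   # (symbol index, x, y)
    for shape, (x, y) in glyphs:
        if shape not in symbol_ids:
            symbol_ids[shape] = len(symbols)
            symbols.append(shape)
        positions.append((symbol_ids[shape], x, y))
    return symbols, positions


def merge_symbol_tables(tables: List[List[Shape]]):
    """Merge per-document symbol tables into one collective table.

    Returns the collective table plus, for each document, a list mapping
    its old symbol indices to indices in the collective table.
    Assumption: only identical shapes are merged (no similarity clustering).
    """
    collective: List[Shape] = []
    index: Dict[Shape, int] = {}
    remaps: List[List[int]] = []
    for table in tables:
        remap = []
        for shape in table:
            if shape not in index:
                index[shape] = len(collective)
                collective.append(shape)
            remap.append(index[shape])
        remaps.append(remap)
    return collective, remaps
```

In this sketch, a shape shared by several documents is stored once in the collective table, which is the source of the compression gain the abstract mentions.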
Abstract:
Input image data define an input image set that shows a graphical feature and editing marks indicating an editing operation to be performed on the graphical feature. The input image data are used to obtain operation category data indicating whether the editing operation would translate the graphical feature so that it is centered at a different position within the input image set. The operation category data are used to obtain output image data defining an output image that includes an edited version of the input image set. The output image shows the graphical feature centered at a different position only if the operation category data so indicate. The input image set can include an original image showing the graphical feature and an overlay image showing the editing marks. The editing marks can form a node-link structure with the graphical feature. If the structure is a directed graph, it can indicate an editing operation that would translate the graphical feature to be centered at a different position, such as a simple translation to a new position, a translation with scaling or rotation, or a replacement operation. If the structure is an undirected graph, it can indicate an editing operation that would not translate the graphical feature, such as a delete operation or a scale or rotate operation. A rectangle with a dot inside it can indicate scaling or rotation. A cross can indicate deletion.
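The mapping from the editing-mark structure to the operation category can be sketched as a small lookup. This is only an illustration under stated assumptions: the types `Mark` and `EditStructure` and the functions `would_translate` and `describe` are hypothetical, and the abstract does not say how a replacement is told apart from a simple translation, so the sketch does not attempt it.

```python
from dataclasses import dataclass
from enum import Enum


class Mark(Enum):
    NONE = "none"
    CROSS = "cross"                  # a cross can indicate deletion
    RECT_WITH_DOT = "rect_with_dot"  # can indicate scaling or rotation


@dataclass(frozen=True)
class EditStructure:
    directed: bool          # True if the node-link structure is a directed graph
    mark: Mark = Mark.NONE


def would_translate(s: EditStructure) -> bool:
    """Operation category: does the edit re-center the graphical feature?"""
    return s.directed


def describe(s: EditStructure) -> str:
    """Coarse label for the indicated operation.

    Combinations the abstract does not describe fall back to a generic label.
    """
    if s.directed:
        if s.mark is Mark.RECT_WITH_DOT:
            return "translate with scale or rotate"
        return "translate or replace"
    if s.mark is Mark.CROSS:
        return "delete"
    if s.mark is Mark.RECT_WITH_DOT:
        return "scale or rotate"
    return "unspecified in-place edit"
```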
Abstract:
Input image data define an input image that shows a proportioned parts graph, such as a pie chart or a whole or segmented bar graph. The input image data are used to obtain segmented feature data indicating a feature in the input image that satisfies a constraint on segments. The segmented feature data are used to obtain proportion data indicating each segment's proportion. Various criteria could be applied to find parts of the input image that form the feature. For example, for some pie charts, the segments constraint could include a circularity criterion, a center criterion, and a direction criterion, and proportions could be obtained from directions. For other pie charts, the segments constraint could include a distinct regions criterion and a circularity criterion and proportions could be obtained from directions of region sides. The segments constraint could include a feature candidate criterion for a specific category of proportioned parts graphs, a center criterion applicable to all categories, and either a direction criterion or an angle criterion. The direction criterion could require a part that extends radially from a center, while the angle criterion permits parts that cover a range of angles. For either criterion, the segments constraint could also require nearness to a reference point such as the center. The input image can show a sketch and the proportion data can be used to obtain output image data defining an output image that includes a precisely formed proportioned parts graph or other graphical representation of the indicated proportions.
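For the pie chart case in which proportions are obtained from directions, the arithmetic can be sketched as follows. The sketch assumes that the directions of the radial parts have already been extracted as angles in degrees around the center; the function name `proportions_from_directions` is hypothetical, and the detection criteria themselves (circularity, center, direction) are not implemented.

```python
from typing import List


def proportions_from_directions(directions_deg: List[float]) -> List[float]:
    """Return each segment's share of the whole circle.

    Each segment lies between two angularly adjacent radial directions,
    so its proportion is the angular gap divided by 360 degrees.
    Proportions are returned in order of increasing direction.
    """
    angles = sorted({d % 360.0 for d in directions_deg})
    n = len(angles)
    if n < 2:
        raise ValueError("need at least two distinct directions")
    # The gap after the last direction wraps around to the first one.
    gaps = [(angles[(i + 1) % n] - angles[i]) % 360.0 for i in range(n)]
    return [gap / 360.0 for gap in gaps]
```

Directions at 0, 90, and 180 degrees, for example, give two quarter segments and one half segment.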
Abstract:
Input image data define an input image set that shows a graphical representation of a layout with two or more segments. The graphical representation can be a sketch, and can include, for example, a rectangular boundary with lines parallel to its sides defining rectangular segments within the boundary. The input image data are used to obtain segment source data indicating a source for each segment and segment position data indicating a position for each segment. The segment source data can indicate, for each segment of the layout, one of a number of source images in the input image set. The segment position data can indicate a reference point and a width and height for each segment. The source image for a segment can be a sketch of a graphical representation--such as a node-link structure, a parallel length graph, a proportioned parts graph, a row/column representation, a perimeter relationship representation, or a two-dimensional graph--that can be categorized and rendered to obtain data defining a precisely formed graphical representation. The segment source data for other segments and the segment position data can be used with the data defining the precisely formed graphical representation to obtain output image data defining an output image that includes a layout as represented by the graphical representation.
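The segment position data (a reference point plus a width and height per segment) can be sketched for the simplest case. The sketch assumes that every dividing line spans the whole boundary, so the segments form a regular grid, and that the reference point is the top-left corner; `Segment` and `grid_segments` are hypothetical names, and line detection in the sketch image is not shown.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Segment:
    x: float       # reference point: left edge
    y: float       # reference point: top edge
    width: float
    height: float


def grid_segments(boundary: Tuple[float, float, float, float],
                  vertical_cuts: List[float],
                  horizontal_cuts: List[float]) -> List[Segment]:
    """Split a rectangular boundary (x, y, width, height) into grid segments.

    vertical_cuts are x positions of lines parallel to the left/right sides;
    horizontal_cuts are y positions of lines parallel to the top/bottom sides.
    Segments are returned row by row, left to right.
    """
    x0, y0, w, h = boundary
    xs = [x0] + sorted(vertical_cuts) + [x0 + w]
    ys = [y0] + sorted(horizontal_cuts) + [y0 + h]
    segments = []
    for top, bottom in zip(ys, ys[1:]):
        for left, right in zip(xs, xs[1:]):
            segments.append(Segment(left, top, right - left, bottom - top))
    return segments
```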
Abstract:
A system and method for communication in an online electronic chat environment having multiple communication devices connected to each other in a communication network is provided. A display screen of one of the communication devices displays a chat region configured to hold text and a graphics region configured to hold graphic objects. The chat region and the graphics region are positioned on a common electronic canvas of the display screen. Text from the chat region can be moved to the graphics region, and graphic objects in the graphics region can be moved to the chat region. The design allows chat and graphics to be mixed in a common window when material is moved between the two modalities. In additional embodiments, the text in the chat region and the graphics in the graphics region are synchronized so that movement of one causes action in the other.
Abstract:
An image analysis and conversion method and system. Bitmapped ink images are converted to structured object representations of the bitmapped images, which may be read and edited by a structured text/graphics editor. The structured object representations correlate to perceptually salient areas of the bitmapped images. The structured object representations are editable by the structured text/graphics editor to allow a user to generate alternative interpretations of the bitmapped images.
Abstract:
A document search system provides a user with a programming interface for dynamically specifying features of documents recorded in a corpus of documents. The programming interface operates at a high level that is suitable for interactive user specification of layout components and structures of documents. In operation, a bitmap image of a document is analyzed by the document search system to identify layout objects such as text blocks or graphics. Subsequently, the document search system computes a set of attributes for each of the identified layout objects. These attributes are used to describe the layout structure of a page image of a document in terms of the spatial relations that layout objects have to frames of reference that are defined by other layout objects. After computing attributes for each layout object, a user can operate the programming interface to define unique document features. Each document feature is a routine defined by a sequence of selection operations which consume a first set of layout objects and produce a second set of layout objects. The second set of layout objects constitutes the feature in a page image of a document. Using the programming interface, a user flexibly defines a genre of document using the user-specified document features.
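The idea of a document feature as a sequence of selection operations over layout objects can be sketched with plain function composition. Everything named here is hypothetical (`LayoutObject`, `of_kind`, `topmost`, `feature`), the attribute set is reduced to a kind and a bounding box, and the y coordinate is assumed to grow downward; the abstract does not specify the actual interface.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class LayoutObject:
    kind: str      # e.g. "text" or "graphics"
    x: int
    y: int         # assumption: y grows downward from the top of the page
    width: int
    height: int


# A selection operation consumes one set of layout objects and produces another.
Selection = Callable[[List[LayoutObject]], List[LayoutObject]]


def of_kind(kind: str) -> Selection:
    """Keep only layout objects of the given kind."""
    return lambda objs: [o for o in objs if o.kind == kind]


def topmost() -> Selection:
    """Keep the object(s) closest to the top of the page."""
    def select(objs: List[LayoutObject]) -> List[LayoutObject]:
        if not objs:
            return []
        top = min(o.y for o in objs)
        return [o for o in objs if o.y == top]
    return select


def feature(*steps: Selection) -> Selection:
    """A document feature: selection operations applied in sequence."""
    def run(objs: List[LayoutObject]) -> List[LayoutObject]:
        for step in steps:
            objs = step(objs)
        return objs
    return run
```

Under these assumptions, a feature such as "the topmost text block" would be written as `feature(of_kind("text"), topmost())` and applied to the layout objects of a page.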
Abstract:
The present invention is a method and apparatus for analyzing image data, and more particularly for analyzing image data representing images that contain text in order to partition each image into running and non-running text regions. The present invention uses characteristics of running text regions to identify such regions and subsequently groups all non-running text regions into related groups.