Abstract:
The present invention relates to interfacing with electronic ink. Ink is stored in a data structure that permits later retrieval by applications. The ink includes stroke information and may include property information. Through various programming interfaces, one may interact with the ink by calling methods and by setting or retrieving properties. Other objects and collections may also be used in conjunction with the ink objects.
Abstract:
Ink is stored in a data structure that includes a mapping of the ink's coordinates from one space to another. The ink is transformed into the desired coordinates after it is captured, based at least on the mapping in the data structure.
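The idea of carrying the mapping alongside the captured ink can be illustrated with a minimal Python sketch. The abstract does not specify the form of the mapping, so the class names `Mapping` and `Ink` and the axis-aligned scale-and-offset form used here are assumptions made purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Mapping:
    # Hypothetical mapping: x' = sx * x + tx, y' = sy * y + ty.
    sx: float = 1.0
    sy: float = 1.0
    tx: float = 0.0
    ty: float = 0.0

    def apply(self, point):
        x, y = point
        return (self.sx * x + self.tx, self.sy * y + self.ty)


@dataclass
class Ink:
    # Strokes stay in capture-space coordinates; the mapping travels with them.
    strokes: list
    mapping: Mapping = field(default_factory=Mapping)

    def to_target_space(self):
        # The transformation is deferred until a consumer asks for it.
        return [[self.mapping.apply(p) for p in stroke] for stroke in self.strokes]


ink = Ink(strokes=[[(0, 0), (100, 200)]],
          mapping=Mapping(sx=0.5, sy=0.5, tx=10, ty=20))
print(ink.to_target_space())
```

Because the raw points are never overwritten, a different consumer could apply a different mapping to the same captured ink.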
Abstract:
Ink is stored in a data structure that permits later retrieval by applications. The ink includes stroke information and property information. Properties may be associated globally with the ink strokes and/or with each stroke individually. Aspects include a data structure that supports ink for multiple applications. Using the storage system, method, and data structure, various applications may expand on ink to include additional attributes while permitting the ink to be used in applications not supporting the additional attributes. When encountering an unknown tag, the applications will skip past the end of the tag and its associated data to start reading the next tag.
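The skip-unknown-tags behavior can be sketched as a tag-length-value reader in Python. The abstract does not describe the byte layout, so the one-byte tag, two-byte little-endian length, the tag numbers, and the names `KNOWN_TAGS`, `encode`, and `read_known` are all assumptions for illustration.

```python
import struct

# Hypothetical tag numbers understood by this particular application.
KNOWN_TAGS = {1: "stroke", 2: "color"}

HEADER = "<BH"  # assumed layout: 1-byte tag, 2-byte little-endian payload size
HEADER_SIZE = struct.calcsize(HEADER)


def encode(tag, payload):
    """Serialize one record as header followed by payload bytes."""
    return struct.pack(HEADER, tag, len(payload)) + payload


def read_known(data):
    """Return (name, payload) for known tags, skipping unknown ones."""
    records = []
    pos = 0
    while pos < len(data):
        tag, size = struct.unpack_from(HEADER, data, pos)
        pos += HEADER_SIZE
        payload = data[pos:pos + size]
        pos += size  # always advance past the data, known tag or not
        if tag in KNOWN_TAGS:
            records.append((KNOWN_TAGS[tag], payload))
    return records


# Tag 99 stands in for an attribute added by some other application.
blob = encode(1, b"\x01\x02") + encode(99, b"extra-attr") + encode(2, b"\xff")
print(read_known(blob))
```

Storing the size next to each tag is what lets a reader step over data it does not understand and resume at the next tag.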
Abstract:
A system and method are described for manipulating text relative to a curved reference line in order to transform a character, rendered in a particular font, into a warped character whose degree of warping reflects the local curvature of the curved reference line. The curvature of the reference line is reflected in the shape of the quadrilaterals, i.e., quads, generated for a corresponding rectangle on a straight reference line. The coordinates of the corners of the quad provide the parameters to carry out the transformation. Such rendering improves the appearance of the text and provides a method that modifies available fonts in a flexible fashion without the need to generate new fonts.
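One way to picture a rectangle-to-quad transformation driven by the quad's corner coordinates is bilinear interpolation, sketched below in Python. The abstract does not say which interpolation is used, so the bilinear form, the function name `warp_point`, and the corner ordering are assumptions.

```python
def warp_point(point, rect, quad):
    """Map a point inside an axis-aligned rectangle into a quad.

    rect is (x0, y0, x1, y1). quad lists the four corners in the assumed
    order: image of (x0, y0), of (x1, y0), of (x1, y1), of (x0, y1).
    """
    x, y = point
    x0, y0, x1, y1 = rect
    # Normalized position of the point inside the rectangle, each in [0, 1].
    u = (x - x0) / (x1 - x0)
    v = (y - y0) / (y1 - y0)
    (ax, ay), (bx, by), (cx, cy), (dx, dy) = quad
    # Bilinear blend of the four quad corners.
    px = (1 - u) * (1 - v) * ax + u * (1 - v) * bx + u * v * cx + (1 - u) * v * dx
    py = (1 - u) * (1 - v) * ay + u * (1 - v) * by + u * v * cy + (1 - u) * v * dy
    return (px, py)


rect = (0, 0, 10, 20)
quad = [(0, 0), (10, 2), (9, 22), (-1, 20)]  # slightly sheared and tilted cell
print(warp_point((5, 10), rect, quad))
```

Applying the same function to every control point of a glyph outline would warp the character without touching the font itself.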
Abstract:
The present invention relates to methods and apparatus for increasing the perceived quality of displayed images. This is achieved in a variety of ways including the use of a plurality of device specific display characteristics when preparing images for display. It is also achieved through the monitoring of display device and/or ambient light conditions, e.g., on a periodic basis, and using the obtained information when controlling display output. Another approach to improving the perceived quality of displayed images involves the use of information relating to a specific user's ability to perceive image characteristics such as color. By customizing display output to an individual user's own physical perception capabilities and/or viewing characteristics it is possible to enhance the image quality perceived by the individual viewer as compared to embodiments which do not take into consideration individual user characteristics.
Abstract:
Methods and systems for increasing the contrast with which thin-stemmed characters can be displayed in sub-pixel rendering processes are disclosed. Some fonts, such as Courier New, have relatively thin stems and other character features that would otherwise be rendered with low contrast when displayed using an oversampling process combined with a rendering process that treats individual pixel sub-components as separate luminous intensity sources. In order to avoid such low contrast features and to avoid the need for revising the font itself, the alpha values for the pixel sub-components that correspond to the character features are selectively increased to simulate larger dimensions of the character features and to increase the contrast with which the character dimensions are displayed. The methods involve oversampling image data representing a character by obtaining multiple samples for each of a plurality of pixel sub-components of a pixel. The alpha values associated with pixel sub-components at or near the position of the character features are increased to give the appearance of larger dimensions of the character features and greater contrast in the displayed image.
Abstract:
Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.
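The two-step structure (locate breaks, then classify each as inter-word or inter-character) can be sketched in Python. This is a deliberately simplified stand-in: the break feature is just the width of a run of empty columns, and the classifier is a fixed threshold (`min_word_gap`), both of which are assumptions and not the feature set or classifier of the described process. No class probability is computed here.

```python
def find_breaks(columns):
    """Return (start, end) for each run of empty columns between inked columns.

    columns holds the number of ink pixels in each column of the line image.
    Leading and trailing margins are not reported as breaks.
    """
    breaks = []
    start = None
    seen_ink = False
    for i, count in enumerate(columns):
        if count == 0:
            if seen_ink and start is None:
                start = i
        else:
            if start is not None:
                breaks.append((start, i))
                start = None
            seen_ink = True
    return breaks


def segment_words(columns, min_word_gap=3):
    """Return horizontal word extents (start, end), splitting at wide breaks."""
    inked = [i for i, count in enumerate(columns) if count > 0]
    if not inked:
        return []
    words = []
    word_start = inked[0]
    for start, end in find_breaks(columns):
        if end - start >= min_word_gap:  # classified as an inter-word break
            words.append((word_start, start))
            word_start = end
        # narrower breaks are treated as inter-character and ignored
    words.append((word_start, inked[-1] + 1))
    return words


columns = [0, 2, 3, 0, 4, 0, 0, 0, 5, 1, 0]
print(find_breaks(columns))
print(segment_words(columns))
```

A trained classifier over richer break features would replace the threshold test while leaving the surrounding structure unchanged.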
Abstract:
An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightness or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function. Specifically, the second fitness function increases with increasing lightness of pixels immediately above the shifted base-line while also increasing with decreasing lightness of pixels through which the shifted base-line passes. The x-height is equal to the sum of the predetermined amounts by which the base-line is shifted upward in order to maximize the second fitness function. In some cases different groups of text-lines in the textual image may be characterized differently from one another. For example, each group may be characterized by a most probable x-height for that group.
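The two fitness functions can be illustrated with a small Python sketch. It makes strong simplifying assumptions: the parametric curve is restricted to a horizontal pixel row, the image is a list of rows of lightness values (0 is dark, 255 is light), each fitness is a plain sum of lightness differences, and the best shift is found by scanning all one-pixel shifts. The function names are hypothetical.

```python
def row_diff(image, light_row, dark_row):
    """Sum over columns of lightness(light_row) minus lightness(dark_row)."""
    return sum(l - d for l, d in zip(image[light_row], image[dark_row]))


def find_baseline(image):
    # First fitness: light pixels just below the row, dark pixels on the row.
    return max(range(len(image) - 1), key=lambda r: row_diff(image, r + 1, r))


def find_mean_line(image, baseline):
    """Return (mean_line_row, x_height). Assumes baseline >= 2."""
    # Second fitness: light pixels just above the shifted row, dark on it.
    # Shifts are in single-pixel steps, so the x-height equals the best shift.
    best_shift = max(
        range(1, baseline),
        key=lambda s: row_diff(image, baseline - s - 1, baseline - s),
    )
    return baseline - best_shift, best_shift


# Toy image: 7 rows by 4 columns, with a dark text band on rows 2 to 4.
LIGHT, DARK = 255, 0
image = [[DARK if 2 <= r <= 4 else LIGHT] * 4 for r in range(7)]

baseline = find_baseline(image)
mean_line, x_height = find_mean_line(image, baseline)
print(baseline, mean_line, x_height)
```

Real text lines are rarely perfectly horizontal, which is why the described process fits a parametric curve instead of a single row.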
Abstract:
In embodiments consistent with the subject matter of this disclosure, a user may input one or more strokes as digital ink to a processing device. The processing device may produce and present a recognition result, which may include a misrecognized portion. A user may indicate a desire to correct the misrecognized portion and may further select one or more strokes of the misrecognized portion. The processing device may then present one or more recognition alternates corresponding to the selected one or more strokes of the misrecognized portion. In some embodiments, the processing device may permit a user to rewrite the selected one or more strokes of the misrecognized portion with newly entered digital ink. Features such as rewriting and correction of the input digital ink may be discoverable in some embodiments.