Geometric parsing of mathematical expressions
    1.
    发明申请
    Geometric parsing of mathematical expressions 有权
    几何解析数学表达式

    公开(公告)号:US20080253657A1

    公开(公告)日:2008-10-16

    申请号:US11784889

    申请日:2007-04-10

    IPC分类号: G06K9/18

    CPC分类号: G06K9/00402

    摘要: A processing device may parse a group of strokes representing a mathematical expression. The group of strokes may be examined to determine whether the group of strokes satisfies any of a finite set of rules. When the group of strokes, included in a region, satisfies any of the finite set of rules, the region may be partitioned according to a satisfied one of the finite set of rules. The group of strokes included in the region may be further examined to determine whether the group of strokes may be further partitioned according to any of the finite set of rules. After all regions have been examined and no further partitioning of regions may be performed, all mathematical symbols of the mathematical expression may be isolated in at least some of the regions and may be recognized.

    摘要翻译: 处理设备可以解析表示数学表达式的一组笔划。 可以检查一组笔划以确定笔划组是否满足任何一组有限的规则。 当包括在区域中的笔划组满足任何有限的规则集合时,可以根据有限规则集合中的一个满足区域。 可以进一步检查包括在该区域中的笔划组以确定是否可以根据任何有限规则集进一步划分笔划组。 在检查了所有区域之后,并且不能进行区域的进一步分割,数学表达式的所有数学符号可以在至少一些区域中被隔离并且可被识别。

    Geometric parsing of mathematical expressions
    2.
    发明授权
    Geometric parsing of mathematical expressions 有权
    几何解析数学表达式

    公开(公告)号:US08064696B2

    公开(公告)日:2011-11-22

    申请号:US11784889

    申请日:2007-04-10

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00402

    摘要: A processing device may parse a group of strokes representing a mathematical expression. The group of strokes may be examined to determine whether the group of strokes satisfies any of a finite set of rules. When the group of strokes, included in a region, satisfies any of the finite set of rules, the region may be partitioned according to a satisfied one of the finite set of rules. The group of strokes included in the region may be further examined to determine whether the group of strokes may be further partitioned according to any of the finite set of rules. After all regions have been examined and no further partitioning of regions may be performed, all mathematical symbols of the mathematical expression may be isolated in at least some of the regions and may be recognized.

    摘要翻译: 处理设备可以解析表示数学表达式的一组笔划。 可以检查一组笔划以确定笔划组是否满足任何一组有限的规则。 当包括在区域中的笔划组满足任何有限的规则集合时,可以根据有限规则集合中的一个满足区域。 可以进一步检查包括在该区域中的笔划组以确定是否可以根据任何有限规则集进一步划分笔划组。 在检查了所有区域之后,并且不能进行区域的进一步分割,数学表达式的所有数学符号可以在至少一些区域中被隔离并且可被识别。

    RECOGNITION OF TABULAR STRUCTURES
    3.
    发明申请
    RECOGNITION OF TABULAR STRUCTURES 审中-公开
    识别矩形结构

    公开(公告)号:US20120121182A1

    公开(公告)日:2012-05-17

    申请号:US13357414

    申请日:2012-01-24

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00463 G06K9/00422

    摘要: A number of regions and partitions may be created based on input handwritten atoms and a grammar parsing framework. Productions for tabular structures may be added to the grammar parsing framework to produce an extended grammar parsing framework. Each of the regions may be searched for a tabular structure. Upon finding a tabular structure, a type of tabular structure may be determined. Configuration partitions may be created, based on the added productions, and added to the created partitions. A set of configuration regions may be created based on the configuration partitions and added to the created regions. The productions for tabular structures and productions of the grammar parsing framework may be applied, as rewriting rules, to the atoms to produce possible recognition results. A best recognition result may be determined and displayed. A mechanism for correcting misrecognition errors, which may occur while recognizing tabular structures, may be provided.

    摘要翻译: 可以基于输入的手写原子和语法解析框架来创建多个区域和分区。 表格结构的生成可以被添加到语法解析框架中以产生扩展语法解析框架。 可以搜索每个区域的表格结构。 在找到表格结构后,可以确定一种类型的表格结构。 可以基于添加的生产创建配置分区,并将其添加到创建的分区。 可以基于配置分区创建一组配置区域,并将其添加到创建的区域。 语法解析框架的表格结构和制作的制作可以作为重写规则应用于原子以产生可能的识别结果。 可以确定和显示最佳识别结果。 可以提供用于校正在识别表格结构时可能发生的错误识别错误的机制。

    Recognition of mathematical expressions
    4.
    发明申请
    Recognition of mathematical expressions 有权
    数学表达式的识别

    公开(公告)号:US20080260251A1

    公开(公告)日:2008-10-23

    申请号:US11788190

    申请日:2007-04-19

    IPC分类号: G06K9/00

    摘要: In embodiments consistent with the subject matter of this disclosure, a user may input strokes as digital ink to a processing device. The processing device may partition the input strokes into multiple regions of strokes. A first recognizer and a second recognizer may score grammar objects included in regions and represented by chart entries. The scores may be converted to a converted score, which may have at least a near standard normal distribution. The processing device may present a recognition result based on highest converted scores according to a recurrence formula. The processing device may receive a correction hint with respect to misrecognized strokes and may add a penalty score with respect to chart entries representing grammar objects breaking the correction hint. Incremental recognition may be performed when a pause is detected during inputting of strokes.

    摘要翻译: 在与本公开的主题相一致的实施例中,用户可以将笔画作为数字墨水输入到处理设备。 处理装置可以将输入笔划划分成多个笔画区域。 第一识别器和第二识别器可以对包括在区域中的语法对象进行评分并由图表条目表示。 得分可以转换成转换得分,其可以具有至少近标准正态分布。 处理装置可以根据递归公式提供基于最高转换分数的识别结果。 处理设备可以接收关于错误识别的笔画的校正提示,并且可以相对于表示打破校正提示的语法对象的图表条目添加惩罚分数。 当在笔画输入期间检测到暂停时,可以执行增量识别。

    User interface for inputting two-dimensional structure for recognition
    5.
    发明申请
    User interface for inputting two-dimensional structure for recognition 有权
    用于输入二维结构进行识别的用户界面

    公开(公告)号:US20080260240A1

    公开(公告)日:2008-10-23

    申请号:US11788180

    申请日:2007-04-19

    IPC分类号: G06K9/62

    CPC分类号: G06K9/00436

    摘要: In embodiments consistent with the subject matter of this disclosure, a user may input one or more strokes as digital ink to a processing device. The processing device may produce and present a recognition result, which may include a misrecognized portion. A user may indicate a desire to correct the misrecognized portion and may further select one or more strokes of the misrecognized portion. The processing device may then present the one or more recognition alternates corresponding to the selected one or more strokes of the misrecognized portion. In some embodiments, the processing device may permit a user to rewrite the selected one or more strokes of the misrecognized portion with newly entered digital ink. Features, such as, rewriting and correction of the input digital ink may be discoverable in some embodiments.

    摘要翻译: 在与本公开的主题相一致的实施例中,用户可以将一个或多个笔画作为数字墨水输入到处理设备。 处理装置可以产生和呈现识别结果,其可以包括错误识别的部分。 用户可以指示纠正错误识别的部分的愿望,并且可以进一步选择错误识别部分的一个或多个笔画。 处理装置然后可以呈现对应于误识别部分的所选择的一个或多个笔画的一个或多个识别代替。 在一些实施例中,处理设备可以允许用户使用新输入的数字墨水来重写错误识别部分中所选择的一个或多个笔画。 在一些实施例中,可以发现诸如重写和校正输入数字墨水的特征。

    USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS
    6.
    发明申请
    USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS 审中-公开
    用户校正在光学字符识别(OCR)过程中出现的文本文档中的错误

    公开(公告)号:US20110280481A1

    公开(公告)日:2011-11-17

    申请号:US12780991

    申请日:2010-05-17

    IPC分类号: G06K9/03 G06K9/34

    CPC分类号: G06K9/033

    摘要: An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.

    摘要翻译: 通过进行OCR过程创建图像文档的电子模型。 电子模型包括由OCR处理中的多个顺序执行阶段中的每一个确定的图像文档的元素(例如,单词,文本行,段落,图像)。 电子模型用作输入信息,该信息由处理图像文档的前一级提供给每个级。 向用户呈现图形用户界面,使得用户可以提供校正出现在文档中的错误描述的项目的用户输入数据。 基于用户输入数据,产生引起错误特征项的初始误差的处理阶段校正初始误差。 在此阶段之后的OCR过程的阶段然后纠正由于初始错误而在其各自阶段中产生的任何后果性错误。

    User interface for providing digital ink input and correcting recognition errors
    7.
    发明授权
    User interface for providing digital ink input and correcting recognition errors 有权
    用于提供数字墨水输入和校正识别错误的用户界面

    公开(公告)号:US08116570B2

    公开(公告)日:2012-02-14

    申请号:US11788180

    申请日:2007-04-19

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00436

    摘要: In embodiments consistent with the subject matter of this disclosure, a user may input one or more strokes as digital ink to a processing device. The processing device may produce and present a recognition result, which may include a misrecognized portion. A user may indicate a desire to correct the misrecognized portion and may further select one or more strokes of the misrecognized portion. The processing device may then present the one or more recognition alternates corresponding to the selected one or more strokes of the misrecognized portion. In some embodiments, the processing device may permit a user to rewrite the selected one or more strokes of the misrecognized portion with newly entered digital ink. Features, such as, rewriting and correction of the input digital ink may be discoverable in some embodiments.

    摘要翻译: 在与本公开的主题相一致的实施例中,用户可以将一个或多个笔画作为数字墨水输入到处理设备。 处理装置可以产生和呈现识别结果,其可以包括错误识别的部分。 用户可以指示纠正错误识别的部分的愿望,并且可以进一步选择错误识别部分的一个或多个笔画。 处理装置然后可以呈现对应于误识别部分的所选择的一个或多个笔画的一个或多个识别代替。 在一些实施例中,处理设备可以允许用户使用新输入的数字墨水来重写错误识别部分中所选择的一个或多个笔画。 在一些实施例中,可以发现诸如重写和校正输入数字墨水的特征。

    RECOGNITION OF TABULAR STRUCTURES
    8.
    发明申请
    RECOGNITION OF TABULAR STRUCTURES 失效
    识别矩形结构

    公开(公告)号:US20090304282A1

    公开(公告)日:2009-12-10

    申请号:US12134200

    申请日:2008-06-06

    IPC分类号: G06K9/68

    CPC分类号: G06K9/00463 G06K9/00422

    摘要: A number of regions and partitions may be created based on input handwritten atoms and a grammar parsing framework. Productions for tabular structures may be added to the grammar parsing framework to produce an extended grammar parsing framework. Each of the regions may be searched for a tabular structure. Upon finding a tabular structure, a type of tabular structure may be determined. Configuration partitions may be created, based on the added productions, and added to the created partitions. A set of configuration regions may be created based on the configuration partitions and added to the created regions. The productions for tabular structures and productions of the grammar parsing framework may be applied, as rewriting rules, to the atoms to produce possible recognition results. A best recognition result may be determined and displayed. A mechanism for correcting misrecognition errors, which may occur while recognizing tabular structures, may be provided.

    摘要翻译: 可以基于输入的手写原子和语法解析框架来创建多个区域和分区。 表格结构的生成可以被添加到语法解析框架中以产生扩展语法解析框架。 可以搜索每个区域的表格结构。 在找到表格结构后,可以确定一种类型的表格结构。 可以基于添加的生产创建配置分区,并将其添加到创建的分区。 可以基于配置分区创建一组配置区域,并将其添加到创建的区域。 语法解析框架的表格结构和制作的制作可以作为重写规则应用于原子以产生可能的识别结果。 可以确定和显示最佳识别结果。 可以提供用于校正在识别表格结构时可能发生的错误识别错误的机制。

    Detecting position of word breaks in a textual line image
    9.
    发明授权
    Detecting position of word breaks in a textual line image 有权
    检测文字行图像中的分词位置

    公开(公告)号:US08345978B2

    公开(公告)日:2013-01-01

    申请号:US12749599

    申请日:2010-03-30

    IPC分类号: G06K9/00

    摘要: Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.

    摘要翻译: 执行OCR处理中的线分割以通过从输入中提取特征来定位分组,然后将分组分类成包括字间间隔和字符间的两个断点类之一来检测输入文本行图像内的单词的位置 休息 然后可以将包括检测到的单词的边界框和给定中断属于所识别的类别的概率的输出提供给下游OCR或用于后处理的其他组件。 有利的是,通过将行分割减少到特征的提取,包括每个断点的位置和断裂特征的数量以及断裂分类,线分割的任务变得不那么复杂,但不失一般性。

    Corrections for recognizers
    10.
    发明授权
    Corrections for recognizers 有权
    识别器的更正

    公开(公告)号:US08285049B2

    公开(公告)日:2012-10-09

    申请号:US12134193

    申请日:2008-06-06

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00436 G06K9/00463

    摘要: A processing device may recognize a number of input handwritten strokes, which may represent a mathematical expression, a chemical formula, or other two-dimensional structure. Rewriting rules of a grammar may be applied to the strokes to produce a number of possible recognition results. Each of the possible recognition results has a respective score based on a sum of rewriting rules applied to the strokes to produce respective ones of the possible recognition results. Input may be provided to identify misrecognized strokes and a correct terminal production, or symbol corresponding to the misrecognized strokes. Strokes may be misrecognized for many reasons, including parsing errors, over-grouping or under-grouping of matrices, and improper placement of a recognized terminal production, or symbol, with respect to a root structure. Correction hints may be leveraged for correcting types of errors mentioned above.

    摘要翻译: 处理装置可以识别可以表示数学表达式,化学式或其它二维结构的多个输入手写笔画。 语法的重写规则可以应用于笔画以产生许多可能的识别结果。 每个可能的识别结果具有基于施加到笔画的重写规则的总和的相应得分,以产生相应的可能的识别结果。 可以提供输入以识别误识别的笔画和正确的终端制作或与错误识别的笔画相对应的符号。 笔画可能由于许多原因而被误认识,包括解析错误,矩阵的分组过大或分组不足,以及对根结构的识别终端生产或符号的不正确放置。 纠正提示可用于纠正上述错误类型。