Gesture-based selective text recognition
    1.
    发明授权
    Gesture-based selective text recognition 有权
    基于手势的选择性文本识别

    公开(公告)号:US08520983B2

    公开(公告)日:2013-08-27

    申请号:US12575015

    申请日:2009-10-07

    IPC分类号: G06K7/00 G06K9/32

    摘要: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.

    摘要翻译: 图像显示在触摸屏上。 检测到用户在显示图像上的下划线手势。 识别由下划线手势触摸的图像的区域和接近触摸区域的周边区域。 确定并补偿周边地区的文字偏差。 在周围区域中识别包括文本的文本区域,并从图像中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并返回OCR的文本。 输出OCR的文本。

    On-Screen Guideline-Based Selective Text Recognition
    2.
    发明申请
    On-Screen Guideline-Based Selective Text Recognition 有权
    基于屏幕指导的选择性文本识别

    公开(公告)号:US20110123115A1

    公开(公告)日:2011-05-26

    申请号:US12626520

    申请日:2009-11-25

    摘要: A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.

    摘要翻译: 由设备上相机拍摄的实时视频流将显示在具有重叠指南的屏幕上。 分析实时视频流的视频帧,以获得可接受的质量的视频帧。 在视频帧中识别文本区域,其近似于屏幕上的指南并从视频帧中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并以可编辑的符号形式(OCR的文本)生成文本。 确定OCR文本的置信度,并与阈值进行比较。 如果置信度分数超过阈值,则输出OCR的文本。

    Method and apparatus for generating and managing a language model data structure
    4.
    发明授权
    Method and apparatus for generating and managing a language model data structure 失效
    用于生成和管理语言模型数据结构的方法和装置

    公开(公告)号:US07020587B1

    公开(公告)日:2006-03-28

    申请号:US09608526

    申请日:2000-06-30

    IPC分类号: G06F7/60

    CPC分类号: G06F17/27 G10L15/285

    摘要: The generation and management of a language model data structure include assigning each segment of a received corpus to a node in a data structure that denotes dependencies between the respective nodes. A transitional probability between each of the nodes in the data structure is calculated. A frequency of occurrence is calculated for each item of the respective segments, and those nodes of the data structure associated with items that do not meet a minimum frequency of occurrence threshold are removed. The data structure may be managed across a system memory of a computer system and an extended memory of the computer system.

    摘要翻译: 语言模型数据结构的生成和管理包括将接收到的语料库的每个段分配给表示相应节点之间的依赖关系的数据结构中的节点。 计算数据结构中每个节点之间的过渡概率。 针对各段的每个项目计算出现频率,并且去除与不符合最小发生频率阈值的项目相关联的数据结构的那些节点。 可以跨计算机系统的系统存储器和计算机系统的扩展存储器来管理数据结构。

    On-screen guideline-based selective text recognition
    5.
    发明授权
    On-screen guideline-based selective text recognition 有权
    基于屏幕指南的选择性文本识别

    公开(公告)号:US08515185B2

    公开(公告)日:2013-08-20

    申请号:US12626520

    申请日:2009-11-25

    摘要: A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.

    摘要翻译: 由设备上相机拍摄的实时视频流将显示在具有重叠指南的屏幕上。 分析实时视频流的视频帧,以获得可接受的质量的视频帧。 在视频帧中识别文本区域,其近似于屏幕上的指南并从视频帧中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并以可编辑的符号形式(OCR的文本)生成文本。 确定OCR文本的置信度,并与阈值进行比较。 如果置信度分数超过阈值,则输出OCR的文本。

    GESTURE-BASED SELECTIVE TEXT RECOGNITION
    7.
    发明申请
    GESTURE-BASED SELECTIVE TEXT RECOGNITION 有权
    基于GESTURE的选择性文本识别

    公开(公告)号:US20110081083A1

    公开(公告)日:2011-04-07

    申请号:US12575015

    申请日:2009-10-07

    IPC分类号: G06K9/18

    摘要: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.

    摘要翻译: 图像显示在触摸屏上。 检测到用户在显示图像上的下划线手势。 识别由下划线手势触摸的图像的区域和接近触摸区域的周边区域。 确定并补偿周边地区的文字偏差。 在周围区域中识别包括文本的文本区域,并从图像中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并返回OCR的文本。 输出OCR的文本。