Image selection and recognition processing from a video feed

    Publication number: US09626577B1

    Publication date: 2017-04-18

    Application number: US14486509

    Application date: 2014-09-15

    Inventors: Qingfeng Yu, Yue Liu

    CPC classification number: G06K9/036 G06K9/3258

    Abstract: A system that selects image frames from a video feed for recognition of objects (such as physical objects or text characters) within the image frames. Individual frames are selected using robust historical metrics that compare metrics of the particular image (such as focus, motion, and intensity) to the corresponding metrics of previous image frames in the video feed. The system selects an image frame for object recognition if the frame is of relatively high quality, that is, if it is suitable for later object recognition processing.
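
The comparison against previous frames described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the class name `FrameSelector`, the use of a single scalar quality metric (for example a focus score), the sliding-window z-score rule, and all parameter values are hypothetical.

```python
from collections import deque
from statistics import mean, pstdev


class FrameSelector:
    """Select frames whose quality metric is high relative to recent history.

    Hypothetical sketch: the z-score rule and the parameters are assumptions.
    """

    def __init__(self, history_size=30, z_threshold=0.5, min_history=5):
        self.history = deque(maxlen=history_size)  # metrics of previous frames
        self.z_threshold = z_threshold
        self.min_history = min_history

    def offer(self, metric):
        """Return True if the frame with this quality metric should be selected."""
        selected = False
        if len(self.history) >= self.min_history:
            mu = mean(self.history)
            sigma = pstdev(self.history)
            # Guard against a flat history, where the deviation is zero.
            z = (metric - mu) / sigma if sigma > 0 else 0.0
            selected = z >= self.z_threshold
        # The current frame becomes part of the history for later frames.
        self.history.append(metric)
        return selected


selector = FrameSelector(history_size=10, z_threshold=1.0, min_history=3)
decisions = [selector.offer(m) for m in [10, 11, 9, 10, 20, 10]]
```

In this sketch only the fifth frame, whose metric is well above the recent average, would be passed on to recognition. A real system would likely combine several metrics rather than one.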

    2. Sharpness-based frame selection for OCR
    Granted invention patent (status: in force)

    Publication number: US09576210B1

    Publication date: 2017-02-21

    Application number: US14500005

    Application date: 2014-09-29

    Abstract: A system to select video frames for optical character recognition (OCR) based on feature metrics associated with blur and sharpness. A device captures a video frame including text characters. An edge detection filter is applied to the frame to determine gradient features in perpendicular directions. An “edge map” is created from the gradient features, and points along edges in the edge map are identified. An edge transition width is determined at each edge point based on the local intensity minimum and maximum on opposite sides of that edge point in the frame. Sharper edges have smaller edge transition widths than blurry edges. Statistics are determined from the edge transition widths, and the statistics are processed by a trained classifier to determine whether the frame is sufficiently sharp for text processing.
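
The edge transition width computation described above might look roughly like this. It is a simplified sketch, not the patent's implementation: the Sobel filter, the gradient threshold, the restriction to horizontal or vertical intensity profiles, and all function names are assumptions made for illustration. The image is a plain 2D list of grayscale values, and the trained classifier that would consume the statistics is not shown.

```python
from statistics import mean, median


def sobel(img, y, x):
    """Return (gx, gy) Sobel gradients at an interior pixel of a 2D list."""
    gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
          - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
    gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
          - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
    return gx, gy


def walk_to_extremum(values, start, step, rising):
    """Walk from start in direction step while intensity keeps rising (or falling)."""
    i = start
    while 0 <= i + step < len(values):
        nxt, cur = values[i + step], values[i]
        if (rising and nxt <= cur) or (not rising and nxt >= cur):
            break
        i += step
    return i


def edge_transition_widths(img, grad_threshold=100):
    """Return one transition width (in pixels) per detected edge point."""
    h, w = len(img), len(img[0])
    widths = []
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx, gy = sobel(img, y, x)
            if abs(gx) + abs(gy) < grad_threshold:
                continue  # not an edge point
            # Simplification: follow the row or column closest to the gradient.
            if abs(gx) >= abs(gy):
                profile, pos, g = img[y], x, gx
            else:
                profile, pos, g = [row[x] for row in img], y, gy
            sign = 1 if g > 0 else -1  # direction in which intensity increases
            hi = walk_to_extremum(profile, pos, sign, rising=True)
            lo = walk_to_extremum(profile, pos, -sign, rising=False)
            widths.append(abs(hi - lo))
    return widths


def width_statistics(widths):
    """Summary statistics a classifier could use as features (None if no edges)."""
    if not widths:
        return None
    return {"mean": mean(widths), "median": median(widths), "max": max(widths)}


sharp = [[0, 0, 0, 255, 255, 255] for _ in range(5)]    # abrupt step edge
blurry = [[0, 50, 100, 150, 200, 250] for _ in range(5)]  # gradual ramp
sharp_stats = width_statistics(edge_transition_widths(sharp))
blurry_stats = width_statistics(edge_transition_widths(blurry))
```

On these two synthetic images the step edge yields a smaller mean width than the ramp, matching the abstract's observation that sharper edges have smaller transition widths.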

    3. Sharpness-based frame selection for OCR
    Granted invention patent (status: in force)

    Publication number: US09418316B1

    Publication date: 2016-08-16

    Application number: US14500208

    Application date: 2014-09-29

    CPC classification number: G06K9/3258 G06K9/6231 G06K2209/01

    Abstract: A process for training and optimizing a system to select video frames for optical character recognition (OCR) based on feature metrics associated with blur and sharpness. A set of image frames is subjectively labelled based on a comparison of each frame before and after binarization, to determine to what degree text is recognizable in the binary image. A plurality of different sharpness feature metrics are generated from the original frame. A classifier is then trained using the feature metrics and the subjective labels. The feature metrics are then tested for accuracy and/or correlation with the subjective labelling data. The set of feature metrics may be refined based on which metrics produce the best results.
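
The train, test, and refine loop described above can be illustrated with a deliberately tiny example. Everything here is an assumption for illustration: the one-feature threshold classifier (a decision stump) stands in for whatever classifier the patent uses, the labels are taken to be binary (1 = text recognizable after binarization), the feature names and values are made up, and accuracy is measured on the training data only for brevity.

```python
def train_stump(values, labels):
    """Fit a one-feature threshold classifier.

    Returns (threshold, polarity, accuracy). Polarity 1 predicts label 1 when
    value >= threshold; polarity -1 predicts label 1 when value < threshold.
    """
    best = (None, 1, 0.0)
    for t in sorted(set(values)):
        for polarity in (1, -1):
            preds = [int((v >= t) == (polarity == 1)) for v in values]
            acc = sum(p == y for p, y in zip(preds, labels)) / len(labels)
            if acc > best[2]:
                best = (t, polarity, acc)
    return best


def rank_features(features, labels):
    """Rank feature metrics by the accuracy of their best stump, best first."""
    scored = {name: train_stump(vals, labels)[2] for name, vals in features.items()}
    return sorted(scored, key=scored.get, reverse=True)


# Hypothetical subjective labels and feature metrics for six frames.
labels = [1, 1, 1, 0, 0, 0]
features = {
    "mean_edge_width": [1.0, 1.2, 1.5, 3.0, 3.5, 4.0],
    "mean_intensity": [100, 120, 90, 110, 95, 115],
}
ranking = rank_features(features, labels)
```

Features near the bottom of such a ranking would be candidates for removal when refining the feature set. A realistic evaluation would score the features on held-out frames rather than on the training frames.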

    4. Text detection using features associated with neighboring glyph pairs
    Granted invention patent (status: in force)

    Publication number: US09367736B1

    Publication date: 2016-06-14

    Application number: US14842125

    Application date: 2015-09-01

    Abstract: A multi-orientation text detection method and associated system are disclosed that use orientation-variant glyph features to determine a text line in an image regardless of the orientation of the text line. Glyph features are determined for each glyph in an image with respect to a neighboring glyph. The glyph features are provided to a learned classifier that outputs a glyph pair score for each neighboring glyph pair. Each glyph pair score indicates the likelihood that the corresponding pair of neighboring glyphs forms part of the same text line. The glyph pair scores are used to identify candidate text lines, which are then ranked to select a final set of text lines in the image.
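
One way to picture the step from glyph pair scores to ranked candidate text lines is the sketch below. The abstract does not say how candidates are formed or ranked, so the approach here is an assumption: pairs scoring at or above a threshold are linked with union-find, each connected group becomes a candidate line, and candidates are ranked by their mean link score. The function name, threshold, and scores are hypothetical, and the classifier that produces the scores is not shown.

```python
def group_text_lines(num_glyphs, pair_scores, threshold=0.5):
    """Group glyphs into candidate lines by linking high-scoring pairs.

    pair_scores maps (i, j) glyph-index pairs to a score in [0, 1].
    Returns candidate lines (lists of glyph indices), best mean score first.
    Glyphs with no accepted link are left out.
    """
    parent = list(range(num_glyphs))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    links = {pair: s for pair, s in pair_scores.items() if s >= threshold}
    for i, j in links:
        parent[find(i)] = find(j)

    lines, link_scores = {}, {}
    for g in range(num_glyphs):
        lines.setdefault(find(g), []).append(g)
    for (i, _), s in links.items():
        link_scores.setdefault(find(i), []).append(s)

    ranked = sorted(
        link_scores,
        key=lambda root: sum(link_scores[root]) / len(link_scores[root]),
        reverse=True,
    )
    return [lines[root] for root in ranked]


# Hypothetical scores for five glyphs: 0-1-2 form one line, 3-4 another.
scores = {(0, 1): 0.9, (1, 2): 0.8, (2, 3): 0.1, (3, 4): 0.6}
candidates = group_text_lines(5, scores)
```

Because grouping depends only on the pair scores and not on glyph coordinates, this step is indifferent to the orientation of the text line, which is consistent with the multi-orientation goal stated in the abstract.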
