-
公开(公告)号:US20110123115A1
公开(公告)日:2011-05-26
申请号:US12626520
申请日:2009-11-25
申请人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
发明人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
CPC分类号: H04N5/23212 , G06K9/036 , G06K9/2081 , G06K9/228 , G06K9/325 , G06K2209/01 , H04N5/232 , H04N5/23248 , H04N5/23258 , H04N5/23264
摘要: A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.
摘要翻译: 由设备上相机拍摄的实时视频流将显示在具有重叠指南的屏幕上。 分析实时视频流的视频帧,以获得可接受的质量的视频帧。 在视频帧中识别文本区域,其近似于屏幕上的指南并从视频帧中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并以可编辑的符号形式(OCR的文本)生成文本。 确定OCR文本的置信度,并与阈值进行比较。 如果置信度分数超过阈值,则输出OCR的文本。
-
公开(公告)号:US08520983B2
公开(公告)日:2013-08-27
申请号:US12575015
申请日:2009-10-07
申请人: Dar-Shyang Lee , Lee-Feng Chien , Pin Ting , Aries Hsieh , Kin Wong
发明人: Dar-Shyang Lee , Lee-Feng Chien , Pin Ting , Aries Hsieh , Kin Wong
CPC分类号: G06K9/18 , G06K9/2081 , G06K2209/01
摘要: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.
摘要翻译: 图像显示在触摸屏上。 检测到用户在显示图像上的下划线手势。 识别由下划线手势触摸的图像的区域和接近触摸区域的周边区域。 确定并补偿周边地区的文字偏差。 在周围区域中识别包括文本的文本区域,并从图像中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并返回OCR的文本。 输出OCR的文本。
-
公开(公告)号:US20110081083A1
公开(公告)日:2011-04-07
申请号:US12575015
申请日:2009-10-07
申请人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
发明人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
IPC分类号: G06K9/18
CPC分类号: G06K9/18 , G06K9/2081 , G06K2209/01
摘要: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.
摘要翻译: 图像显示在触摸屏上。 检测到用户在显示图像上的下划线手势。 识别由下划线手势触摸的图像的区域和接近触摸区域的周边区域。 确定并补偿周边地区的文字偏差。 在周围区域中识别包括文本的文本区域,并从图像中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并返回OCR的文本。 输出OCR的文本。
-
公开(公告)号:US08515185B2
公开(公告)日:2013-08-20
申请号:US12626520
申请日:2009-11-25
申请人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
发明人: Dar-Shyang Lee , Lee-Feng Chien , Aries Hsieh , Pin Ting , Kin Wong
CPC分类号: H04N5/23212 , G06K9/036 , G06K9/2081 , G06K9/228 , G06K9/325 , G06K2209/01 , H04N5/232 , H04N5/23248 , H04N5/23258 , H04N5/23264
摘要: A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.
摘要翻译: 由设备上相机拍摄的实时视频流将显示在具有重叠指南的屏幕上。 分析实时视频流的视频帧,以获得可接受的质量的视频帧。 在视频帧中识别文本区域,其近似于屏幕上的指南并从视频帧中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并以可编辑的符号形式(OCR的文本)生成文本。 确定OCR文本的置信度,并与阈值进行比较。 如果置信度分数超过阈值,则输出OCR的文本。
-
-
-