Abstract:
Apparatuses and methods are provided for giving feedback to a user who may be visually impaired. In one implementation, a method is provided for providing feedback to a visually impaired user. The method comprises receiving, from a mobile image sensor, real-time image data that includes a representation of an object in the environment of the visually impaired user. The mobile image sensor is configured to be connected to glasses worn by the visually impaired user. Further, the method comprises receiving a signal indicating a desire of the visually impaired user to obtain information about the object. The method also includes accessing a database holding information about a plurality of objects, and comparing information derived from the received real-time image data with information in the database. The method further comprises providing the visually impaired user with nonvisual feedback that the object is not locatable in the database.
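A minimal Python sketch of the compare-and-report step described in this abstract. The describe() feature extractor, the distance threshold, and the audio_feedback() routine are placeholder assumptions, not part of the abstract.

# Sketch of the database-comparison step with nonvisual feedback on failure.
import math

def describe(image):
    # Placeholder descriptor: mean intensity and pixel count; a real system
    # would derive richer features from the real-time image data.
    flat = [p for row in image for p in row]
    return (sum(flat) / len(flat), float(len(flat)))

def audio_feedback(message):
    # Placeholder for nonvisual feedback (e.g., text-to-speech or a tone).
    print("[audio] " + message)

def lookup_object(image, database, threshold=10.0):
    """Compare information derived from the image with database entries;
    report nonvisually when the object cannot be located in the database."""
    query = describe(image)
    best_name, best_dist = None, float("inf")
    for name, descriptor in database.items():
        dist = math.dist(query, descriptor)
        if dist < best_dist:
            best_name, best_dist = name, dist
    if best_dist <= threshold:
        audio_feedback("Identified object: " + best_name)
        return best_name
    audio_feedback("The object is not in the database.")
    return None

if __name__ == "__main__":
    db = {"mug": (120.0, 64.0), "keys": (40.0, 64.0)}
    frame = [[200] * 8 for _ in range(8)]   # stand-in for one real-time frame
    lookup_object(frame, db)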
Abstract:
An apparatus and method are provided for performing one or more actions based on triggers detected within captured image data. In one implementation, a method is provided for audibly reading text retrieved from a captured image. According to the method, real-time image data is captured from an environment of a user, and the existence of a trigger is determined within the captured image data. In one aspect, the trigger may be associated with a desire of the user to hear text read aloud, and the trigger identifies an intermediate portion of the text at a distance from a level break in the text. The method includes performing a layout analysis on the text to identify the level break associated with the trigger, and reading aloud text beginning from the level break associated with the trigger.
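A minimal Python sketch of the layout-analysis step: given a trigger pointing at an intermediate position in the text, back up to the nearest level break and read aloud from there. Treating blank lines as level breaks and using a read_aloud() print stub are simplifying assumptions.

# Sketch: find the level break associated with a trigger and read from it.
def read_aloud(text):
    print("[tts] " + text)

def level_breaks(text):
    # Treat blank lines as level breaks separating paragraphs.
    breaks, offset = [0], 0
    for block in text.split("\n\n")[:-1]:
        offset += len(block) + 2
        breaks.append(offset)
    return breaks

def read_from_trigger(text, trigger_index):
    # Find the last level break at or before the triggered position.
    start = max(b for b in level_breaks(text) if b <= trigger_index)
    read_aloud(text[start:])

if __name__ == "__main__":
    page = "First paragraph of a sign.\n\nSecond paragraph, where the user points."
    read_from_trigger(page, trigger_index=40)  # points into the second paragraph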
Abstract:
According to an example, when a picture on a web page is triggered, it is determined whether the picture is a two-dimensional code picture. If it is, the user is prompted to decide whether to identify the two-dimensional code picture. After an instruction to identify the two-dimensional code picture is received from the user, the picture is parsed, two-dimensional code information is obtained, and processing is performed according to the two-dimensional code information.
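A minimal Python sketch of this flow: check whether a triggered web-page picture is a two-dimensional code, prompt the user, and only then parse it. The is_two_dimensional_code() and parse_two_dimensional_code() functions are placeholders; a real implementation would call an actual barcode/QR decoding library.

# Sketch of the prompt-then-parse flow for a triggered web-page picture.
def is_two_dimensional_code(picture_bytes):
    # Placeholder heuristic standing in for a real detector.
    return picture_bytes.startswith(b"QR")

def parse_two_dimensional_code(picture_bytes):
    # Placeholder decoder returning the embedded two-dimensional code information.
    return picture_bytes[2:].decode("utf-8")

def handle_triggered_picture(picture_bytes, ask_user=input):
    if not is_two_dimensional_code(picture_bytes):
        return None
    answer = ask_user("This picture looks like a two-dimensional code. Identify it? [y/n] ")
    if answer.strip().lower() != "y":
        return None
    info = parse_two_dimensional_code(picture_bytes)
    # Processing according to the two-dimensional code information, e.g. opening a URL.
    print("Decoded information:", info)
    return info

if __name__ == "__main__":
    handle_triggered_picture(b"QRhttps://example.com", ask_user=lambda prompt: "y")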
Abstract:
Provided is the display of text by a first device, where the text is extracted and used by a second communication device based upon a determined context of the text. The image displayed by the first device is captured by an image capture element of the second communication device, which also has a recognition module and an extraction module. The image includes the text and a context element representing a context of the text.
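A minimal Python sketch of the recognition/extraction split: a recognition step classifies the context of text found in the captured image, and an extraction step pulls out the text the second device will use. The context labels and regular expressions are illustrative assumptions, not the modules defined in the abstract.

# Sketch of context recognition followed by context-driven text extraction.
import re

def recognize_context(captured_text):
    # Recognition module: infer the context of the displayed text.
    if re.search(r"https?://", captured_text):
        return "url"
    if re.search(r"\+?\d[\d\s-]{6,}\d", captured_text):
        return "phone"
    return "plain"

def extract_text(captured_text, context):
    # Extraction module: pull the piece of text relevant to the context.
    if context == "url":
        return re.search(r"https?://\S+", captured_text).group(0)
    if context == "phone":
        return re.search(r"\+?\d[\d\s-]{6,}\d", captured_text).group(0)
    return captured_text

if __name__ == "__main__":
    captured = "Visit https://example.com or call +1 555-123-4567"
    ctx = recognize_context(captured)
    print(ctx, "->", extract_text(captured, ctx))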
Abstract:
A computer-implemented method of storing image data comprising images of a subject generated by a medical imaging device is disclosed. The method comprises: a) capturing the image data; b) receiving subject identification metadata; c) analysing at least one selected element of the image data to detect features identifying the subject and modifying the or each selected element of the image data by removing or obscuring any such detected features; and d) storing a subject record comprising the or each modified selected element of the image data and the subject identification metadata.
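A minimal Python sketch of steps c) and d): detect identifying features in a selected image element, obscure them, and store the modified element alongside the subject identification metadata. The feature detector is a placeholder; a real system might locate burned-in text or faces in the captured images.

# Sketch of de-identifying a selected image element before storage.
def detect_identifying_features(image):
    # Placeholder: pretend the top row of pixels carries burned-in subject text.
    return [(0, x) for x in range(len(image[0]))]

def obscure(image, features):
    modified = [row[:] for row in image]
    for y, x in features:
        modified[y][x] = 0          # remove/obscure the detected feature
    return modified

def store_subject_record(image_elements, metadata, records):
    modified = [obscure(el, detect_identifying_features(el)) for el in image_elements]
    records.append({"metadata": metadata, "elements": modified})
    return records

if __name__ == "__main__":
    element = [[255] * 4 for _ in range(3)]        # one captured image element
    store = store_subject_record([element], {"subject_id": "S-001"}, [])
    print(store[0]["elements"][0])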
Abstract:
A method includes tracking an object in each of a plurality of frames of video data to generate a tracking result. The method also includes performing object processing of a subset of frames of the plurality of frames selected according to a multi-frame latency of an object detector or an object recognizer. The method includes combining the tracking result with an output of the object processing to produce a combined output.
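A minimal Python sketch of combining a per-frame tracking result with detector output that is only available for a subset of frames, here every `latency` frames. The toy tracker, detector, and averaging rule are stand-ins for the components named in the abstract.

# Sketch: track every frame, detect on a subset of frames, combine the outputs.
def track(previous_position, frame):
    # Toy tracker: drift the previous estimate toward the frame's measurement.
    return previous_position + 0.5 * (frame - previous_position)

def detect(frame):
    # Toy detector/recognizer: expensive but accurate, so run only occasionally.
    return float(frame)

def process_video(frames, latency=3):
    position, combined = 0.0, []
    for i, frame in enumerate(frames):
        position = track(position, frame)            # tracking result, every frame
        if i % latency == latency - 1:               # detector output, every `latency` frames
            detection = detect(frame)
            position = 0.5 * (position + detection)  # combine the two outputs
        combined.append(position)
    return combined

if __name__ == "__main__":
    print(process_video([10, 10, 10, 12, 12, 12]))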
Abstract:
A method for recognizing a text block in an object is disclosed. The text block includes a set of characters. A plurality of images of the object are captured and received. The object in the received images is then identified by extracting a pattern in one of the object images and comparing the extracted pattern with predetermined patterns. Further, a boundary of the object in each of the object images is detected and verified based on predetermined size information of the identified object. Text blocks in the object images are identified based on predetermined location information of the identified object. Interim sets of characters in the identified text blocks are generated based on format information of the identified object. Based on the interim sets of characters, a set of characters in the text block in the object is determined.
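A minimal Python sketch of the final step: several interim character sets, one per captured image of the same text block, are combined into a single result by per-position majority vote. The interim sets are assumed to come from an OCR stage constrained by the identified object's format information.

# Sketch: determine the text block's characters from interim recognition results.
from collections import Counter

def combine_interim_sets(interim_sets):
    length = max(len(s) for s in interim_sets)
    result = []
    for position in range(length):
        votes = Counter(s[position] for s in interim_sets if position < len(s))
        result.append(votes.most_common(1)[0][0])   # majority vote per position
    return "".join(result)

if __name__ == "__main__":
    # Three noisy readings of the same text block from three captured images.
    print(combine_interim_sets(["AB12 3456", "A812 3456", "AB12 34S6"]))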
Abstract:
Detecting a static graphic object (such as a logo, title, or sub-title) in a sequence of video frames may be accomplished by analyzing each selected one of a plurality of pixels in a video frame of the sequence of video frames. Basic conditions for the selected pixel may be tested to determine whether the selected pixel is a static pixel. When the selected pixel is a static pixel, a static similarity measure and a forward motion similarity measure may be determined for the selected pixel. A temporal score for the selected pixel may be determined based at least in part on the similarity measures. Finally, a static graphic object decision for the selected pixel may be made based at least in part on the temporal score.
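A minimal Python sketch of the per-pixel scoring. The similarity measures and thresholds are simplified assumptions: static similarity compares a pixel with its value in the previous frame, and the forward motion term is approximated by comparison with the next frame.

# Sketch: per-pixel static graphic decision over a sequence of frames.
def static_graphic_decision(pixel_values, basic_threshold=8, score_threshold=0.8):
    """pixel_values: intensities of one pixel location across consecutive frames."""
    # Basic condition: the pixel barely changes over the whole sequence.
    if max(pixel_values) - min(pixel_values) > 4 * basic_threshold:
        return False
    scores = []
    for prev, cur, nxt in zip(pixel_values, pixel_values[1:], pixel_values[2:]):
        static_sim = 1.0 if abs(cur - prev) <= basic_threshold else 0.0
        forward_sim = 1.0 if abs(nxt - cur) <= basic_threshold else 0.0
        scores.append(0.5 * (static_sim + forward_sim))   # temporal score term
    temporal_score = sum(scores) / len(scores)
    return temporal_score >= score_threshold

if __name__ == "__main__":
    logo_pixel = [200, 201, 199, 200, 202, 200]   # nearly constant: part of a logo
    scene_pixel = [40, 90, 160, 30, 120, 75]      # changing scene content
    print(static_graphic_decision(logo_pixel), static_graphic_decision(scene_pixel))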
Abstract:
A system and method for adaptive-threshold Web page segmentation are disclosed. In one embodiment, a method performed by a physical computing system having one or more processors for segmenting a Web page including a plurality of nodes includes parsing content in the Web page into the plurality of nodes, obtaining feature values between each pair of nodes, estimating an adaptive threshold value using the obtained feature values, and segmenting the Web page by comparing the feature values associated with each pair of nodes with the estimated adaptive threshold value, each step being performed using the physical computing system.
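A minimal Python sketch of adaptive-threshold segmentation over a parsed node list. Parsing uses Python's html.parser; the single pairwise feature (difference in nesting depth between consecutive nodes) and the mean-based threshold are illustrative assumptions, not the feature set defined in the abstract.

# Sketch: parse nodes, compute pairwise features, estimate an adaptive
# threshold from them, and cut the node sequence into segments.
from html.parser import HTMLParser

class NodeCollector(HTMLParser):
    def __init__(self):
        super().__init__()
        self.depth, self.nodes = 0, []
    def handle_starttag(self, tag, attrs):
        self.depth += 1
        self.nodes.append((tag, self.depth))
    def handle_endtag(self, tag):
        self.depth -= 1

def segment(html):
    parser = NodeCollector()
    parser.feed(html)
    nodes = parser.nodes
    # Feature value for each pair of consecutive nodes.
    features = [abs(a[1] - b[1]) for a, b in zip(nodes, nodes[1:])]
    # Adaptive threshold estimated from the obtained feature values.
    threshold = sum(features) / len(features)
    segments, current = [], [nodes[0]]
    for feature, node in zip(features, nodes[1:]):
        if feature > threshold:                 # boundary between segments
            segments.append(current)
            current = []
        current.append(node)
    segments.append(current)
    return segments

if __name__ == "__main__":
    page = "<div><h1>Title</h1><p>Body</p></div><ul><li>a</li><li>b</li></ul>"
    for seg in segment(page):
        print(seg)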