Augmented reality with sound and geometric analysis
    1.
    Type: Granted invention patent
    Status: In force

    Publication No.: US09563265B2

    Publication date: 2017-02-07

    Application No.: US13585927

    Filing date: 2012-08-15

    CPC classification: G06F3/011 G06F3/167

    Abstract: A method for responding in an augmented reality (AR) application of a mobile device to an external sound is disclosed. The mobile device detects a target. A virtual object is initiated in the AR application. Further, the external sound is received, by at least one sound sensor of the mobile device, from a sound source. Geometric information between the sound source and the target is determined, and at least one response for the virtual object to perform in the AR application is generated based on the geometric information.
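
The abstract does not say how the geometric information is computed or how it maps to a response. As a minimal sketch, assuming the target and the sound source are already available as 2D positions in the device's coordinate frame, the geometric information could be a distance and a bearing from the target to the source. The function names, the `near` threshold, and the response strings are hypothetical and do not come from the patent.

```python
import math


def geometric_info(target_xy, source_xy):
    """Return (distance, bearing in degrees) from the target to the sound source.

    Both positions are assumed to be 2D points in the device's frame.
    """
    dx = source_xy[0] - target_xy[0]
    dy = source_xy[1] - target_xy[1]
    distance = math.hypot(dx, dy)
    bearing_deg = math.degrees(math.atan2(dy, dx))
    return distance, bearing_deg


def choose_response(distance, bearing_deg, near=1.0):
    """Pick a response for the virtual object (hypothetical policy).

    A nearby sound makes the object look toward it; a distant one makes it move.
    """
    if distance <= near:
        return f"look toward {bearing_deg:.0f} degrees"
    return f"move toward {bearing_deg:.0f} degrees"


if __name__ == "__main__":
    dist, bearing = geometric_info(target_xy=(0.0, 1.0), source_xy=(1.0, 1.0))
    print(choose_response(dist, bearing))
```

A real system would first have to estimate the source position from the sound sensors and the target position from the camera; both steps are outside this sketch.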

    Camera OCR with context information
    2.
    Type: Granted invention patent
    Status: In force

    Publication No.: US09082035B2

    Publication date: 2015-07-14

    Application No.: US13450016

    Filing date: 2012-04-18

    Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.
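
One way to picture the dictionary-selection step is the sketch below. It assumes the context has already been inferred as a plain string and that each context maps to a small word list; it then snaps a raw OCR token to the closest entry using `difflib.get_close_matches` from the Python standard library. The `CONTEXT_DICTIONARIES` table, the function name, and the cutoff are hypothetical, and fuzzy string matching is a stand-in here, not the matching method the patent describes.

```python
import difflib

# Hypothetical context-specific dictionaries (illustrative contents only).
CONTEXT_DICTIONARIES = {
    "restaurant": ["burger", "salad", "coffee"],
    "pharmacy": ["aspirin", "bandage", "vitamin"],
}


def refine_ocr(raw_token, context, cutoff=0.6):
    """Return the dictionary word closest to raw_token for the given context.

    Falls back to the raw token when the context is unknown or nothing
    in the selected dictionary is similar enough.
    """
    dictionary = CONTEXT_DICTIONARIES.get(context, [])
    matches = difflib.get_close_matches(raw_token, dictionary, n=1, cutoff=cutoff)
    return matches[0] if matches else raw_token


if __name__ == "__main__":
    print(refine_ocr("burgar", "restaurant"))
    print(refine_ocr("burgar", "pharmacy"))
```

The same raw token is corrected in one context and left alone in another, which is the point of selecting the dictionary from the inferred context.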

    CAMERA OCR WITH CONTEXT INFORMATION
    3.
    Type: Invention patent application
    Status: In force

    Publication No.: US20130108115A1

    Publication date: 2013-05-02

    Application No.: US13450016

    Filing date: 2012-04-18

    IPC classification: G06K9/20

    Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.

    AUGMENTED REALITY WITH SOUND AND GEOMETRIC ANALYSIS
    4.
    Type: Invention patent application
    Status: In force

    Publication No.: US20130182858A1

    Publication date: 2013-07-18

    Application No.: US13585927

    Filing date: 2012-08-15

    IPC classification: H04R29/00

    CPC classification: G06F3/011 G06F3/167

    Abstract: A method for responding in an augmented reality (AR) application of a mobile device to an external sound is disclosed. The mobile device detects a target. A virtual object is initiated in the AR application. Further, the external sound is received, by at least one sound sensor of the mobile device, from a sound source. Geometric information between the sound source and the target is determined, and at least one response for the virtual object to perform in the AR application is generated based on the geometric information.

    Parallel processing method and apparatus for determining text information from an image
    5.
    Type: Granted invention patent
    Status: In force

    Publication No.: US09202127B2

    Publication date: 2015-12-01

    Application No.: US13539797

    Filing date: 2012-07-02

    Abstract: A method for processing a multi-channel image is disclosed. The method includes generating a plurality of grayscale images from the multi-channel image. At least one text region is identified in the plurality of grayscale images and text region information is determined from the at least one text region. The method generates text information of the multi-channel image based on the text region information. If the at least one text region includes a plurality of text regions, text region information from the plurality of text regions is merged to generate the text information. The plurality of the grayscale images is processed in parallel. In identifying the at least one text region, at least one candidate text region may be identified in the plurality of grayscale images and the at least one text region may be identified in the identified candidate text region.
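
The pipeline in this abstract (split into grayscale images, detect text regions in parallel, merge) can be sketched roughly as follows. The sketch assumes an image is a list of rows of `(r, g, b)` tuples, treats each channel as one grayscale image, and uses a placeholder detector that returns the bounding box of dark pixels. Merging by taking the union bounding box is also an assumption; the abstract does not specify the detector or the merge rule, and all function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def split_channels(image):
    """Turn an H x W image of channel tuples into one grayscale image per channel."""
    n_channels = len(image[0][0])
    return [[[px[c] for px in row] for row in image] for c in range(n_channels)]


def find_text_region(gray, threshold=128):
    """Placeholder detector: bounding box (top, left, bottom, right) of dark pixels."""
    coords = [(y, x) for y, row in enumerate(gray)
              for x, value in enumerate(row) if value < threshold]
    if not coords:
        return None
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    return (min(ys), min(xs), max(ys), max(xs))


def merge_regions(regions):
    """Merge per-channel regions into one box (assumed rule: union bounding box)."""
    found = [r for r in regions if r is not None]
    if not found:
        return None
    return (min(r[0] for r in found), min(r[1] for r in found),
            max(r[2] for r in found), max(r[3] for r in found))


def text_region_of(image):
    """Run the detector on every grayscale image in parallel, then merge."""
    grays = split_channels(image)
    with ThreadPoolExecutor() as pool:
        regions = list(pool.map(find_text_region, grays))
    return merge_regions(regions)


if __name__ == "__main__":
    white = (255, 255, 255)
    image = [[white] * 3 for _ in range(3)]
    image[0][0] = (0, 255, 255)   # dark only in the first channel
    image[2][1] = (255, 0, 255)   # dark only in the second channel
    print(text_region_of(image))
```

Threads are used here only to keep the example short; for CPU-bound detectors in Python a process pool would usually be the better fit.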

    System and method for recognizing text information in object
    6.
    Type: Granted invention patent
    Status: In force

    Publication No.: US09418304B2

    Publication date: 2016-08-16

    Application No.: US13367764

    Filing date: 2012-02-07

    IPC classification: G06K9/22 G06K9/62 G06K9/32

    Abstract: A method for recognizing a text block in an object is disclosed. The text block includes a set of characters. A plurality of images of the object are captured and received. The object in the received images is then identified by extracting a pattern in one of the object images and comparing the extracted pattern with predetermined patterns. Further, a boundary of the object in each of the object images is detected and verified based on predetermined size information of the identified object. Text blocks in the object images are identified based on predetermined location information of the identified object. Interim sets of characters in the identified text blocks are generated based on format information of the identified object. Based on the interim sets of characters, a set of characters in the text block in the object is determined.
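
The abstract leaves open how the final set of characters is determined from the interim sets. One plausible rule, shown here purely as an assumption, is a per-position majority vote across the interim results read from the different images. The function name is hypothetical, and the sketch simply discards interim sets whose length differs from the most common length.

```python
from collections import Counter


def combine_interim_sets(interim_sets):
    """Combine per-image OCR strings by majority vote at each character position.

    Assumed rule, not taken from the patent: keep only strings of the most
    common length, then pick the most frequent character in each column.
    """
    if not interim_sets:
        return ""
    length = Counter(len(s) for s in interim_sets).most_common(1)[0][0]
    usable = [s for s in interim_sets if len(s) == length]
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*usable))


if __name__ == "__main__":
    reads = ["4111 1111", "4l11 1111", "4111 1I11"]
    print(combine_interim_sets(reads))
```

Each misread character (`l`, `I`) is outvoted by the two other images, so isolated recognition errors do not reach the final result.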

    Parallel Processing Method and Apparatus for Determining Text Information from an Image
    7.
    Type: Invention patent application
    Status: In force

    Publication No.: US20130011055A1

    Publication date: 2013-01-10

    Application No.: US13539797

    Filing date: 2012-07-02

    IPC classification: G06K9/36 G06K9/46

    Abstract: A method for processing a multi-channel image is disclosed. The method includes generating a plurality of grayscale images from the multi-channel image. At least one text region is identified in the plurality of grayscale images and text region information is determined from the at least one text region. The method generates text information of the multi-channel image based on the text region information. If the at least one text region includes a plurality of text regions, text region information from the plurality of text regions is merged to generate the text information. The plurality of the grayscale images is processed in parallel. In identifying the at least one text region, at least one candidate text region may be identified in the plurality of grayscale images and the at least one text region may be identified in the identified candidate text region.

    SYSTEM AND METHOD FOR RECOGNIZING TEXT INFORMATION IN OBJECT
    8.
    Type: Invention patent application
    Status: In force

    Publication No.: US20130004076A1

    Publication date: 2013-01-03

    Application No.: US13367764

    Filing date: 2012-02-07

    IPC classification: G06K9/34 G06K9/18

    Abstract: A method for recognizing a text block in an object is disclosed. The text block includes a set of characters. A plurality of images of the object are captured and received. The object in the received images is then identified by extracting a pattern in one of the object images and comparing the extracted pattern with predetermined patterns. Further, a boundary of the object in each of the object images is detected and verified based on predetermined size information of the identified object. Text blocks in the object images are identified based on predetermined location information of the identified object. Interim sets of characters in the identified text blocks are generated based on format information of the identified object. Based on the interim sets of characters, a set of characters in the text block in the object is determined.

    Method to reject false positives detecting and tracking image objects
    9.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08836799B2

    Publication date: 2014-09-16

    Application No.: US13550531

    Filing date: 2012-07-16

    IPC classification: H04N5/228

    Abstract: Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the device is calculated using the global movement of the mobile phone with reference to other objects in the field of view of the camera.
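
One reading of the inertial-sensor embodiment is that a tracked object's apparent motion in the image is compared against the motion the device's own measured movement would produce, and a large disagreement marks the track as a false positive. The sketch below illustrates that reading only. It assumes a pinhole camera, a static scene point near the image center, a yaw angle already integrated from gyroscope data, and a sign convention in which a positive yaw shifts the scene in the negative image direction. The function names, focal length, and tolerance are hypothetical.

```python
import math


def expected_shift_px(yaw_rad, focal_px):
    """Horizontal image shift of a static, centered scene point after a camera yaw.

    Pinhole-camera approximation; the sign convention is an assumption.
    """
    return -focal_px * math.tan(yaw_rad)


def is_false_positive(tracked_shift_px, yaw_rad, focal_px=500.0, tolerance_px=10.0):
    """Flag a track whose image motion disagrees with the measured device motion."""
    error = abs(tracked_shift_px - expected_shift_px(yaw_rad, focal_px))
    return error > tolerance_px


if __name__ == "__main__":
    # Device held still, yet the "object" jumped 50 px: likely a false positive.
    print(is_false_positive(tracked_shift_px=50.0, yaw_rad=0.0))
    # Device yawed by 0.1 rad and the track moved consistently with that rotation.
    print(is_false_positive(tracked_shift_px=-48.0, yaw_rad=0.1))
```

The second embodiment in the abstract, which derives device movement from other objects in the field of view, would replace the gyroscope-derived yaw with an image-based estimate.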

    METHOD TO REJECT FALSE POSITIVES DETECTING AND TRACKING IMAGE OBJECTS
    10.
    Type: Invention patent application
    Status: In force

    Publication No.: US20130258141A1

    Publication date: 2013-10-03

    Application No.: US13550531

    Filing date: 2012-07-16

    IPC classification: H04N5/235

    Abstract: Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the device is calculated using the global movement of the mobile phone with reference to other objects in the field of view of the camera.
