Architectures for object recognition

    Publication No.: US09904866B1

    Publication date: 2018-02-27

    Application No.: US13529638

    Filing date: 2012-06-21

    Applicant: Isaac S. Noble

    Inventor: Isaac S. Noble

    Abstract: The accuracy of an image matching and/or object identification process can be improved by utilizing a BCM network-based process that maintains higher order relationships between features in an image. A dataset of images can be converted to floating point vectors and then processed using a BCM-based approach. The resulting vectors can be stored as an image library for purposes of matching subsequently received images. When a match is located for a query image, for example, information associated with the matching image can be provided in order to help identify one or more objects in the received query image.
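
    The storage-and-lookup step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the BCM-based processing is not shown, the vectors are assumed to be computed already, and the names `ImageLibrary`, `add`, and `best_match`, the cosine-similarity measure, and the `min_score` threshold are all hypothetical choices.

```python
import math
from typing import Dict, List, Optional, Tuple

Vector = List[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class ImageLibrary:
    """Hypothetical store of processed image vectors plus associated info."""

    def __init__(self) -> None:
        self._entries: Dict[str, Tuple[Vector, str]] = {}

    def add(self, image_id: str, vector: Vector, info: str) -> None:
        self._entries[image_id] = (vector, info)

    def best_match(self, query: Vector,
                   min_score: float = 0.9) -> Optional[Tuple[str, str, float]]:
        """Return (image_id, info, score) for the closest entry, or None
        when no entry reaches the assumed min_score threshold."""
        best: Optional[Tuple[str, str, float]] = None
        for image_id, (vector, info) in self._entries.items():
            score = cosine_similarity(query, vector)
            if score >= min_score and (best is None or score > best[2]):
                best = (image_id, info, score)
        return best
```

    A caller would populate the library once from the processed dataset and then call `best_match` for each query vector; the returned `info` string stands in for the "information associated with the matching image".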

    Noise reduction based on mouth area movement recognition
    2.
    Granted patent (in force)

    Publication No.: US09263044B1

    Publication date: 2016-02-16

    Application No.: US13534388

    Filing date: 2012-06-27

    IPC classification: G10L15/24 G10L15/25

    Abstract: A computing device can capture video data of at least a portion of a mouth area (e.g., mouth, lips, tongue, chin, jaw) of a user of the device. The computing device can also capture sound data including a voice of the user as well as noise (e.g., background noise). The video data can be processed to detect a movement of the portion of the mouth area. The movement of the portion of the mouth area can be analyzed and compared with mouth area movement models characteristic of oral communication (e.g., speech, song). If the movement of the portion of the mouth area corresponds to at least one model characteristic of oral communication, then the movement indicates that the user is likely engaging in oral communication. Noise reduction can be applied and/or increased on the captured sound data to reduce noise and in turn enhance the user's voice.
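
    The control flow in this abstract (detect mouth movement, compare it with a model, then raise noise reduction) can be sketched as below. Everything here is an assumption made for illustration: frames are taken to be small grayscale crops of the mouth area, the "model" is reduced to a simple activity threshold, and the function names and numeric levels are hypothetical.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # grayscale crop of the mouth area


def mean_abs_difference(prev: Frame, curr: Frame) -> float:
    """Average per-pixel intensity change between two frames."""
    total = 0
    count = 0
    for row_prev, row_curr in zip(prev, curr):
        for before, after in zip(row_prev, row_curr):
            total += abs(after - before)
            count += 1
    return total / count if count else 0.0


def movement_profile(frames: Sequence[Frame]) -> List[float]:
    """Motion measure for each pair of consecutive frames."""
    return [mean_abs_difference(a, b) for a, b in zip(frames, frames[1:])]


def matches_speech_model(profile: List[float], min_motion: float = 5.0,
                         min_active_fraction: float = 0.5) -> bool:
    """Stand-in for a movement model: enough frame pairs show motion."""
    if not profile:
        return False
    active = sum(1 for motion in profile if motion >= min_motion)
    return active / len(profile) >= min_active_fraction


def noise_reduction_level(frames: Sequence[Frame], base_level: float = 0.2,
                          boosted_level: float = 0.8) -> float:
    """Raise noise reduction only while the user appears to be speaking."""
    if matches_speech_model(movement_profile(frames)):
        return boosted_level
    return base_level
```

    A real system would replace `matches_speech_model` with a comparison against learned movement models; the threshold version only shows where that decision sits in the pipeline.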

    Camera based sensor for motion detection
    3.
    Granted patent (in force)

    Publication No.: US09176608B1

    Publication date: 2015-11-03

    Application No.: US13170164

    Filing date: 2011-06-27

    IPC classification: G06F3/041 G09G5/00 G06F3/033

    Abstract: The amount of power and processing needed to process gesture input for a computing device can be reduced by utilizing a separate gesture sensor. The gesture sensor can have a form factor similar to that of conventional camera elements, in order to reduce costs by being able to utilize readily available low cost parts, but can have a lower resolution and adjustable virtual shutter such that fast motions can be captured and/or recognized by the device. In some devices, a subset of the pixels of the gesture sensor can be used as a motion detector, enabling the gesture sensor to run in a low power state unless there is likely gesture input to process. Further, at least some of the processing and circuitry can be included with the gesture sensor such that various functionality can be performed without accessing a central processor or system bus, thus further reducing power consumption.
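
    The idea of using a subset of pixels as a motion detector can be illustrated with the sketch below. It is a software approximation under assumptions not found in the abstract: the sampling grid (`step`), the change thresholds, and the state names `"low_power"` and `"full_power"` are all hypothetical.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # grayscale frame as rows of pixel intensities


def sample_pixels(frame: Frame, step: int = 4) -> List[int]:
    """Return every `step`-th pixel in both directions (the monitored subset)."""
    return [
        frame[row][col]
        for row in range(0, len(frame), step)
        for col in range(0, len(frame[row]), step)
    ]


def motion_detected(prev: Frame, curr: Frame, step: int = 4,
                    pixel_delta: int = 25, min_changed: int = 2) -> bool:
    """True when enough sampled pixels changed by at least `pixel_delta`."""
    changed = sum(
        1
        for before, after in zip(sample_pixels(prev, step),
                                 sample_pixels(curr, step))
        if abs(after - before) >= pixel_delta
    )
    return changed >= min_changed


def next_power_state(state: str, prev: Frame, curr: Frame) -> str:
    """Stay in "low_power" until the sampled subset shows motion."""
    if state == "low_power" and motion_detected(prev, curr):
        return "full_power"
    return state
```

    Sampling fewer pixels lowers the cost of each check but can miss small changes that fall between sampled positions, which is the trade-off the `step` parameter exposes.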

    Estimating fingertip position using image analysis
    4.
    Granted patent (in force)

    Publication No.: US09041689B1

    Publication date: 2015-05-26

    Application No.: US13565019

    Filing date: 2012-08-02

    Abstract: A computing device and/or application executing on the device can utilize fingertip tracking using a camera. However, when the fingertip is in a dead zone (an area that is not viewable by the camera), the fingertip tracking cannot function properly. Nevertheless, the position of the fingertip, when in the dead zone, can still be estimated. An image of a user's hand can be captured by at least one camera. A portion of a pointing finger can be detected in the captured image. An orientation of the portion of the pointing finger can be determined. One or more joint lines of the pointing finger can be identified. Based on data about a slant and/or a bend of the pointing finger obtained using information relating to the identified joint line(s), and/or on data obtained via calibration and/or historic/current usage, the position of the fingertip, when in the dead zone, can be approximated.
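
    One simple way to approximate a hidden fingertip from the visible part of the finger is linear extrapolation, sketched below. The abstract does not specify this calculation: the sketch assumes a straight finger (no bend), 2D image coordinates, two detected joint positions, and a calibrated joint-to-tip length, and the name `estimate_fingertip` is hypothetical.

```python
import math
from typing import Tuple

Point = Tuple[float, float]


def estimate_fingertip(proximal_joint: Point, distal_joint: Point,
                       tip_segment_length: float) -> Point:
    """Extend the line through two visible joints past the distal joint.

    tip_segment_length is an assumed calibration value: the distance from
    the distal joint to the fingertip, in the same units as the points.
    """
    dx = distal_joint[0] - proximal_joint[0]
    dy = distal_joint[1] - proximal_joint[1]
    length = math.hypot(dx, dy)
    if length == 0.0:
        raise ValueError("joints coincide; finger orientation is undefined")
    # Unit vector along the finger, scaled by the calibrated tip length.
    return (
        distal_joint[0] + dx / length * tip_segment_length,
        distal_joint[1] + dy / length * tip_segment_length,
    )
```

    Accounting for slant or bend, as the abstract mentions, would require adjusting the direction or length based on the spacing of the joint lines; that refinement is left out here.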

    Gesture recognition for device input
    5.
    Granted patent (in force)

    Publication No.: US08819812B1

    Publication date: 2014-08-26

    Application No.: US13587586

    Filing date: 2012-08-16

    IPC classification: G06F21/00

    CPC classification: G06F3/017 G06F21/31 G06F21/36

    Abstract: A user can make a symbol with their hand, or other such gesture, at a distance from a computing device that can be captured by at least one imaging element of the device. The captured information can be analyzed to attempt to determine the location of distinguishing features of the symbol in the image information. The image information is then compared to hand gesture information stored in, for example, a library of hand gestures for the user. Upon identifying a match, an input to an application executing on the computing device is provided when the image information contains information matching at least one hand gesture with at least a minimum level of certainty. The hand gesture could include a single “static” gesture, such as a specific letter in sign language, for example, or include two or more “static” gestures. The gesture could also include motion, such as hand movement.

    Collaboration of device resources
    6.
    Granted patent (in force)

    Publication No.: US08683054B1

    Publication date: 2014-03-25

    Application No.: US13215591

    Filing date: 2011-08-23

    IPC classification: G06F15/16

    Abstract: Computing devices can collaborate in order to take advantage of various components distributed across those devices. In various embodiments, image information captured by multiple devices can be used to identify and determine the relative locations of various persons and objects near those devices, even when not every device can view those persons or objects. In some embodiments, one or more audio or video capture elements can be selected based on their proximity and orientation to an object to be captured. In some embodiments, the information captured from the various audio and/or video elements can be combined to provide three-dimensional imaging, surround sound, and other such capture data.

    Inter-device location determinations
    7.
    Granted patent (in force)

    Publication No.: US08634848B1

    Publication date: 2014-01-21

    Application No.: US12893930

    Filing date: 2010-09-29

    IPC classification: H04W24/00

    Abstract: Electronic devices can identify other nearby devices, and determine the relative positions of those devices, using a combination of techniques. Various devices are able to project one or more instances of a unique identifier, such as a barcode, which can be imaged by other devices. The devices also can communicate position and/or orientation information over a wireless sideband channel. By combining the information relating to the projected identifier with information collected over the sideband channel, devices can automatically determine the location of various devices and associate a user or device identity with those devices. A user of a device then can view relative locations of those devices on a display element, including information about the user of the device. Further, the relative position determinations can enable a user to perform certain functions with respect to another device based at least in part upon the position and/or identity of that device.

    MULTI-DISPLAY TYPE DEVICE INTERACTIONS
    8.
    Patent application (in force)

    Publication No.: US20120218191A1

    Publication date: 2012-08-30

    Application No.: US13035325

    Filing date: 2011-02-25

    IPC classification: G06F3/041

    Abstract: An electronic device including two or more display elements can provide enhanced functionality with improved rates of power consumption. A user can cause information that does not change rapidly to be provided or moved to a relatively static display element, such as an electronic ink display, which enables that information to be displayed for a period of time with little additional power consumption. Similarly, content (e.g., video) that changes rapidly can be displayed on a relatively dynamic display element, such as an LCD or OLED display. Each display can be touch sensitive, such that a user can move content between the displays by pressing on, or making a motion in contact with, at least one of the displays. Various modes can be activated which cause certain types of content to be displayed on the dynamic and/or static display element.
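
    The routing decision described in this abstract can be sketched as a small function. The sketch rests on assumptions that are not in the abstract: content is characterized by a single `updates_per_second` figure, the cutoff `static_max_rate` is arbitrary, and the names `Content`, `choose_display`, and `user_override` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Content:
    name: str
    updates_per_second: float  # how often the content changes on screen


def choose_display(content: Content, static_max_rate: float = 1.0,
                   user_override: Optional[str] = None) -> str:
    """Pick "static" (e.g., electronic ink) or "dynamic" (e.g., LCD/OLED).

    user_override models the user moving content between displays by touch;
    when given, it takes precedence over the rate-based choice.
    """
    if user_override is not None:
        if user_override not in ("static", "dynamic"):
            raise ValueError("user_override must be 'static' or 'dynamic'")
        return user_override
    if content.updates_per_second <= static_max_rate:
        return "static"
    return "dynamic"
```

    For example, a page of book text with near-zero update rate would go to the static display, while video would go to the dynamic one unless the user explicitly moved it.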

    Dynamic content discoverability
    9.
    Granted patent

    Publication No.: US09754016B1

    Publication date: 2017-09-05

    Application No.: US12980770

    Filing date: 2010-12-29

    IPC classification: G06F17/30

    Abstract: A user interacting with an electronic device can receive suggestions for applications or services that can help the user with a specific task. The user might enter information that can be processed semantically to determine one or more actions, objects, or other types of information useful in discovering related applications or services, which might be local or remote to that device. One or more of these discovered applications or services is selected to suggest to the user based on any number of criteria, such as user behavior, preferences, location, time of day, etc. Using such an approach, a device can utilize a semantic process to attempt to infer an action or intent relating to a simple or complex task, and can discover and suggest applications or services that can assist with that task even where the user is unaware of, or not looking for, such an application or service.

    Finger detection for element selection
    10.
    Granted patent (in force)

    Publication No.: US09400575B1

    Publication date: 2016-07-26

    Application No.: US13528532

    Filing date: 2012-06-20

    IPC classification: G06F3/033 G09G5/08 G06F3/042

    Abstract: A user can use a finger, or other such object, to provide input to a computing device. The finger does not have to contact the device, but can be positioned and/or oriented in such a way that the device can determine an input that the user is attempting to provide, such as an element or icon that the user intends to select. One or more cameras can capture image information, which can be analyzed to attempt to determine the location and/or orientation of the finger. If the finger is at least partially outside a field of view of the camera(s), the device can use a sensor (e.g., EMF) to attempt to determine a location of at least a portion of the finger, which can be used with the image information to determine the location and/or orientation of the finger. Other estimation processes can be used as well.
