Dynamic hand gesture recognition using depth data
    1.
    Granted invention patent
    Dynamic hand gesture recognition using depth data (in force)

    Publication No.: US09536135B2

    Publication date: 2017-01-03

    Application No.: US13526501

    Filing date: 2012-06-18

    IPC classification: G06K9/00

    Abstract: The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.
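
The offline and online stages described in this abstract can be sketched as below. This is a minimal illustration under stated assumptions, not the patented method: the abstract names neither the features nor the classifier, so `extract_features` (mean depth plus mean frame-to-frame change) and the nearest-centroid classifier are hypothetical stand-ins chosen for brevity.

```python
from math import dist


def extract_features(frames):
    """Hypothetical feature extractor (an assumption, not from the patent).

    frames: list of depth frames, each a flat list of depth values.
    Returns [overall mean depth, mean frame-to-frame change in mean depth].
    """
    means = [sum(f) / len(f) for f in frames]
    deltas = [b - a for a, b in zip(means, means[1:])]
    mean_delta = sum(deltas) / len(deltas) if deltas else 0.0
    return [sum(means) / len(means), mean_delta]


def train(labelled_sequences):
    """Offline stage: average the feature vectors of each gesture label."""
    sums, counts = {}, {}
    for label, frames in labelled_sequences:
        vec = extract_features(frames)
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, v in enumerate(vec):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def recognize(centroids, frames):
    """Online stage: label an unknown gesture by its nearest centroid."""
    vec = extract_features(frames)
    return min(centroids, key=lambda label: dist(vec, centroids[label]))


# Toy data: depth shrinking over time ("push") versus growing ("pull").
push = [[100, 100], [90, 90], [80, 80]]
pull = [[80, 80], [90, 90], [100, 100]]
model = train([("push", push), ("pull", pull)])
result = recognize(model, [[101, 99], [91, 89], [79, 81]])
```

Because only depth values feed the feature vector, a sketch like this is unaffected by lighting, which matches the robustness the abstract mentions; robustness to gesturing speed and hand orientation would need features the sketch does not attempt.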

    Dynamic Hand Gesture Recognition Using Depth Data
    2.
    Invention patent application
    Dynamic Hand Gesture Recognition Using Depth Data (in force)

    Publication No.: US20130336524A1

    Publication date: 2013-12-19

    Application No.: US13526501

    Filing date: 2012-06-18

    IPC classification: G06K9/46

    Abstract: The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.

    Data buddy
    3.
    Granted invention patent
    Data buddy (in force)

    Publication No.: US09055607B2

    Publication date: 2015-06-09

    Application No.: US12323570

    Filing date: 2008-11-26

    Abstract: Multi-modal, multi-lingual devices can be employed to consolidate numerous items including, but not limited to, keys, remote controls, image capture devices, audio recorders, cellular telephone functionalities, location/direction detectors, health monitors, calendars, gaming devices, smart home inputs, pens, optical pointing devices or the like. For example, a corner of a cellular telephone can be used as an electronic pen. Moreover, the device can be used to snap multiple pictures and stitch them together to create a panoramic image. A device can automate ignition of an automobile, initiate appliances, etc. based upon relative distance. The device can provide near-to-eye capabilities for enhanced image viewing. Multiple cameras/sensors can be provided on a single device to provide stereoscopic capabilities. By consolidating services, the device can also provide assistance for blind users, privacy, etc.

    Spatialized audio over headphones
    6.
    Granted invention patent
    Spatialized audio over headphones (in force)

    Publication No.: US08737648B2

    Publication date: 2014-05-27

    Application No.: US12472080

    Filing date: 2009-05-26

    IPC classification: H04R5/02

    CPC classification: H04R27/00

    Abstract: A spatial element is added to communications, including telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.
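
One simple way to place callers at different apparent positions is to pan each mono signal into the stereo field before mixing. The sketch below is an assumption for illustration only: the abstract does not say which functions modify the signals, and constant-power panning is a much cruder stand-in than, for example, head-related filtering. The names `pan_gains` and `mix_callers` are hypothetical.

```python
import math


def pan_gains(azimuth_deg):
    """Constant-power pan gains (left, right).

    azimuth_deg: -90 is hard left, 0 is centre, +90 is hard right.
    """
    theta = (azimuth_deg + 90.0) / 180.0 * (math.pi / 2)
    return math.cos(theta), math.sin(theta)


def mix_callers(callers):
    """Mix callers into stereo.

    callers: list of (azimuth_deg, mono_samples) pairs.
    Returns (left_samples, right_samples), as long as the longest input.
    """
    length = max(len(samples) for _, samples in callers)
    left, right = [0.0] * length, [0.0] * length
    for azimuth, samples in callers:
        gain_l, gain_r = pan_gains(azimuth)
        for i, s in enumerate(samples):
            left[i] += gain_l * s
            right[i] += gain_r * s
    return left, right


# Two callers: one placed hard left, one hard right.
left, right = mix_callers([(-90.0, [1.0, 1.0]), (90.0, [0.5])])
```

Constant-power gains keep the perceived loudness of a caller roughly the same wherever that caller is placed, because the squared gains always sum to one.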

    Distinguishing live faces from flat surfaces
    7.
    Granted invention patent
    Distinguishing live faces from flat surfaces (in force)

    Publication No.: US08675926B2

    Publication date: 2014-03-18

    Application No.: US12796470

    Filing date: 2010-06-08

    IPC classification: G06K9/00

    CPC classification: G06K9/00228, G06K9/00906

    Abstract: Multiple images including a face presented by a user are accessed. One or more determinations are made based on the multiple images, such as a determination of whether the face included in the multiple images is a 3-dimensional structure or a flat surface and/or a determination of whether motion is present in one or more face components (e.g., eyes or mouth). If it is determined that the face included in the multiple images is a 3-dimensional structure or that motion is present in the one or more face components, then an indication is provided that the user can be authenticated. However, if it is determined that the face included in the multiple images is a flat surface or that motion is not present in the one or more face components, then an indication is provided that the user cannot be authenticated.
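
The decision step in this abstract can be sketched as a function over whichever determinations were made. The abstract does not say what happens when both determinations are made and they disagree; the sketch assumes the conservative choice of requiring every performed check to pass. The function name and parameters are hypothetical.

```python
from typing import Optional


def can_authenticate(is_3d: Optional[bool] = None,
                     has_motion: Optional[bool] = None) -> bool:
    """Combine the liveness determinations that were actually made.

    None means the corresponding determination was not performed.
    Assumption (not stated in the abstract): if both were performed,
    both must indicate a live face.
    """
    results = [r for r in (is_3d, has_motion) if r is not None]
    if not results:
        raise ValueError("at least one liveness determination is required")
    return all(results)


only_depth = can_authenticate(is_3d=True)          # 3-D structure found
only_motion = can_authenticate(has_motion=False)   # no eye or mouth motion
disagree = can_authenticate(is_3d=True, has_motion=False)
```

With this policy, `only_depth` is accepted, while `only_motion` and `disagree` are rejected.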

    Virtual sound source positioning
    9.
    Granted invention patent
    Virtual sound source positioning (in force)

    Publication No.: US08620009B2

    Publication date: 2013-12-31

    Application No.: US12140283

    Filing date: 2008-06-17

    IPC classification: H04R5/02, H04R5/00, H04B1/00

    CPC classification: H04S7/302, H04S2400/11

    Abstract: Systems and methods are provided for determining a virtual sound source position by determining an output for loudspeakers based on the position of the loudspeakers in relation to a listener. The output of respective loudspeakers is generated using aural cues to give the listener knowledge of the virtual position of the virtual sound source. Both a gain in intensity and a delay are simulated.
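
The two cues named in this abstract, gain and delay, can be illustrated with a small geometric sketch. It assumes a simplification that the abstract does not state: each loudspeaker's cues are derived from its distance to the virtual source, with an inverse-distance gain and a propagation delay relative to the nearest loudspeaker. The function `speaker_cues` and the constant `SPEED_OF_SOUND` are hypothetical names.

```python
import math

SPEED_OF_SOUND = 343.0  # metres per second in air at about 20 degrees C


def speaker_cues(source, speakers):
    """Return a (gain, delay_seconds) pair for each loudspeaker.

    source: (x, y) position of the virtual sound source, in metres.
    speakers: list of (x, y) loudspeaker positions, in metres.
    """
    distances = [math.dist(source, s) for s in speakers]
    nearest = min(distances)
    cues = []
    for d in distances:
        # Inverse-distance gain, normalised so the nearest speaker has gain 1.
        gain = nearest / d if d > 0 else 1.0
        # Extra travel time compared with the nearest speaker.
        delay = (d - nearest) / SPEED_OF_SOUND
        cues.append((gain, delay))
    return cues


# Virtual source at the origin, speakers 1 m to the left and 2 m to the right.
cues = speaker_cues((0.0, 0.0), [(-1.0, 0.0), (2.0, 0.0)])
```

Here the nearer loudspeaker plays at full gain with no delay, and the farther one plays at half gain about 2.9 ms later.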

    RECOVERING DIS-OCCLUDED AREAS USING TEMPORAL INFORMATION INTEGRATION
    10.
    Invention patent application
    RECOVERING DIS-OCCLUDED AREAS USING TEMPORAL INFORMATION INTEGRATION (in force)

    Publication No.: US20130294710A1

    Publication date: 2013-11-07

    Application No.: US13463934

    Filing date: 2012-05-04

    IPC classification: G06K9/32

    CPC classification: G06K9/32, G06T7/593

    Abstract: A temporal information integration dis-occlusion system and method for using historical data to reconstruct a virtual view containing an occluded area. Embodiments of the system and method use temporal information of the scene captured previously to obtain a total history. This total history is warped onto information captured by a camera at a current time in order to help reconstruct the dis-occluded areas. The historical data (or frames) from the total history match only a portion of the frames contained in the captured information. This warping yields warped history information. Warping is performed by using one of two embodiments to match points in an estimation of the current information to points in the captured information. Next, regions of current information are split using a classifier. The warped history information and the captured information are then merged to obtain an estimate for the current information and the reconstructed virtual view.
