Multi media computing or entertainment system for responding to user presence and activity

    公开(公告)号:US10444854B2

    公开(公告)日:2019-10-15

    申请号:US16055994

    申请日:2018-08-06

    Applicant: Apple Inc.

    Abstract: Intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene may be monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, e.g., expressed through hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, e.g., optical or non-optical type depth sensors.

    Multi media computing or entertainment system for responding to user presence and activity

    公开(公告)号:US10048765B2

    公开(公告)日:2018-08-14

    申请号:US14865850

    申请日:2015-09-25

    Applicant: Apple Inc.

    Abstract: Varying embodiments of intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene, such as walls, furniture, and humans may be evaluated and monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent or desire as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, for example, expressed through fine hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, for example, optical or non-optical type depth sensors. The depth information may be interpreted in “slices” (three-dimensional regions of space having a relatively small depth) until one or more candidate hand structures are detected. Once detected, each candidate hand structure may be confirmed or rejected based on its own unique physical properties (e.g., shape, size and continuity to an arm structure). Each confirmed hand structure may be submitted to a depth-aware filtering process before its own unique three-dimensional features are quantified into a high-dimensional feature vector. A two-step classification scheme may be applied to the feature vectors to identify a candidate gesture (step 1), and to reject candidate gestures that do not meet a gesture-specific identification operation (step-2). The identified gesture may be used to initiate some action controlled by a computer system.

    Three-Dimensional Hand Tracking Using Depth Sequences
    23.
    发明申请
    Three-Dimensional Hand Tracking Using Depth Sequences 有权
    使用深度序列的三维手跟踪

    公开(公告)号:US20160048726A1

    公开(公告)日:2016-02-18

    申请号:US14706649

    申请日:2015-05-07

    Applicant: Apple Inc.

    Abstract: In the field of Human-computer interaction (HCI), i.e., the study of the interfaces between people (i.e., users) and computers, understanding the intentions and desires of how the user wishes to interact with the computer is a very important problem. The ability to understand human gestures, and, in particular, hand gestures, as they relate to HCI, is a very important aspect in understanding the intentions and desires of the user in a wide variety of applications. In this disclosure, a novel system and method for three-dimensional hand tracking using depth sequences is described. Some of the major contributions of the hand tracking system described herein include: 1.) a robust hand detector that is invariant to scene background changes; 2.) a bi-directional tracking algorithm that prevents detected hands from always drifting closer to the front of the scene (i.e., forward along the z-axis of the scene); and 3.) various hand verification heuristics.

    Abstract translation: 在人机互动(HCI)领域,即研究人(即用户)和计算机之间的界面,理解用户希望如何与计算机交互的意图和欲望是非常重要的问题。 了解人类手势,特别是手势,因为它们与HCI相关的能力在理解用户在各种应用中的意图和欲望方面是一个非常重要的方面。 在本公开中,描述了使用深度序列的三维手跟踪的新颖系统和方法。 本文描述的手持跟踪系统的一些主要贡献包括:1.)对场景背景变化不变的鲁棒手指检测器; 2.)双向跟踪算法,其防止检测到的手总是漂移到靠近场景的前方(即,沿着场景的z轴向前); 和3.)各种手验证启发式。

    3D Representation of Physical Environment Objects

    公开(公告)号:US20230289993A1

    公开(公告)日:2023-09-14

    申请号:US18119792

    申请日:2023-03-09

    Applicant: Apple Inc.

    Abstract: Various implementations provide 3D representations of objects. Such representations may be based on 3D point cloud and/or 2D image inputs that are obtained based on sensor data, e.g., images, depth data, motion data, etc. 3D point cloud input may be used for part segmentation and/or to determine position and/or orientation of object parts, e.g., generating 3D bounding boxes representing the sizes, positions, and orientations, of object parts. 2D image input may be used for part attribute recognition, e.g., to determine whether a chair legs part has a particular type such as star-shaped, straight down, crossed-shaped, etc. Part attributes may be used to produce a relatively simple and relatively accurate representation of the shape of each part within a respective area, e.g., within a bounding box determined for each part using the 3D point cloud input.

    Multi media computing or entertainment system for responding to user presence and activity

    公开(公告)号:US11561621B2

    公开(公告)日:2023-01-24

    申请号:US16600830

    申请日:2019-10-14

    Applicant: Apple Inc.

    Abstract: Intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene may be monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, e.g., expressed through hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, e.g., optical or non-optical type depth sensors.

    Obstruction detection during facial recognition processes

    公开(公告)号:US11367305B2

    公开(公告)日:2022-06-21

    申请号:US16549009

    申请日:2019-08-23

    Applicant: Apple Inc.

    Abstract: A facial recognition process operating on a device may include one or more processes that determine if a camera and/or components associated with the camera are obstructed by an object (e.g., a user's hand or fingers). Obstruction of the device may be assessed using flood infrared illumination images when a user's face is not able to be detected by a face detection process operating on the device. Obstruction of the device may also be assessed using a pattern detection process that operates after the user's face is detected by the face detection process. When obstruction of the device is detected, the device may provide a notification to the user that the device (e.g., the camera and/or an illuminator) is obstructed and that the obstruction should be removed for the facial recognition process to operate correctly.

Patent Agency Ranking