-
公开(公告)号:US11010961B2
公开(公告)日:2021-05-18
申请号:US16352402
申请日:2019-03-13
摘要: A computer system is provided that includes a camera device and a processor configured to receive scene data captured by the camera device for a three-dimensional environment that includes one or more physical objects, generate a geometric representation of the scene data, process the scene data using an artificial intelligence machine learning model that outputs object boundary data and object labels, augment the geometric representation with the object boundary data and the object labels, and identify the one or more physical objects based on the augmented geometric representation of the three-dimensional environment. For each identified physical object, the processor is configured to generate an associated virtual object that is fit to one or more geometric characteristics of that identified physical object. The processor is further configured to track each identified physical object and associated virtual object across successive updates to the scene data.
-
公开(公告)号:US09280972B2
公开(公告)日:2016-03-08
申请号:US13892094
申请日:2013-05-10
发明人: Daniel McCulloch , Abby Lin Lee , Adam Benjamin Smith-Kipnis , Jonathan William Plumb , Alexandre David , Michael O Hale , Jeff Cole , Hendrik Mark Langerak
摘要: Embodiments that relate to converting audio inputs from an environment into text are disclosed. For example, in one disclosed embodiment a speech conversion program receives audio inputs from a microphone array of a head-mounted display device. Image data is captured from the environment, and one or more possible faces are detected from image data. Eye-tracking data is used to determine a target face on which a user is focused. A beamforming technique is applied to at least a portion of the audio inputs to identify target audio inputs that are associated with the target face. The target audio inputs are converted into text that is displayed via a transparent display of the head-mounted display device.
摘要翻译: 公开了将音频输入从环境转换为文本的实施例。 例如,在一个公开的实施例中,语音转换程序从头戴式显示设备的麦克风阵列接收音频输入。 从环境中捕获图像数据,并且从图像数据检测到一个或多个可能的面。 眼睛跟踪数据用于确定用户聚焦的目标脸部。 波束形成技术被应用于至少一部分音频输入以识别与目标面相关联的目标音频输入。 目标音频输入被转换成通过头戴式显示设备的透明显示器显示的文本。
-
公开(公告)号:US10740960B2
公开(公告)日:2020-08-11
申请号:US16294748
申请日:2019-03-06
摘要: An augmented reality device includes a logic machine and a storage machine holding instructions executable by the logic machine to, for one or more real-world surfaces represented in a three-dimensional representation of a real-world environment of the augmented reality device, fit a virtual two-dimensional plane to the real-world surface. A request to place a virtual three-dimensional object on the real-world surface is received. For each of a plurality of candidate placement locations on the virtual two-dimensional plane, the candidate placement location is evaluated as a valid placement location or an invalid placement location for the virtual three-dimensional object. An invalidation mask is generated that defines the valid and invalid placement locations on the virtual two-dimensional plane.
-
公开(公告)号:US11010965B2
公开(公告)日:2021-05-18
申请号:US16925728
申请日:2020-07-10
摘要: An augmented reality device includes a logic machine and a storage machine holding instructions executable by the logic machine to, for one or more real-world surfaces represented in a three-dimensional representation of a real-world environment of the augmented reality device, fit a virtual two-dimensional plane to the real-world surface. A request to place a virtual three-dimensional object on the real-world surface is received. For each of a plurality of candidate placement locations on the virtual two-dimensional plane, the candidate placement location is evaluated as a valid placement location or an invalid placement location for the virtual three-dimensional object. An invalidation mask is generated that defines the valid and invalid placement locations on the virtual two-dimensional plane.
-
-
-