Real time simplification of meshes

    公开(公告)号:US12256098B1

    公开(公告)日:2025-03-18

    申请号:US17691691

    申请日:2022-03-10

    Applicant: Apple Inc.

    Abstract: A decoding computing device receives a bit stream for compressed 3D volumetric content. The bit stream includes video encoded image frames comprising packed attribute patch images and depth maps for the 3D volumetric content. Instead of generating a mesh having a vertex for each depth value signaled in the depth map, the decoder performs a real-time mesh simplification process to reduce a resolution of the mesh, such that the mesh resolution is reduced without exceeding an error threshold, which may be dynamically determined. Additionally, the decoder may perform a re-meshing of particular regions of the mesh for the 3D volumetric content to avoid cracks or gaps.

    Computer-generated reality recorder

    公开(公告)号:US11790653B2

    公开(公告)日:2023-10-17

    申请号:US17016127

    申请日:2020-09-09

    Applicant: Apple Inc.

    CPC classification number: G06V20/42 G06T17/20 G06T19/006 H04N9/8715 G06V20/44

    Abstract: Implementations of the subject technology provides analyzing a recording of content. The subject technology generates metadata information based at least in part on the analyzing. The subject technology identifies, based at least in part on at least one of a user preference or a detected event, a region of interest or an object of interest in the recording of content. Based at least in part on the identified region of interest or object of interest, the subject technology generates a modified version of the recording of content. Further, the subject technology stores the modified version of the recording of content for subsequent playback on an electronic device.

    Gaze-driven recording of video
    3.
    发明授权

    公开(公告)号:US10951904B2

    公开(公告)日:2021-03-16

    申请号:US16713778

    申请日:2019-12-13

    Applicant: Apple Inc.

    Abstract: Systems and methods for gaze-driven recording of video are described. Some implementations may include accessing gaze data captured using one or more gaze-tracking sensors; applying a temporal filter to the gaze data to obtain a smoothed gaze estimate; determining a region of interest based on the smoothed gaze estimate, wherein the region of interest identifies a subset of a field of view; accessing a frame of video; recording a portion of the frame associated with the region of interest as an enhanced frame of video, wherein the portion of the frame corresponds to a smaller field of view than the frame; and storing, transmitting, or displaying the enhanced frame of video.

    Distributed encoding
    4.
    发明授权

    公开(公告)号:US11722540B2

    公开(公告)日:2023-08-08

    申请号:US17320191

    申请日:2021-05-13

    Applicant: Apple Inc.

    CPC classification number: H04L65/70 G02B27/017 G06F3/012 H04L65/762 H04L65/80

    Abstract: Techniques are disclosed relating to encoding recorded content for distribution to other computing devices. In some embodiments, a first computing device creates recorded content for transmission to a second computing device configured to present the recorded content. To encode the recorded content, the first computing device detects, via a network interface of the first computing device, one or more computing nodes available to encode the recorded content in one or more formats supported by the second computing device. The first computing device offloads the recorded content via the network interface to the one or more computing nodes for encoding in the one or more formats. In some embodiments, the second computing device receives a request from a user to stream content recorded by a first computing device and requests the content in a first format being encoded by a computing node assisting the first computing device.

    Multimodal inputs for computer-generated reality

    公开(公告)号:US11698674B2

    公开(公告)日:2023-07-11

    申请号:US17016190

    申请日:2020-09-09

    Applicant: Apple Inc.

    Abstract: Implementations of the subject technology provide determining an operating mode of an electronic device based at least in part on whether the electronic device is communicatively coupled to an associated base device. Based on the determined operating mode, the subject technology identifies a set of input modalities for initiating a recording of content within a field of view of the electronic device. The subject technology monitors sensor information generated by at least one sensor included in, or communicatively coupled to, the electronic device. Further, the subject technology initiates the recording of content within the field of view of the electronic device when the monitored sensor information indicates that at least one of the identified set of input modalities has been triggered.

    Gaze-Driven Recording of Video
    6.
    发明申请

    公开(公告)号:US20220295084A1

    公开(公告)日:2022-09-15

    申请号:US17825167

    申请日:2022-05-26

    Applicant: Apple Inc.

    Abstract: Systems and methods for gaze-driven recording of video are described. Some implementations may include accessing gaze data captured using one or more gaze-tracking sensors; applying a temporal filter to the gaze data to obtain a smoothed gaze estimate; determining a region of interest based on the smoothed gaze estimate, wherein the region of interest identifies a subset of a field of view; accessing a frame of video; recording a portion of the frame associated with the region of interest as an enhanced frame of video, wherein the portion of the frame corresponds to a smaller field of view than the frame; and storing, transmitting, or displaying the enhanced frame of video.

    Efficient delivery of multi-camera interactive content

    公开(公告)号:US11856042B2

    公开(公告)日:2023-12-26

    申请号:US18068254

    申请日:2022-12-19

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed relating to encoding recorded content for distribution to other computing devices. In various embodiments, a first computing device records content of a physical environment in which the first computing device is located, the content being deliverable to a second computing device configured to present a corresponding environment based on the recorded content and content recorded by one or more additional computing devices. The first computing device determines a pose of the first computing device within the physical environment and encodes the pose in a manifest usable to stream the content recorded by the first computing device to the second computing device. The encoded pose is usable by the second computing device to determine whether to stream the content recorded by the first computing device.

    Efficient Delivery of Multi-Camera Interactive Content

    公开(公告)号:US20230216908A1

    公开(公告)日:2023-07-06

    申请号:US18068254

    申请日:2022-12-19

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed relating to encoding recorded content for distribution to other computing devices. In various embodiments, a first computing device records content of a physical environment in which the first computing device is located, the content being deliverable to a second computing device configured to present a corresponding environment based on the recorded content and content recorded by one or more additional computing devices. The first computing device determines a pose of the first computing device within the physical environment and encodes the pose in a manifest usable to stream the content recorded by the first computing device to the second computing device. The encoded pose is usable by the second computing device to determine whether to stream the content recorded by the first computing device.

    Gaze-Driven Recording of Video
    9.
    发明申请

    公开(公告)号:US20210168387A1

    公开(公告)日:2021-06-03

    申请号:US17176677

    申请日:2021-02-16

    Applicant: Apple Inc.

    Abstract: Systems and methods for gaze-driven recording of video are described. Some implementations may include accessing gaze data captured using one or more gaze-tracking sensors; applying a temporal filter to the gaze data to obtain a smoothed gaze estimate; determining a region of interest based on the smoothed gaze estimate, wherein the region of interest identifies a subset of a field of view; accessing a frame of video; recording a portion of the frame associated with the region of interest as an enhanced frame of video, wherein the portion of the frame corresponds to a smaller field of view than the frame; and storing, transmitting, or displaying the enhanced frame of video.

    Multimodal inputs for computer-generated reality

    公开(公告)号:US12242664B2

    公开(公告)日:2025-03-04

    申请号:US18204892

    申请日:2023-06-01

    Applicant: Apple Inc.

    Abstract: Implementations of the subject technology provide determining an operating mode of an electronic device based at least in part on whether the electronic device is communicatively coupled to an associated base device. Based on the determined operating mode, the subject technology identifies a set of input modalities for initiating a recording of content within a field of view of the electronic device. The subject technology monitors sensor information generated by at least one sensor included in, or communicatively coupled to, the electronic device. Further, the subject technology initiates the recording of content within the field of view of the electronic device when the monitored sensor information indicates that at least one of the identified set of input modalities has been triggered.

Patent Agency Ranking