APPARATUS AND METHODS FOR OBJECT DETECTION USING MACHINE LEARNING PROCESSES

    Publication number: US20240331372A1

    Publication date: 2024-10-03

    Application number: US18738636

    Filing date: 2024-06-10

    CPC classification number: G06V10/82 G06V10/25 G06V10/72 G06V10/764 G06V40/107

    Abstract: Methods, systems, and apparatuses are provided to automatically detect objects within images. For example, an image capture device may capture an image, and may apply a trained neural network to the image to generate an object value and a class value for each of a plurality of portions of the image. Further, the image capture device may determine, for each of the plurality of image portions, a confidence value based on the object value and the class value corresponding to each image portion. The image capture device may also detect an object within at least one image portion based on the confidence values. Further, the image capture device may output a bounding box corresponding to the at least one image portion. The bounding box defines an area of the image that includes one or more objects.
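The confidence step in this abstract can be illustrated with a minimal sketch. The abstract only says the confidence value is "based on" the object value and the class value; taking their product and comparing it with a fixed threshold is an assumption made here for illustration, as are the names `detect_portions`, `Box`, and `threshold`. No neural network is run; the per-portion values are taken as given inputs.

```python
from typing import List, Tuple

# Hypothetical box layout for illustration: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


def detect_portions(
    object_values: List[float],
    class_values: List[float],
    boxes: List[Box],
    threshold: float = 0.5,
) -> List[Tuple[Box, float]]:
    """Return (bounding box, confidence) for each image portion kept.

    Assumption: confidence = object value * class value. The abstract does
    not specify the combining function or the detection rule.
    """
    if not (len(object_values) == len(class_values) == len(boxes)):
        raise ValueError("expected one object value, class value, and box per portion")

    detections = []
    for obj, cls, box in zip(object_values, class_values, boxes):
        confidence = obj * cls
        if confidence >= threshold:
            detections.append((box, confidence))
    return detections


if __name__ == "__main__":
    portions = [(0, 0, 32, 32), (32, 0, 32, 32), (64, 0, 32, 32)]
    for box, conf in detect_portions([0.9, 0.2, 0.8], [0.8, 0.9, 0.5], portions):
        print(box, round(conf, 2))
```

With these made-up inputs only the first portion clears the threshold, so only its bounding box would be output.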

    Multi-Stage Neural Network Process for Keypoint Detection In An Image

    Publication number: US20210166070A1

    Publication date: 2021-06-03

    Application number: US16700219

    Filing date: 2019-12-02

    Abstract: Embodiments include systems and methods for keypoint detection in an image. In embodiments, a processor of a computing device may apply to an image a first neural network that has been trained to define and output a plurality of regions. The processor may apply to each of the plurality of regions a respective second neural network that has been trained to output a plurality of keypoints in each of the plurality of regions. The processor may apply to the plurality of keypoints a third neural network that has been trained to determine a correction for each of the plurality of keypoints to provide corrected keypoints suitable for the execution of an image processing function.
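The three-stage flow in this abstract can be sketched as plain function composition. This is only an illustration of the data flow: the networks are stand-in callables, and the names `detect_keypoints`, `region_net`, `keypoint_nets`, and `correction_net`, as well as the choice to represent a correction as an additive (dx, dy) offset, are assumptions that do not come from the abstract.

```python
from typing import Any, Callable, List, Sequence, Tuple

Point = Tuple[float, float]


def detect_keypoints(
    image: Any,
    region_net: Callable[[Any], Sequence[Any]],
    keypoint_nets: Sequence[Callable[[Any], Sequence[Point]]],
    correction_net: Callable[[Sequence[Point]], Sequence[Point]],
) -> List[Point]:
    """Run the three stages in order and return corrected keypoints."""
    # Stage 1: the first network defines and outputs the regions.
    regions = region_net(image)
    if len(regions) != len(keypoint_nets):
        raise ValueError("expected one second-stage network per region")

    # Stage 2: each region is passed to its own second-stage network.
    keypoints: List[Point] = []
    for region, net in zip(regions, keypoint_nets):
        keypoints.extend(net(region))

    # Stage 3: the third network sees all keypoints and returns one
    # correction per keypoint (assumed here to be an additive offset).
    corrections = correction_net(keypoints)
    return [(x + dx, y + dy) for (x, y), (dx, dy) in zip(keypoints, corrections)]
```

In practice each callable would wrap a trained model; passing them in as arguments keeps the sketch independent of any particular framework.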

    PROXIMITY-BASED PROTOCOL FOR ENABLING MULTI-USER EXTENDED REALITY (XR) EXPERIENCE

    Publication number: US20240242443A1

    Publication date: 2024-07-18

    Application number: US18153498

    Filing date: 2023-01-12

    Abstract: Systems and techniques are described herein for enabling a multi-user extended reality (XR) experience. In one illustrative example, a user device can receive, associated with a user from a host device, a message comprising a prompt to join an XR room hosted by the host device. The user device can connect to the host device based on the message. The user device can obtain images of a three-dimensional (3D) scene of a physical environment and can transmit the images to the host device. The user device can receive synthetic content from the host device. A virtual representation of the user can be localized with respect to the XR room based on features in the images matched with features of the 3D scene of the physical environment. The user device can render the synthetic content for the XR room based on a pose of the apparatus.

    LOW-POWER FUSION FOR NEGATIVE SHUTTER LAG CAPTURE

    Publication number: US20250106531A1

    Publication date: 2025-03-27

    Application number: US18974248

    Filing date: 2024-12-09

    Abstract: Systems and techniques are provided for processing one or more frames. For example, a process can include obtaining a first plurality of frames associated with a first settings domain from an image capture system, wherein the first plurality of frames is captured prior to obtaining a capture input. The process can include obtaining a reference frame associated with a second settings domain from the image capture system, wherein the reference frame is captured proximate to obtaining the capture input. The process can include obtaining a second plurality of frames associated with the second settings domain from the image capture system, wherein the second plurality of frames is captured after the reference frame. The process can include, based on the reference frame, transforming at least a portion of the first plurality of frames to generate a transformed plurality of frames associated with the second settings domain.

    LOW-POWER FUSION FOR NEGATIVE SHUTTER LAG CAPTURE

    Publication number: US20240007760A1

    Publication date: 2024-01-04

    Application number: US18467563

    Filing date: 2023-09-14

    Abstract: Systems and techniques are provided for processing one or more frames. For example, a process can include obtaining a first plurality of frames associated with a first settings domain from an image capture system, wherein the first plurality of frames is captured prior to obtaining a capture input. The process can include obtaining a reference frame associated with a second settings domain from the image capture system, wherein the reference frame is captured proximate to obtaining the capture input. The process can include obtaining a second plurality of frames associated with the second settings domain from the image capture system, wherein the second plurality of frames is captured after the reference frame. The process can include, based on the reference frame, transforming at least a portion of the first plurality of frames to generate a transformed plurality of frames associated with the second settings domain.

    THREE-DIMENSIONAL SCAN REGISTRATION WITH DEFORMABLE MODELS

    Publication number: US20220215564A1

    Publication date: 2022-07-07

    Application number: US17144102

    Filing date: 2021-01-07

    Abstract: Systems and techniques are provided for registering three-dimensional (3D) images to deformable models. An example method can include determining, based on an image of a target and associated depth information, a 3D mesh of the target; determining different sets of rotation and translation parameters based on modifications to rotation and translation parameters of the 3D mesh; generating, based on the different sets of rotation and translation parameters, different 3D meshes having different orientations, different poses, and/or different alignments relative to the target; determining different sets of model parameters associated with the different 3D meshes, based on modifications to the different sets of rotation and translation parameters; generating, based on the different sets of model parameters, different additional 3D meshes having different orientations, different poses, and/or different alignments relative to the target; and selecting a final 3D mesh of the target from the different additional 3D meshes.

    COMPACT ENCODED HEAT MAPS FOR KEYPOINT DETECTION NETWORKS

    Publication number: US20210334516A1

    Publication date: 2021-10-28

    Application number: US16859836

    Filing date: 2020-04-27

    Abstract: A method is presented. The method includes determining a number of landmarks in an image comprising multiple pixels. The method also includes determining a number of channels for the image based on a function of the number of landmarks. The method further includes determining, for each one of the number of channels, a confidence of each pixel of the multiple pixels corresponding to a landmark. The method still further includes identifying the landmark in the image based on the confidence.
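One way to picture a channel count that is "a function of the number of landmarks" is a binary encoding, sketched below. The abstract does not state which function is used, so the choice of ceil(log2(N + 1)) channels, the reading of each channel as one bit of a landmark index, and the names `num_channels`, `decode_landmark_id`, and `threshold` are all assumptions made for illustration.

```python
from typing import Sequence


def num_channels(num_landmarks: int) -> int:
    """Channels needed to binary-encode landmark ids 1..N (0 = background).

    Assumption: the function is ceil(log2(N + 1)), which for positive
    integers equals N.bit_length().
    """
    if num_landmarks < 1:
        raise ValueError("need at least one landmark")
    return num_landmarks.bit_length()


def decode_landmark_id(confidences: Sequence[float], threshold: float = 0.5) -> int:
    """Decode one pixel's per-channel confidences into a landmark id.

    Channel i is treated as bit i of the id; a confidence at or above the
    (hypothetical) threshold sets that bit. An id of 0 means no landmark.
    """
    landmark_id = 0
    for bit, confidence in enumerate(confidences):
        if confidence >= threshold:
            landmark_id |= 1 << bit
    return landmark_id


if __name__ == "__main__":
    # 68 landmarks fit in 7 channels under this assumed encoding,
    # versus 68 channels with one heat map per landmark.
    print(num_channels(68))
    print(decode_landmark_id([0.9, 0.1, 0.8]))
```

Under this assumed scheme the channel count grows logarithmically rather than linearly with the number of landmarks, which is one plausible reading of "compact" in the title.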
