-
公开(公告)号:US20240331372A1
公开(公告)日:2024-10-03
申请号:US18738636
申请日:2024-06-10
Applicant: QUALCOMM INCORPORATED
Inventor: Upal MAHBUB , Gokce DANE
IPC: G06V10/82 , G06V10/25 , G06V10/72 , G06V10/764 , G06V40/10
CPC classification number: G06V10/82 , G06V10/25 , G06V10/72 , G06V10/764 , G06V40/107
Abstract: Methods, systems, and apparatuses are provided to automatically detect objects within images. For example, an image capture device may capture an image, and may apply a trained neural network to the image to generate an object value and a class value for each of a plurality of portions of the image. Further, the image capture device may determine, for each of the plurality of image portions, a confidence value based on the object value and the class value corresponding to each image portion. The image capture device may also detect an object within at least one image portion based on the confidence values. Further, the image capture device may output a bounding box corresponding to the at least one image portion. The bounding box defines an area of the image that includes one or more objects.
-
公开(公告)号:US20210166070A1
公开(公告)日:2021-06-03
申请号:US16700219
申请日:2019-12-02
Applicant: QUALCOMM Incorporated
Inventor: Upal MAHBUB , RAKESH NATTOJI RAJARAM , Vasudev BHASKARAN
Abstract: Embodiments include systems and methods for keypoint detection in an image. In embodiments, a processor of a computing device may apply to an image a first neural network that has been trained to define and output a plurality of regions. The processor may apply to each of the plurality of regions a respective second neural network to that has been trained to output a plurality of keypoints in each of the plurality of regions. The processor may apply to the plurality of keypoints a third neural network that has been trained to determine a correction for each of the plurality of keypoints to provide corrected keypoints suitable for the execution of an image processing function.
-
公开(公告)号:US20240242443A1
公开(公告)日:2024-07-18
申请号:US18153498
申请日:2023-01-12
Applicant: QUALCOMM Incorporated
Inventor: Adithya Reddy NALLABOLU , Upal MAHBUB , Samuel SUNARJO , Gokce DANE
CPC classification number: G06T19/006 , G06F3/012 , G06F3/013 , G06F21/35 , G06T2219/024
Abstract: Systems and techniques are described herein for enabling a multi-user extended reality (XR) experience. In one illustrative example, a user device can receive, associated with a user from a host device, a message comprising a prompt to join an XR room hosted by the host device. The user device can connect to the host device based on the message. The user device can can obtain images of a three-dimensional (3D) scene of a physical environment and can transmit the images to the host device. The user device can receive synthetic content from the host device. A virtual representation of the user can be localized with respect to the XR room based on features in the images matched with features of the 3D scene of the physical environment. The user device can render the synthetic content for the XR room based on a pose of the apparatus.
-
公开(公告)号:US20230062187A1
公开(公告)日:2023-03-02
申请号:US17412113
申请日:2021-08-25
Applicant: QUALCOMM Incorporated
Inventor: Wesley James HOLLAND , Upal MAHBUB , Venkata Ravi Kiran DAYANA , Rengaraj THIRUPATHI
Abstract: Systems, methods, and non-transitory media are provided for predictive camera initialization. An example method can include obtain, from a first image capture device, image data depicting a scene; classify the scene based on the image data; based on the classification of the scene, predict a camera use event; and based on the predicted camera use event, adjust a power mode of at least one of the first image capture device and a second image capture device.
-
公开(公告)号:US20250106531A1
公开(公告)日:2025-03-27
申请号:US18974248
申请日:2024-12-09
Applicant: QUALCOMM Incorporated
Inventor: Wesley James HOLLAND , Micha GALOR GLUSKIN , Venkata Ravi Kiran DAYANA , Upal MAHBUB , Scott BARKER
IPC: H04N23/951 , G06T3/4053 , H04N23/68
Abstract: Systems and techniques are provided for processing one or more frames. For example, a process can include obtaining a first plurality of frames associated with a first settings domain from an image capture system, wherein the first plurality of frames is captured prior to obtaining a capture input. The process can include obtaining a reference frame associated with a second settings domain from the image capture system, wherein the reference frame is captured proximate to obtaining the capture input. The process can include obtaining a second plurality of frames associated with the second settings domain from the image capture system, wherein the second plurality of frames is captured after the reference frame. The process can include, based on the reference frame, transforming at least a portion of the first plurality of frames to generate a transformed plurality of frames associated with the second settings domain.
-
公开(公告)号:US20240098357A1
公开(公告)日:2024-03-21
申请号:US18523774
申请日:2023-11-29
Applicant: QUALCOMM Incorporated
Inventor: Wesley James HOLLAND , Upal MAHBUB , Venkata Ravi Kiran DAYANA , Rengaraj THIRUPATHI
Abstract: Systems, methods, and non-transitory media are provided for predictive camera initialization. An example method can include obtain, from a first image capture device, image data depicting a scene; classify the scene based on the image data; based on the classification of the scene, predict a camera use event; and based on the predicted camera use event, adjust a power mode of at least one of the first image capture device and a second image capture device.
-
公开(公告)号:US20240007760A1
公开(公告)日:2024-01-04
申请号:US18467563
申请日:2023-09-14
Applicant: QUALCOMM Incorporated
Inventor: Wesley James HOLLAND , Micha GALOR GLUSKIN , Venkata Ravi Kiran DAYANA , Upal MAHBUB , Scott BARKER
IPC: H04N23/951 , H04N23/68 , G06T3/40
CPC classification number: H04N23/951 , H04N23/6815 , H04N23/6812 , G06T3/4053 , H04N23/6811
Abstract: Systems and techniques are provided for processing one or more frames. For example, a process can include obtaining a first plurality of frames associated with a first settings domain from an image capture system, wherein the first plurality of frames is captured prior to obtaining a capture input. The process can include obtaining a reference frame associated with a second settings domain from the image capture system, wherein the reference frame is captured proximate to obtaining the capture input. The process can include obtaining a second plurality of frames associated with the second settings domain from the image capture system, wherein the second plurality of frames is captured after the reference frame. The process can include, based on the reference frame, transforming at least a portion of the first plurality of frames to generate a transformed plurality of frames associated with the second settings domain.
-
公开(公告)号:US20220215564A1
公开(公告)日:2022-07-07
申请号:US17144102
申请日:2021-01-07
Applicant: QUALCOMM Incorporated
Inventor: Samuel SUNARJO , Gokce DANE , Ashar ALI , Upal MAHBUB
Abstract: Systems and techniques are provided for registering three-dimensional (3D) images to deformable models. An example method can include determining, based on an image of a target and associated depth information, a 3D mesh of the target; determining different sets of rotation and translation parameters based on modifications to rotation and translation parameters of the 3D mesh; generating, based on the different sets of rotation and translation parameters, different 3D meshes having different orientations, different poses, and/or different alignments relative to the target; determining different sets of model parameters associated with the different 3D meshes, based on modifications to the different sets of rotation and translation parameters; generating, based on the different sets of model parameters, different additional 3D meshes having different orientations, different poses, and/or different alignments relative to the target; and selecting a final 3D mesh of the target from the different additional 3D meshes.
-
公开(公告)号:US20210334516A1
公开(公告)日:2021-10-28
申请号:US16859836
申请日:2020-04-27
Applicant: QUALCOMM Incorporated
Inventor: Upal MAHBUB , Rakesh NATTOJI RAJARAM , Vasudev BHASKARAN
Abstract: A method is presented. The method includes determining a number of landmarks in an image comprising multiple pixels. The method also includes determining a number of channels for the image based on a function of the number of landmarks. The method further includes determining, for each one of the number of channels, a confidence of each pixel of the multiple pixels corresponding to a landmark. The method still further includes identifying the landmark in the image based on the confidence.
-
公开(公告)号:US20240209843A1
公开(公告)日:2024-06-27
申请号:US18596523
申请日:2024-03-05
Applicant: QUALCOMM Incorporated
Inventor: Adithya Reddy NALLABOLU , Gokce DANE , Pirazh KHORRAMSHAHI , Upal MAHBUB
IPC: F03G3/08 , G01C19/00 , G01C19/02 , H02K7/02 , H02P25/024
CPC classification number: F03G3/083 , G01C19/00 , G01C19/02 , H02K7/02 , H02P25/024
Abstract: Systems and techniques are described for performing scalable voxel block selection. For example, a computing device can determine a fixed block configuration based on a storage size limitation. The computing device can select a plurality of blocks of the scene based on the fixed block configuration. The computing device can convert indices of the plurality of blocks associated with the fixed block configuration to indices of a plurality of blocks associated with a particular block configuration (of a plurality of block configurations) that corresponds to a particular 3DR application. The particular block configuration is different from the fixed block configuration.
-
-
-
-
-
-
-
-
-