Patent search ap:("QUALCOMM Incorporated") AND inv:"Senthil Kumar Yogamani" Page 4

31.

发明申请
STOCHASTIC DYNAMIC FIELD OF VIEW FOR MULTI-CAMERA BIRD’S EYE VIEW PERCEPTION IN AUTONOMOUS DRIVING 有权

公开(公告)号：US20250156997A1

公开(公告)日：2025-05-15

申请号：US18505923

申请日：2023-11-09

Applicant: QUALCOMM Incorporated

Inventor： Varun Ravi Kumar , Kiran Bangalore Ravi , Senthil Kumar Yogamani

IPC: G06T5/50 , G06T3/40 , G06T7/70 , G06V10/40 , G06V20/56

Abstract: An apparatus for processing image data includes a memory for storing the image data, wherein the image data comprises a first set of image data collected by a first camera comprising a first field of view (FOV) and a second set of image data collected by a second camera comprising a second FOV; and processing circuitry in communication with the memory. The processing circuitry is configured to: apply an encoder to extract, from the first set of image data, a first set of perspective view features; apply the encoder to extract, from the second set of image data, a second set of perspective view features; and project the first set of perspective view features and the second set of perspective view features onto a grid to generate a set of bird's eye view (BEV) features.

32.

发明申请
VOXEL-LEVEL FEATURE FUSION WITH GRAPH NEURAL NETWORKS AND DIFFUSION FOR 3D OBJECT DETECTION 有权

公开(公告)号：US20250095354A1

公开(公告)日：2025-03-20

申请号：US18467657

申请日：2023-09-14

Applicant: QUALCOMM Incorporated

Inventor： Varun Ravi Kumar , Debasmit Das , Senthil Kumar Yogamani

IPC: G06V10/86 , G06T3/00 , G06T5/00 , G06T7/194 , G06T7/55 , G06V10/80 , G06V10/82 , G06V20/58

Abstract: An apparatus includes a memory and processing circuitry in communication with the memory. The processing circuitry is configured to process a joint graph representation using a graph neural network (GNN) to form an enhanced graph representation. The joint graph representation includes first features from a voxelized point cloud, and second features from a plurality of camera images. The enhanced graph representation includes enhanced first features and enhanced second features. The processing circuitry is further configured to perform a diffusion processes on the enhanced first features and the enhanced second features of the enhanced graph representation to form a denoised graph representation having denoised first features and denoised second features, and fuse the denoised first features and the denoised second features of the denoised graph representation using a graph attention network (GAT) to form a fused point cloud having fused features.

33.

发明申请
GRAPH NEURAL NETWORK (GNN) IMPLEMENTED MULTI-MODAL SPATIOTEMPORAL FUSION 有权

公开(公告)号：US20250086979A1

公开(公告)日：2025-03-13

申请号：US18463109

申请日：2023-09-07

Applicant: QUALCOMM Incorporated

Inventor： Venkatraman Narayanan , Varun Ravi Kumar , Senthil Kumar Yogamani

IPC: G06V20/58 , G06V10/80 , G06V10/82

Abstract: Systems that support graph neural network (GNN) implemented multi-modal spatiotemporal fusion are provided. Identifying and tracking an object in images captured by an imaging system is facilitated by generating a graph based on multimodal data received from a plurality of sensors. The graph encodes spatial components and spatial data associated with the images and encodes temporal data associated with the images. Pooled features are generated, through application of a first graph attention network (GAT), by pooling spatial features and temporal features. The spatial features are based on the spatial component and on the spatial relationship, and the temporal features are based on the temporal relationship. A three dimensional bounding box associated with the object is decoded by propagating the pooled features through a fully connected layer.

34.

发明申请
RADAR AND CAMERA FUSION FOR VEHICLE APPLICATIONS 有权

公开(公告)号：US20250085413A1

公开(公告)日：2025-03-13

申请号：US18463049

申请日：2023-09-07

Applicant: QUALCOMM Incorporated

Inventor： Senthil Kumar Yogamani , Varun Ravi Kumar

IPC: G01S13/86 , G01S7/35 , G01S7/41 , G01S13/931

Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of image processing includes receiving image BEV features and receiving first radio detection and ranging (RADAR) BEV features. The first RADAR BEV features that are received are determined based on first RADAR data associated with a first data type. First normalized RADAR BEV features are determined, which includes rescaling the first RADAR BEV features using a first attention mechanism based on the image BEV features and the first RADAR BEV features. Fused data is determined that combines the first normalized RADAR BEV features and the image BEV features. Other aspects and features are also claimed and described.

35.

发明申请
CAMERA SOILING DETECTION USING ATTENTION-GUIDED CAMERA DEPTH AND LIDAR RANGE CONSISTENCY GATING 有权

公开(公告)号：US20250085407A1

公开(公告)日：2025-03-13

申请号：US18464769

申请日：2023-09-11

Applicant: QUALCOMM Incorporated

Inventor： Varun Ravi Kumar , Senthil Kumar Yogamani , Shivansh Rao

IPC: G01S7/497 , G01S17/86 , G01S17/931

Abstract: A method includes receiving a plurality of images, wherein a first image of the one or more images comprises a range image and a second image comprises a camera image and filtering the first image to generate a filtered first image. The method also includes generating a plurality of depth estimates based on the second image and generating an attention map by combining the filtered first image and the plurality of depth estimates. Additionally, the method includes generating a consistency score indicative of a consistency of depth estimates between the first image and the second image based on the attention map, modulating one or more features extracted from the second image based on the consistency score using a gating mechanism to generate modulated one or more features, and generating a classification of one or more soiled regions in the second image based on the modulated one or more features.

36.

发明申请
SEMI-AUTOMATIC PERCEPTION ANNOTATION SYSTEM 有权

公开(公告)号：US20250078437A1

公开(公告)日：2025-03-06

申请号：US18804633

申请日：2024-08-14

Applicant: QUALCOMM Incorporated

Inventor： Julia Kabalar , Mireille Lucette Laure Gregoire , Hazem Ahmed Mohamed Mohamed Rashed , Dorel Mircea Coman , Nirnai Ach , Kiran Bangalore Ravi , Senthil Kumar Yogamani

IPC: G06V10/25 , G06T7/70 , G06V20/58 , G06V20/70

Abstract: A method for selecting one or more Regions of Interest (RoIs) for human annotations includes obtaining sensor data generated by one or more sensors of a vehicle; applying at least one class-agnostic heuristic function to the sensor data to determine a presence and an approximate position of one or more objects in an RoI of the sensor data; selecting one or more RoIs having proposed annotations for the one or more objects for refinement by an annotator; and outputting the one or more selected RoIs.

37.

发明申请
NERF-BASED MULTI-SENSOR DATA FUSION FOR VEHICLE APPLICATIONS 有权

公开(公告)号：US20250029393A1

公开(公告)日：2025-01-23

申请号：US18356504

申请日：2023-07-21

Applicant: QUALCOMM Incorporated

Inventor： Venkatraman Narayanan , Varun Ravi Kumar , Senthil Kumar Yogamani

IPC: G06V20/58

Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of image processing includes receiving a plurality of image frames representative of a scene; receiving point cloud data representative of the scene; determining, using a NeRF model, a three-dimensional reconstruction of the scene based on the plurality of image frames; and outputting fused data that combines first BEV features of the three-dimensional reconstruction of the scene and second BEV features of the point cloud data. Other aspects and features are also claimed and described.

38.

发明申请
MULTI-MODAL ENCODER CHANNEL FUSION WITH CROSS-MODALITY AWARENESS 有权

公开(公告)号：US20250029355A1

公开(公告)日：2025-01-23

申请号：US18354074

申请日：2023-07-18

Applicant: QUALCOMM Incorporated

Inventor： Balaji Shankar Balachandran , Varun Ravi Kumar , Senthil Kumar Yogamani

IPC: G06V10/44 , G06V10/75 , G06V10/80

Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method includes receiving an image frame representing a scene; receiving point cloud data representing the scene; determining first sets of image frame features; determining second sets of point cloud data features based on a plurality of voxels representing the point cloud data; determining a third set of features of the image frame based on a first set of features of the plurality of first sets of features of the image frame and a second set of features of the plurality of second sets of features of the point cloud data; and outputting fused data that combines the third set of features of the image frame and a fourth set of features of the point cloud data. Other aspects and features are also claimed and described.

39.

发明申请
ONLINE ADAPTIVE MULTI-SENSOR FUSION 有权

公开(公告)号：US20240412494A1

公开(公告)日：2024-12-12

申请号：US18332394

申请日：2023-06-09

Applicant: QUALCOMM Incorporated

Inventor： Balaji Shankar Balachandran , Varun Ravi Kumar , Senthil Kumar Yogamani

IPC: G06V10/80 , G01S13/89 , G01S17/89

Abstract: This disclosure provides systems, methods, and devices that support image processing. In a first aspect, a method for multi-sensor fusion includes receiving first information indicative of a first set of BEV features of image data captured by an image sensor; receiving second information indicative of a second set of BEV features of non-image sensor data captured by a non-image sensor; and determining fused data that combines the image data and the non-image sensor data based on the first information, the second information, and third information indicative of differences between BEV features of training data and the first set of BEV features and the second set of BEV features. The BEV features of the training data include a third set of BEV features associated with the image sensor and a fourth set of BEV features associated with the non-image sensor. Other aspects and features are also claimed and described.

40.

发明申请
ADAPTIVE BEV FEATURE MAPPING FOR VEHICLE APPLICATIONS 有权

公开(公告)号：US20240412486A1

公开(公告)日：2024-12-12

申请号：US18330113

申请日：2023-06-06

Applicant: QUALCOMM Incorporated

Inventor： Varun Ravi Kumar , Senthil Kumar Yogamani , Bala Murali Manoghar Sai Sudhakar

IPC: G06V10/77 , G06V10/14 , G06V20/58

Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of image processing includes receiving an image frame from an image sensor of a camera; receiving an indicator associated with a type of lens of the camera; determining a first tensor grid associated with the indicator, the first tensor grid including a plurality of image framework positions associated with the type of lens; and determining, using a machine learning model, a BEV feature map corresponding to the image frame based on features of the image frame and the first tensor grid. Other aspects and features are also claimed and described.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification