-
公开(公告)号:US20250060481A1
公开(公告)日:2025-02-20
申请号:US18452279
申请日:2023-08-18
Applicant: QUALCOMM Incorporated
Inventor: Meysam Sadeghigooghari , Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: An apparatus includes a memory and processing circuitry in communication with the memory. The processing circuitry is configured to apply, based on a positional encoding model, a first feature conditioning module to a set of bird's eye view (BEV) position data features corresponding to position data to generate a set of conditioned BEV position data features, and apply, based on the position encoding model, a second feature conditioning module to a set of perspective image data features corresponding to image data to generate a set of conditioned perspective image data features. The processing circuitry is also configured to generate, based on the positional encoding model, the set of conditioned BEV position data features, and the set of conditioned perspective image data features, a weighted summation. Additionally, the processing circuitry is configured to generate, based on the weighted summation, a set of BEV image data features.
-
公开(公告)号:US20250058789A1
公开(公告)日:2025-02-20
申请号:US18452292
申请日:2023-08-18
Applicant: QUALCOMM Incorporated
Abstract: A system for processing image data and position data, the system comprising: a memory for storing the image data and the position data; and processing circuitry in communication with the memory. The processing circuitry is configured to: apply a first encoder to extract, from the image data, a first set of features; apply a first decoder to determine, based on the first set of features, a first uncertainty score. Additionally, the processing circuitry is configured to apply a second encoder to extract, from the position data, a second set of features; apply a second decoder to determine, based on the second set of features, a second uncertainty score; and fuse the first set of features and the second set of features based on the first uncertainty score and the second uncertainty score.
-
公开(公告)号:US20240395007A1
公开(公告)日:2024-11-28
申请号:US18321520
申请日:2023-05-22
Applicant: QUALCOMM Incorporated
Inventor: Varun Ravi Kumar , Debasmit Das , Senthil Kumar Yogamani
Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of image processing includes receiving a plurality of image frames by a computing device and using machine learning models to identify corrupted or occluded image frames. A first machine learning model may identify corrupted image frames, while a second machine learning model may identify partially occluded image frames. The method may further include generating updated versions of image frames captured by vehicle cameras, such as based on feature vectors from the first and second machine learning models. The feature vectors may be fused and provided to a third machine learning model to generate updated versions of occluded image frames. The method may further include determining vehicle control instructions based on the updated versions. Other aspects and features are also claimed and described.
-
公开(公告)号:US20240371168A1
公开(公告)日:2024-11-07
申请号:US18311784
申请日:2023-05-03
Applicant: QUALCOMM Incorporated
Inventor: Deeksha Dixit , Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method is provided that includes generating a top view image of an object using a plurality of images captured from different views. The method involves determining portions of the images that depict the object and generating novel views of the object from at least one novel view not present within the plurality of images. Corresponding portions containing an occluded view and an unobstructed view of the object are identified and corrected views for occluded views are determined based on corresponding unobstructed views using a machine learning model. A top view image may be then generated based on the corrected views. The invention enables improved visibility for autonomous driving systems in situations where objects are occluded or partially obstructed. Other aspects and features are also claimed and described.
-
公开(公告)号:US20240371147A1
公开(公告)日:2024-11-07
申请号:US18313287
申请日:2023-05-05
Applicant: QUALCOMM Incorporated
Inventor: Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of fusing features from near-field images and far-field images is provided that includes determining feature vectors and spatial locations for received images from near-field and far-field image sensors. A first set of weighted feature vectors may be determined based on spatial locations of the features and a second set of weighted feature vectors may be determined based on corresponding features between the feature vectors. Fused feature vectors may then be determined based on the weighted feature vectors, such as using a transformer attention process trained to select and combine features from both sets of weighted feature vectors. Vehicle control instructions may be determined based on the fused feature vectors. Other aspects and features are also claimed and described.
-
公开(公告)号:US20240153249A1
公开(公告)日:2024-05-09
申请号:US18467455
申请日:2023-09-14
Applicant: QUALCOMM Incorporated
Inventor: Shubhankar Mangesh Borse , Marvin Richard Klingner , Varun Ravi Kumar , Senthil Kumar Yogamani , Fatih Murat Porikli
IPC: G06V10/774 , G06V10/26 , G06V10/40 , G06V10/80 , G06V20/56
CPC classification number: G06V10/774 , G06V10/26 , G06V10/40 , G06V10/803 , G06V20/56
Abstract: This disclosure provides systems, methods, and devices for image signal processing that support training object recognition models. In a first aspect, a method of image processing includes training a first modality imaging system; receiving time-synchronized first input data samples and second input data samples from the first modality imaging system and a second modality imaging system, respectively; processing the first input data samples in the first modality imaging system to generate first output; processing the second input data samples in the second modality imaging system to generate second output; and training the second modality imaging system based on the first output and the second output. Other aspects and features are also claimed and described.
-
公开(公告)号:US20250157204A1
公开(公告)日:2025-05-15
申请号:US18509026
申请日:2023-11-14
Applicant: QUALCOMM Incorporated
Inventor: Venkatraman Narayanan , Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, a method of image processing includes receiving image data from an image sensor; receiving ranging data from a ranging sensor; embedding first spatial features of the image data with first temporal information associated with the image data; embedding second spatial features of the ranging data with second temporal information associated with the ranging data; determining first bird's-eye-view (BEV) features based on the first spatial features embedded with first temporal information; determining second BEV features based on the second spatial features embedded with second temporal information; and determining, based on the first and second BEV features, a feature set for processing by a transformer network. The feature set includes at least a portion of both the first and second BEV features. Other aspects and features are also claimed and described.
-
公开(公告)号:US20250157178A1
公开(公告)日:2025-05-15
申请号:US18506507
申请日:2023-11-10
Applicant: QUALCOMM Incorporated
Inventor: Varun Ravi Kumar , Senthil Kumar Yogamani , Sweta Priyadarshi
Abstract: This disclosure provides systems, methods, and devices for vehicle driving assistance systems that support image processing. In a first aspect, an image processing method includes receiving image frames; determining an ordered set of neural rays based on the image frames; determining a graph network that represents each neural ray of the ordered set of neural rays as a sequence of points; and determining a feature set based on the graph network. Each neural ray of the ordered set of neural rays represents three-dimensional positions of pixels of an image frame. Each point on the graph network is associated with a node of a plurality of nodes of the graph network. The feature set includes features of each of the image frames. Other aspects and features are also claimed and described.
-
公开(公告)号:US20250139882A1
公开(公告)日:2025-05-01
申请号:US18498995
申请日:2023-10-31
Applicant: QUALCOMM Incorporated
Inventor: Behnaz Rezaei , Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: In some aspects of the disclosure, an apparatus includes a processing system that includes one or more processors and one or more memories coupled to the one or more processors. The processing system is configured to receive sensor data associated with a scene and to generate a cylindrical representation associated with the scene. The processing system is further configured to modify the cylindrical representation based on detecting a feature of the cylindrical representation being included in a first region of the cylindrical representation. Modifying the cylindrical representation includes relocating the feature from the first region to a second region that is different than the first region. The processing system is further configured to perform, based on the modified cylindrical representation, one or more three-dimensional (3D) perception operations associated with the scene.
-
公开(公告)号:US20250095173A1
公开(公告)日:2025-03-20
申请号:US18467035
申请日:2023-09-14
Applicant: QUALCOMM Incorporated
Inventor: Savitha Srinivasan , Varun Ravi Kumar , Senthil Kumar Yogamani
Abstract: An example device for training a neural network includes a memory configured to store a neural network model for the neural network; and a processing system comprising one or more processors implemented in circuitry, the processing system being configured to: extract image features from an image of an area, the image features representing objects in the area; extract point cloud features from a point cloud representation of the area, the point cloud features representing the objects in the area; add Gaussian noise to a ground truth depth map for the area to generate a noisy ground truth depth map, the ground truth depth map representing accurate positions of the objects in the area; and train the neural network using the image features, the point cloud features, and the noisy ground truth depth map to generate a depth map.
-
-
-
-
-
-
-
-
-