-
公开(公告)号:US20240339035A1
公开(公告)日:2024-10-10
申请号:US18745370
申请日:2024-06-17
Applicant: NVIDIA Corporation
Inventor: Davide Marco Onofrio , Hae-Jong Seo , David Nister , Minwoo Park , Neda Cvijetic
CPC classification number: G08G1/167 , G06F18/23 , G06N3/08 , G06V20/588
Abstract: In various examples, a path perception ensemble is used to produce a more accurate and reliable understanding of a driving surface and/or a path there through. For example, an analysis of a plurality of path perception inputs provides testability and reliability for accurate and redundant lane mapping and/or path planning in real-time or near real-time. By incorporating a plurality of separate path perception computations, a means of metricizing path perception correctness, quality, and reliability is provided by analyzing whether and how much the individual path perception signals agree or disagree. By implementing this approach—where individual path perception inputs fail in almost independent ways—a system failure is less statistically likely. In addition, with diversity and redundancy in path perception, comfortable lane keeping on high curvature roads, under severe road conditions, and/or at complex intersections, as well as autonomous negotiation of turns at intersections, may be enabled.
-
22.
公开(公告)号:US11906660B2
公开(公告)日:2024-02-20
申请号:US17005788
申请日:2020-08-28
Applicant: NVIDIA Corporation
Inventor: Tilman Wekel , Sangmin Oh , David Nister , Joachim Pehserl , Neda Cvijetic , Ibrahim Eden
IPC: G01S7/00 , G01S7/48 , G01S17/894 , G01S7/481 , G01S17/931 , G06V10/764 , G06V10/80 , G06V10/82 , G06V20/58 , G01S7/28
CPC classification number: G01S7/4802 , G01S7/481 , G01S17/894 , G01S17/931 , G06V10/764 , G06V10/80 , G06V10/82 , G06V20/58 , G01S7/28
Abstract: In various examples, a deep neural network (DNN) may be used to detect and classify animate objects and/or parts of an environment. The DNN may be trained using camera-to-LiDAR cross injection to generate reliable ground truth data for LiDAR range images. For example, annotations generated in the image domain may be propagated to the LiDAR domain to increase the accuracy of the ground truth data in the LiDAR domain—e.g., without requiring manual annotation in the LiDAR domain. Once trained, the DNN may output instance segmentation masks, class segmentation masks, and/or bounding shape proposals corresponding to two-dimensional (2D) LiDAR range images, and the outputs may be fused together to project the outputs into three-dimensional (3D) LiDAR point clouds. This 2D and/or 3D information output by the DNN may be provided to an autonomous vehicle drive stack to enable safe planning and control of the autonomous vehicle.
-
公开(公告)号:US11698272B2
公开(公告)日:2023-07-11
申请号:US17007873
申请日:2020-08-31
Applicant: NVIDIA Corporation
Inventor: Michael Kroepfl , Amir Akbarzadeh , Ruchi Bhargava , Vaibhav Thukral , Neda Cvijetic , Vadim Cugunovs , David Nister , Birgit Henke , Ibrahim Eden , Youding Zhu , Michael Grabner , Ivana Stojanovic , Yu Sheng , Jeffrey Liu , Enliang Zheng , Jordan Marr , Andrew Carley
CPC classification number: G01C21/3841 , G01C21/1652 , G01C21/3811 , G01C21/3867 , G01C21/3878 , G01C21/3896 , G06N3/02
Abstract: An end-to-end system for data generation, map creation using the generated data, and localization to the created map is disclosed. Mapstreams—or streams of sensor data, perception outputs from deep neural networks (DNNs), and/or relative trajectory data—corresponding to any number of drives by any number of vehicles may be generated and uploaded to the cloud. The mapstreams may be used to generate map data—and ultimately a fused high definition (HD) map—that represents data generated over a plurality of drives. When localizing to the fused HD map, individual localization results may be generated based on comparisons of real-time data from a sensor modality to map data corresponding to the same sensor modality. This process may be repeated for any number of sensor modalities and the results may be fused together to determine a final fused localization result.
-
公开(公告)号:US11604944B2
公开(公告)日:2023-03-14
申请号:US16514230
申请日:2019-07-17
Applicant: NVIDIA Corporation
Inventor: Minwoo Park , Xiaolin Lin , Hae-Jong Seo , David Nister , Neda Cvijetic
IPC: G06K9/62 , G06V10/75 , G06V20/56 , G06V10/44 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/94 , G05D1/00 , G06N3/04 , G06N3/08
Abstract: In various examples, systems and methods are disclosed that preserve rich spatial information from an input resolution of a machine learning model to regress on lines in an input image. The machine learning model may be trained to predict, in deployment, distances for each pixel of the input image at an input resolution to a line pixel determined to correspond to a line in the input image. The machine learning model may further be trained to predict angles and label classes of the line. An embedding algorithm may be used to train the machine learning model to predict clusters of line pixels that each correspond to a respective line in the input image. In deployment, the predictions of the machine learning model may be used as an aid for understanding the surrounding environment—e.g., for updating a world model—in a variety of autonomous machine applications.
-
公开(公告)号:US11520345B2
公开(公告)日:2022-12-06
申请号:US16781893
申请日:2020-02-04
Applicant: NVIDIA Corporation
Inventor: Davide Marco Onofrio , Hae-Jong Seo , David Nister , Minwoo Park , Neda Cvijetic
Abstract: In various examples, a path perception ensemble is used to produce a more accurate and reliable understanding of a driving surface and/or a path there through. For example, an analysis of a plurality of path perception inputs provides testability and reliability for accurate and redundant lane mapping and/or path planning in real-time or near real-time. By incorporating a plurality of separate path perception computations, a means of metricizing path perception correctness, quality, and reliability is provided by analyzing whether and how much the individual path perception signals agree or disagree. By implementing this approach—where individual path perception inputs fail in almost independent ways—a system failure is less statistically likely. In addition, with diversity and redundancy in path perception, comfortable lane keeping on high curvature roads, under severe road conditions, and/or at complex intersections, as well as autonomous negotiation of turns at intersections, may be enabled.
-
公开(公告)号:US20210201145A1
公开(公告)日:2021-07-01
申请号:US17116138
申请日:2020-12-09
Applicant: NVIDIA Corporation
Inventor: Trung Pham , Berta Rodriguez Hervas , Minwoo Park , David Nister , Neda Cvijetic
Abstract: In various examples, a three-dimensional (3D) intersection structure may be predicted using a deep neural network (DNN) based on processing two-dimensional (2D) input data. To train the DNN to accurately predict 3D intersection structures from 2D inputs, the DNN may be trained using a first loss function that compares 3D outputs of the DNN—after conversion to 2D space—to 2D ground truth data and a second loss function that analyzes the 3D predictions of the DNN in view of one or more geometric constraints—e.g., geometric knowledge of intersections may be used to penalize predictions of the DNN that do not align with known intersection and/or road structure geometries. As such, live perception of an autonomous or semi-autonomous vehicle may be used by the DNN to detect 3D locations of intersection structures from 2D inputs.
-
公开(公告)号:US20200026960A1
公开(公告)日:2020-01-23
申请号:US16514230
申请日:2019-07-17
Applicant: NVIDIA Corporation
Inventor: Minwoo Park , Xiaolin Lin , Hae-Jong Seo , David Nister , Neda Cvijetic
Abstract: In various examples, systems and methods are disclosed that preserve rich spatial information from an input resolution of a machine learning model to regress on lines in an input image. The machine learning model may be trained to predict, in deployment, distances for each pixel of the input image at an input resolution to a line pixel determined to correspond to a line in the input image. The machine learning model may further be trained to predict angles and label classes of the line. An embedding algorithm may be used to train the machine learning model to predict clusters of line pixels that each correspond to a respective line in the input image. In deployment, the predictions of the machine learning model may be used as an aid for understanding the surrounding environment—e.g., for updating a world model—in a variety of autonomous machine applications.
-
公开(公告)号:US12299892B2
公开(公告)日:2025-05-13
申请号:US18391276
申请日:2023-12-20
Applicant: NVIDIA Corporation
Inventor: Trung Pham , Berta Rodriguez Hervas , Minwoo Park , David Nister , Neda Cvijetic
IPC: G06T7/11 , G05B13/02 , G06F18/21 , G06F18/24 , G06N3/04 , G06N3/08 , G06T3/4046 , G06T5/70 , G06T11/20 , G06V10/26 , G06V10/34 , G06V10/44 , G06V10/82 , G06V20/56 , G06V30/19 , G06V30/262
Abstract: In various examples, live perception from sensors of a vehicle may be leveraged to detect and classify intersection contention areas in an environment of a vehicle in real-time or near real-time. For example, a deep neural network (DNN) may be trained to compute outputs—such as signed distance functions—that may correspond to locations of boundaries delineating intersection contention areas. The signed distance functions may be decoded and/or post-processed to determine instance segmentation masks representing locations and classifications of intersection areas or regions. The locations of the intersections areas or regions may be generated in image-space and converted to world-space coordinates to aid an autonomous or semi-autonomous vehicle in navigating intersections according to rules of the road, traffic priority considerations, and/or the like.
-
公开(公告)号:US20250138530A1
公开(公告)日:2025-05-01
申请号:US19005734
申请日:2024-12-30
Applicant: NVIDIA Corporation
Inventor: Minwoo Park , Xiaolin Lin , Hae-Jong Seo , David Nister , Neda Cvijetic
IPC: G05D1/00 , G05D1/228 , G06F18/214 , G06F18/23 , G06F18/2411 , G06N3/04 , G06V10/44 , G06V10/48 , G06V10/75 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/94 , G06V20/56
Abstract: In various examples, systems and methods are disclosed that preserve rich spatial information from an input resolution of a machine learning model to regress on lines in an input image. The machine learning model may be trained to predict, in deployment, distances for each pixel of the input image at an input resolution to a line pixel determined to correspond to a line in the input image. The machine learning model may further be trained to predict angles and label classes of the line. An embedding algorithm may be used to train the machine learning model to predict clusters of line pixels that each correspond to a respective line in the input image. In deployment, the predictions of the machine learning model may be used as an aid for understanding the surrounding environment—e.g., for updating a world model—in a variety of autonomous machine applications.
-
公开(公告)号:US20240230339A1
公开(公告)日:2024-07-11
申请号:US18615894
申请日:2024-03-25
Applicant: NVIDIA Corporation
Inventor: Trung Pham , Hang Dou , Berta Rodriguez Hervas , Minwoo Park , Neda Cvijetic , David Nister
CPC classification number: G01C21/26 , G06N3/04 , G06N3/08 , G06V10/454 , G06V10/462 , G06V10/764 , G06V10/82 , G06V20/56 , G06F2218/12
Abstract: In various examples, live perception from sensors of a vehicle may be leveraged to generate potential paths for the vehicle to navigate an intersection in real-time or near real-time. For example, a deep neural network (DNN) may be trained to compute various outputs—such as heat maps corresponding to key points associated with the intersection, vector fields corresponding to directionality, heading, and offsets with respect to lanes, intensity maps corresponding to widths of lanes, and/or classifications corresponding to line segments of the intersection. The outputs may be decoded and/or otherwise post-processed to reconstruct an intersection—or key points corresponding thereto—and to determine proposed or potential paths for navigating the vehicle through the intersection.
-
-
-
-
-
-
-
-
-