-
1.
公开(公告)号:US20220414974A1
公开(公告)日:2022-12-29
申请号:US17696490
申请日:2022-03-16
发明人: Sergey Zakharov , Wadim Kehl , Vitor Guizilini , Adrien David Gaidon , Rares A. Ambrus , Dennis Park , Joshua Tenenbaum , Jiajun Wu , Fredo Durand , Vincent Sitzmann
摘要: Systems and methods described herein relate to reconstructing a scene in three dimensions from a two-dimensional image. One embodiment processes an image using a detection transformer to detect an object in the scene and to generate a NOCS map of the object and a background depth map; uses MLPs to relate the object to a differentiable database of object priors (PriorDB); recovers, from the NOCS map, a partial 3D object shape; estimates an initial object pose; fits a PriorDB object prior to align in geometry and appearance with the partial 3D shape to produce a complete shape and refines the initial pose estimate; generates an editable and re-renderable 3D scene reconstruction based, at least in part, on the complete shape, the refined pose estimate, and the depth map; and controls the operation of a robot based, at least in part, on the editable and re-renderable 3D scene reconstruction.
-
公开(公告)号:US11479239B2
公开(公告)日:2022-10-25
申请号:US17097727
申请日:2020-11-13
摘要: Aspects of the disclosure provide a parking method and system for a vehicle. For example, the parking system can include a processor and an output circuit. As another example, the parking system can include the processor and a compelling unit. The processor can be configured to determine whether a first weather condition, such as snowing, raining, icing and hailing, at a first parking location meets a first criterion, and identify, when the first weather condition meets the first criterion, a second parking location at which a second weather condition meets a second criterion different from the first criterion. The output unit can be configured to output an alert to the vehicle parked at the first parking location indicating that the vehicle is to be moved to the second parking location. The compelling unit can be configured to compel the vehicle to move to the second parking location.
-
公开(公告)号:US20210056321A1
公开(公告)日:2021-02-25
申请号:US16745560
申请日:2020-01-17
摘要: An artificial intelligence perception system for detecting one or more objects includes one or more processors, at least one sensor, and a memory device. The memory device includes an image capture module, an object identifying module, and a logical scaffold module. The image capture module and the object identifying module cause the one or more processors to obtain sensor information of a field of view from a sensor, identify an object within the sensor information, and determine at least one property of the object. The logical scaffold module causes the one or more processors to determine, by a logical scaffold, when the at least one property of the object as determined by the object identifying module is one of a true condition or a false condition.
-
4.
公开(公告)号:US11887248B2
公开(公告)日:2024-01-30
申请号:US17696490
申请日:2022-03-16
发明人: Sergey Zakharov , Wadim Kehl , Vitor Guizilini , Adrien David Gaidon , Rares A. Ambrus , Dennis Park , Joshua Tenenbaum , Jiajun Wu , Fredo Durand , Vincent Sitzmann
CPC分类号: G06T15/205 , G06T7/70 , G06T19/20 , G06T2207/10024 , G06T2207/20084 , G06T2219/2016
摘要: Systems and methods described herein relate to reconstructing a scene in three dimensions from a two-dimensional image. One embodiment processes an image using a detection transformer to detect an object in the scene and to generate a NOCS map of the object and a background depth map; uses MLPs to relate the object to a differentiable database of object priors (PriorDB); recovers, from the NOCS map, a partial 3D object shape; estimates an initial object pose; fits a PriorDB object prior to align in geometry and appearance with the partial 3D shape to produce a complete shape and refines the initial pose estimate; generates an editable and re-renderable 3D scene reconstruction based, at least in part, on the complete shape, the refined pose estimate, and the depth map; and controls the operation of a robot based, at least in part, on the editable and re-renderable 3D scene reconstruction.
-
公开(公告)号:US11518408B2
公开(公告)日:2022-12-06
申请号:US17097580
申请日:2020-11-13
IPC分类号: B60Q1/00 , B60W60/00 , G06K9/62 , B60W40/09 , B60W50/14 , B60W30/18 , A61B5/18 , A61B5/00 , G06V20/59 , G06V40/16 , B60W40/08
摘要: A driver monitor system and method for predicting impairment of a user of a vehicle. The system includes video cameras, an input device for inputting a list of medications being taken by the driver. Processing circuitry predicts side effects of the medications based on the half-life of the medication, detecting eye gaze movement, eye lid position, and facial expression of the user using images from the video camera, predicting whether the user is transitioning into an impaired physical state that is a side effect of the medications, verifying the side effect of the medications, determining whether the user is fit to drive using the verified side effects of the medications, and outputting to the vehicle an instruction to operate the vehicle in a level of automation that makes up for the at least one side effect or to perform a safe pull over operation of the vehicle.
-
公开(公告)号:US11501525B2
公开(公告)日:2022-11-15
申请号:US16843026
申请日:2020-04-08
发明人: Rui Hou , Jie Li , Vitor Guizilini , Adrien David Gaidon , Dennis Park , Arjun Bhargava
摘要: Systems and methods for panoptic image segmentation are disclosed herein. One embodiment performs semantic segmentation and object detection on an input image, wherein the object detection generates a plurality of bounding boxes associated with an object in the input image; selects a query bounding box from among the plurality of bounding boxes; maps at least one of the bounding boxes in the plurality of bounding boxes other than the query bounding box to the query bounding box based on similarity between the at least one of the bounding boxes and the query bounding box to generate a mask assignment for the object, the mask assignment defining a contour of the object; compares the mask assignment with results of the semantic segmentation to produce a refined mask assignment for the object; and outputs a panoptic segmentation of the input image that includes the refined mask assignment for the object.
-
公开(公告)号:US20220300851A1
公开(公告)日:2022-09-22
申请号:US17205709
申请日:2021-03-18
发明人: Dennis Park , Adrien David Gaidon
摘要: A system for training a multi-task model includes a processor and a memory in communication with the processor. The memory has a multi-task training module having instructions that, when executed by the processor, causes the processor to provide simulation training data having a plurality of samples to a multi-task model capable of performing at least a first task and a second task using at least one shared. The training module further causes the processor to determine a first value (gradience or loss) for the first task and a second value (gradience or loss) for a second task using the simulation training data and the at least one shared parameter, determine a task induced variance between the first value and the second value, and iteratively adjust the at least one shared parameter to reduce the task induced variance.
-
公开(公告)号:US20220301202A1
公开(公告)日:2022-09-22
申请号:US17333537
申请日:2021-05-28
发明人: Dennis Park , Rares A. Ambrus , Vitor Guizilini , Jie Li , Adrien David Gaidon
摘要: System, methods, and other embodiments described herein relate to performing depth estimation and object detection using a common network architecture. In one embodiment, a method includes generating, using a backbone of a combined network, a feature map at multiple scales from an input image. The method includes decoding, using a top-down pathway of the combined network, the feature map to provide features at the multiple scales. The method includes generating, using a head of the combined network, a depth map from the features for a scene depicted in the input image, and bounding boxes identifying objects in the input image.
-
公开(公告)号:US12086695B2
公开(公告)日:2024-09-10
申请号:US17205709
申请日:2021-03-18
发明人: Dennis Park , Adrien David Gaidon
摘要: A system for training a multi-task model includes a processor and a memory in communication with the processor. The memory has a multi-task training module having instructions that, when executed by the processor, causes the processor to provide simulation training data having a plurality of samples to a multi-task model capable of performing at least a first task and a second task using at least one shared. The training module further causes the processor to determine a first value (gradience or loss) for the first task and a second value (gradience or loss) for a second task using the simulation training data and the at least one shared parameter, determine a task induced variance between the first value and the second value, and iteratively adjust the at least one shared parameter to reduce the task induced variance.
-
公开(公告)号:US12067785B2
公开(公告)日:2024-08-20
申请号:US17358497
申请日:2021-06-25
发明人: Rares A. Ambrus , Dennis Park , Vitor Guizilini , Jie Li , Adrien David Gaidon
IPC分类号: G06K9/62 , G01S17/42 , G01S17/89 , G01S17/931 , G06F18/21 , G06F18/2113 , G06F18/214 , G06F18/25 , G06K9/00 , G06K9/46 , G06N3/04 , G06N3/08 , G06N20/00 , G06T7/10 , G06T7/11 , G06T7/50 , G06V10/46 , G06V10/75 , G06V20/56 , G06V20/58
CPC分类号: G06V20/58 , G01S17/42 , G01S17/89 , G01S17/931 , G06F18/2113 , G06F18/2155 , G06F18/217 , G06F18/251 , G06N3/04 , G06N3/08 , G06N20/00 , G06T7/10 , G06T7/11 , G06T7/50 , G06V10/462 , G06V10/757 , G06V20/56 , G06T2207/10024 , G06T2207/10028 , G06T2207/20016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30248
摘要: Systems, methods, and other embodiments described herein relate to evaluating a perception network in relation to the accuracy of depth estimates and object detections. In one embodiment, a method includes segmenting range data associated with an image according to bounding boxes of objects identified in the image to produce masked data. The method includes comparing the masked data with corresponding depth estimates in the depth map according to an evaluation mask that correlates the depth estimates with the depth map. The method includes providing a metric that quantifies the comparing to assess a network that generated the depth map and the bounding boxes.
-
-
-
-
-
-
-
-
-