-
公开(公告)号:US10884433B2
公开(公告)日:2021-01-05
申请号:US16100479
申请日:2018-08-10
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Manmohan Chandraker , Hyo Jin Kim
IPC: G05D1/08 , G06T7/73 , G05D1/00 , G08G5/04 , B64C39/02 , G08G5/00 , G06N3/04 , G06N3/08 , G06N5/04 , G06T7/00 , G06F16/29 , G01C21/32
Abstract: A computer-implemented method, system, and computer program product are provided for a stabilization system utilizing pose estimation in an aerial drone. The method includes receiving, by a pose estimation system, a plurality of images from one or more cameras. The method also includes predicting, by the pose estimation system, a pose from the score map and a combined feature map, the combined feature map correlated from a pair of the plurality of images. The method additionally includes moving, by a propulsion system, the aerial drone responsive to the pose.
-
12.
公开(公告)号:US10832084B2
公开(公告)日:2020-11-10
申请号:US16526306
申请日:2019-07-30
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. Fathy Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: A method for estimating dense 3D geometric correspondences between two input point clouds by employing a 3D convolutional neural network (CNN) architecture is presented. The method includes, during a training phase, transforming the two input point clouds into truncated distance function voxel grid representations, feeding the truncated distance function voxel grid representations into individual feature extraction layers with tied weights, extracting low-level features from a first feature extraction layer, extracting high-level features from a second feature extraction layer, normalizing the extracted low-level features and high-level features, and applying deep supervision of multiple contrastive losses and multiple hard negative mining modules at the first and second feature extraction layers. The method further includes, during a testing phase, employing the high-level features capturing high-level semantic information to obtain coarse matching locations, and refining the coarse matching locations with the low-level features to capture low-level geometric information for estimating precise matching locations.
-
公开(公告)号:US10762359B2
公开(公告)日:2020-09-01
申请号:US16029167
申请日:2018-07-06
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. F. Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: Systems and methods for detecting traffic scenarios include an image capturing device which captures two or more images of an area of a traffic environment with each image having a different view of vehicles and a road in the traffic environment. A hierarchical feature extractor concurrently extracts features at multiple neural network layers from each of the images, with the features including geometric features and semantic features, and for estimating correspondences between semantic features for each of the images and refining the estimated correspondences with correspondences between the geometric features of each of the images to generate refined correspondence estimates. A traffic localization module uses the refined correspondence estimates to determine locations of vehicles in the environment in three dimensions to automatically determine a traffic scenario according to the locations of vehicles. A notification device generates a notification of the traffic scenario.
-
14.
公开(公告)号:US10679075B2
公开(公告)日:2020-06-09
申请号:US16029126
申请日:2018-07-06
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. F. Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: Systems and methods for correspondence estimation and flexible ground modeling include communicating two-dimensional (2D) images of an environment to a correspondence estimation module, including a first image and a second image captured by an image capturing device. First features, including geometric features and semantic features, are hierarchically extract from the first image with a first convolutional neural network (CNN) according to activation map weights, and second features, including geometric features and semantic features, are hierarchically extracted from the second image with a second CNN according to the activation map weights. Correspondences between the first features and the second features are estimated, including hierarchical fusing of geometric correspondences and semantic correspondences. A 3-dimensional (3D) model of a terrain is estimated using the estimated correspondences belonging to the terrain surface. Relative locations of elements and objects in the environment are determined according to the 3D model of the terrain. A user is notified of the relative locations.
-
公开(公告)号:US10331974B2
公开(公告)日:2019-06-25
申请号:US15709897
申请日:2017-09-20
Applicant: NEC Laboratories America, Inc.
Inventor: Muhammad Zeeshan Zia , Quoc-Huy Tran , Xiang Yu , Manmohan Chandraker , Chi Li
IPC: G06K9/00 , G06K9/62 , G06F17/50 , G06N3/02 , G06T11/60 , G06T15/40 , G05D1/02 , G08G1/16 , G06T7/73 , G06N3/08 , G06T15/10 , B60W30/00 , G08G1/0962 , G06T7/55 , G06K9/46
Abstract: An action recognition system and method are provided. The action recognition system includes an image capture device configured to capture an actual image depicting an object. The action recognition system includes a processor configured to render, based on a set of 3D CAD models, synthetic images with corresponding intermediate shape concept labels. The processor is configured to form a multi-layer CNN which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN thereto to generate an image pair including a 2D and 3D geometric structure of the object. The processor is configured to control a device to perform a response action in response to an identification of an action performed by the object, wherein the identification of the action is based on the image pair.
-
公开(公告)号:US11599974B2
公开(公告)日:2023-03-07
申请号:US17090508
申请日:2020-11-05
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Bingbing Zhuang , Pan Ji , Manmohan Chandraker
Abstract: A method for jointly removing rolling shutter (RS) distortions and blur artifacts in a single input RS and blurred image is presented. The method includes generating a plurality of RS blurred images from a camera, synthesizing RS blurred images from a set of GS sharp images, corresponding GS sharp depth maps, and synthesized RS camera motions by employing a structure-and-motion-aware RS distortion and blur rendering module to generate training data to train a single-view joint RS correction and deblurring convolutional neural network (CNN), and predicting an RS rectified and deblurred image from the single input RS and blurred image by employing the single-view joint RS correction and deblurring CNN.
-
17.
公开(公告)号:US11462112B2
公开(公告)日:2022-10-04
申请号:US16787727
申请日:2020-02-11
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Samuel Schulter , Paul Vernaza , Buyu Liu , Pan Ji , Yi-Hsuan Tsai , Manmohan Chandraker
Abstract: A method is provided in an Advanced Driver-Assistance System (ADAS). The method extracts, from an input video stream including a plurality of images using a multi-task Convolutional Neural Network (CNN), shared features across different perception tasks. The perception tasks include object detection and other perception tasks. The method concurrently solves, using the multi-task CNN, the different perception tasks in a single pass by concurrently processing corresponding ones of the shared features by respective different branches of the multi-task CNN to provide a plurality of different perception task outputs. Each respective different branch corresponds to a respective one of the different perception tasks. The method forms a parametric representation of a driving scene as at least one top-view map responsive to the plurality of different perception task outputs. The method controls an operation of the vehicle for collision avoidance responsive to the at least one top-view map indicating an impending collision.
-
公开(公告)号:US20210065391A1
公开(公告)日:2021-03-04
申请号:US16987705
申请日:2020-08-07
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Pan Ji , Manmohan Chandraker , Lokender Tiwari
Abstract: A method for improving geometry-based monocular structure from motion (SfM) by exploiting depth maps predicted by convolutional neural networks (CNNs) is presented. The method includes capturing a sequence of RGB images from an unlabeled monocular video stream obtained by a monocular camera, feeding the RGB images into a depth estimation/refinement module, outputting depth maps, feeding the depth maps and the RGB images to a pose estimation/refinement module, the depths maps and the RGB images collectively defining pseudo RGB-D images, outputting camera poses and point clouds, and constructing a 3D map of a surrounding environment displayed on a visualization device.
-
19.
公开(公告)号:US20200372614A1
公开(公告)日:2020-11-26
申请号:US16867805
申请日:2020-05-06
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Bingbing Zhuang , Pan Ji , Manmohan Chandraker
Abstract: A method for correcting blur effects is presented. The method includes generating a plurality of images from a camera, synthesizing blurred images from sharp image counterparts to generate training data to train a structure-and-motion-aware convolutional neural network (CNN), and predicting a camera motion and a depth map from a single blurred image by employing the structure-and-motion-aware CNN to remove blurring from the single blurred image.
-
20.
公开(公告)号:US20200286383A1
公开(公告)日:2020-09-10
申请号:US16787727
申请日:2020-02-11
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Samuel Schulter , Paul Vernaza , Buyu Liu , Pan Ji , Yi-Hsuan Tsai , Manmohan Chandraker
Abstract: A method is provided in an Advanced Driver-Assistance System (ADAS). The method extracts, from an input video stream including a plurality of images using a multi-task Convolutional Neural Network (CNN), shared features across different perception tasks. The perception tasks include object detection and other perception tasks. The method concurrently solves, using the multi-task CNN, the different perception tasks in a single pass by concurrently processing corresponding ones of the shared features by respective different branches of the multi-task CNN to provide a plurality of different perception task outputs. Each respective different branch corresponds to a respective one of the different perception tasks. The method forms a parametric representation of a driving scene as at least one top-view map responsive to the plurality of different perception task outputs. The method controls an operation of the vehicle for collision avoidance responsive to the at least one top-view map indicating an impending collision.
-
-
-
-
-
-
-
-
-