-
公开(公告)号:US10884433B2
公开(公告)日:2021-01-05
申请号:US16100479
申请日:2018-08-10
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Manmohan Chandraker , Hyo Jin Kim
IPC: G05D1/08 , G06T7/73 , G05D1/00 , G08G5/04 , B64C39/02 , G08G5/00 , G06N3/04 , G06N3/08 , G06N5/04 , G06T7/00 , G06F16/29 , G01C21/32
Abstract: A computer-implemented method, system, and computer program product are provided for a stabilization system utilizing pose estimation in an aerial drone. The method includes receiving, by a pose estimation system, a plurality of images from one or more cameras. The method also includes predicting, by the pose estimation system, a pose from the score map and a combined feature map, the combined feature map correlated from a pair of the plurality of images. The method additionally includes moving, by a propulsion system, the aerial drone responsive to the pose.
-
公开(公告)号:US10853627B2
公开(公告)日:2020-12-01
申请号:US16145537
申请日:2018-09-28
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Xi Yin , Kihyuk Sohn , Manmohan Chandraker
Abstract: A computer-implemented method, system, and computer program product are provided for facial recognition. The method includes receiving, by a processor device, a plurality of images. The method also includes extracting, by the processor device with a feature extractor utilizing a convolutional neural network (CNN) with an enlarged intra-class variance of long-tail classes, feature vectors for each of the plurality of images. The method additionally includes generating, by the processor device with a feature generator, discriminative feature vectors for each of the feature vectors. The method further includes classifying, by the processor device utilizing a fully connected classifier, an identity from the discriminative feature vector. The method also includes control an operation of a processor-based machine to react in accordance with the identity.
-
23.
公开(公告)号:US10832084B2
公开(公告)日:2020-11-10
申请号:US16526306
申请日:2019-07-30
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. Fathy Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: A method for estimating dense 3D geometric correspondences between two input point clouds by employing a 3D convolutional neural network (CNN) architecture is presented. The method includes, during a training phase, transforming the two input point clouds into truncated distance function voxel grid representations, feeding the truncated distance function voxel grid representations into individual feature extraction layers with tied weights, extracting low-level features from a first feature extraction layer, extracting high-level features from a second feature extraction layer, normalizing the extracted low-level features and high-level features, and applying deep supervision of multiple contrastive losses and multiple hard negative mining modules at the first and second feature extraction layers. The method further includes, during a testing phase, employing the high-level features capturing high-level semantic information to obtain coarse matching locations, and refining the coarse matching locations with the low-level features to capture low-level geometric information for estimating precise matching locations.
-
公开(公告)号:US10762359B2
公开(公告)日:2020-09-01
申请号:US16029167
申请日:2018-07-06
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. F. Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: Systems and methods for detecting traffic scenarios include an image capturing device which captures two or more images of an area of a traffic environment with each image having a different view of vehicles and a road in the traffic environment. A hierarchical feature extractor concurrently extracts features at multiple neural network layers from each of the images, with the features including geometric features and semantic features, and for estimating correspondences between semantic features for each of the images and refining the estimated correspondences with correspondences between the geometric features of each of the images to generate refined correspondence estimates. A traffic localization module uses the refined correspondence estimates to determine locations of vehicles in the environment in three dimensions to automatically determine a traffic scenario according to the locations of vehicles. A notification device generates a notification of the traffic scenario.
-
25.
公开(公告)号:US10679075B2
公开(公告)日:2020-06-09
申请号:US16029126
申请日:2018-07-06
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Mohammed E. F. Salem , Muhammad Zeeshan Zia , Paul Vernaza , Manmohan Chandraker
Abstract: Systems and methods for correspondence estimation and flexible ground modeling include communicating two-dimensional (2D) images of an environment to a correspondence estimation module, including a first image and a second image captured by an image capturing device. First features, including geometric features and semantic features, are hierarchically extract from the first image with a first convolutional neural network (CNN) according to activation map weights, and second features, including geometric features and semantic features, are hierarchically extracted from the second image with a second CNN according to the activation map weights. Correspondences between the first features and the second features are estimated, including hierarchical fusing of geometric correspondences and semantic correspondences. A 3-dimensional (3D) model of a terrain is estimated using the estimated correspondences belonging to the terrain surface. Relative locations of elements and objects in the environment are determined according to the 3D model of the terrain. A user is notified of the relative locations.
-
公开(公告)号:US10678257B2
公开(公告)日:2020-06-09
申请号:US16146202
申请日:2018-09-28
Applicant: NEC Laboratories America, Inc.
Inventor: Samuel Schulter , Paul Vernaza , Manmohan Chandraker , Menghua Zhai
IPC: G05D1/02 , G05D1/00 , G06T17/05 , G06T7/194 , G06T7/514 , G06T7/50 , G06K9/00 , G06T15/20 , G06T7/11 , G06T3/00 , G06T15/40
Abstract: Systems and methods for generating an occlusion-aware bird's eye view map of a road scene include identifying foreground objects and background objects in an input image to extract foreground features and background features corresponding to the foreground objects and the background objects, respectively. The foreground objects are masked from the input image with a mask. Occluded objects and depths of the occluded objects are inferred by predicting semantic features and depths in masked areas of the masked image according to contextual information related to the background features visible in the masked image. The foreground objects and the background objects are mapped to a three-dimensional space according to locations of each of the foreground objects, the background objects and occluded objects using the inferred depths. A bird's eye view is generated from the three-dimensional space and displayed with a display device.
-
公开(公告)号:US20200050900A1
公开(公告)日:2020-02-13
申请号:US16526073
申请日:2019-07-30
Applicant: NEC Laboratories America, Inc.
Inventor: Samuel Schulter , Ziyan Wang , Buyu Liu , Manmohan Chandraker
Abstract: A method for implementing parametric models for scene representation to improve autonomous task performance includes generating an initial map of a scene based on at least one image corresponding to a perspective view of the scene, the initial map including a non-parametric top-view representation of the scene, implementing a parametric model to obtain a scene element representation based on the initial map, the scene element representation providing a description of one or more scene elements of the scene and corresponding to an estimated semantic layout of the scene, identifying one or more predicted locations of the one or more scene elements by performing three-dimensional localization based on the at least one image, and obtaining an overlay for performing an autonomous task by placing the one or more scene elements with the one or more respective predicted locations onto the scene element representation.
-
公开(公告)号:US10474882B2
公开(公告)日:2019-11-12
申请号:US15888747
申请日:2018-02-05
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Kihyuk Sohn , Manmohan Chandraker
Abstract: A video surveillance system is provided. The system includes a device configured to capture an input image of a subject located in an area. The system further includes a processor. The processor estimates, using a three-dimensional Morphable Model (3DMM) conditioned Generative Adversarial Network, 3DMM coefficients for the subject of the input image. The subject varies from an ideal front pose. The processor produces, using an image generator, a synthetic frontal face image of the subject of the input image based on the input image and coefficients. An area spanning the frontal face of the subject is made larger in the synthetic than in the input image. The processor provides, using a discriminator, a decision of whether the subject of the synthetic image is an actual person. The processor provides, using a face recognition engine, an identity of the subject in the input image based on the synthetic and input images.
-
29.
公开(公告)号:US10402690B2
公开(公告)日:2019-09-03
申请号:US15801688
申请日:2017-11-02
Applicant: NEC Laboratories America, Inc.
Inventor: Paul Vernaza , Manmohan Chandraker
Abstract: Systems and methods for training semantic segmentation. Embodiments of the present invention include predicting semantic labeling of each pixel in each of at least one training image using a semantic segmentation model. Further included is predicting semantic boundaries at boundary pixels of objects in the at least one training image using a semantic boundary model concurrently with predicting the semantic labeling. Also included is propagating sparse labels to every pixel in the at least one training image using the predicted semantic boundaries. Additionally, the embodiments include optimizing a loss function according the predicted semantic labeling and the propagated sparse labels to concurrently train the semantic segmentation model and the semantic boundary model to accurately and efficiently generate a learned semantic segmentation model from sparsely annotated training images.
-
公开(公告)号:US10331974B2
公开(公告)日:2019-06-25
申请号:US15709897
申请日:2017-09-20
Applicant: NEC Laboratories America, Inc.
Inventor: Muhammad Zeeshan Zia , Quoc-Huy Tran , Xiang Yu , Manmohan Chandraker , Chi Li
IPC: G06K9/00 , G06K9/62 , G06F17/50 , G06N3/02 , G06T11/60 , G06T15/40 , G05D1/02 , G08G1/16 , G06T7/73 , G06N3/08 , G06T15/10 , B60W30/00 , G08G1/0962 , G06T7/55 , G06K9/46
Abstract: An action recognition system and method are provided. The action recognition system includes an image capture device configured to capture an actual image depicting an object. The action recognition system includes a processor configured to render, based on a set of 3D CAD models, synthetic images with corresponding intermediate shape concept labels. The processor is configured to form a multi-layer CNN which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN thereto to generate an image pair including a 2D and 3D geometric structure of the object. The processor is configured to control a device to perform a response action in response to an identification of an action performed by the object, wherein the identification of the action is based on the image pair.
-
-
-
-
-
-
-
-
-