-
公开(公告)号:US20210110178A1
公开(公告)日:2021-04-15
申请号:US17128492
申请日:2020-12-21
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Kihyuk Sohn , Buyu Liu , Manmohan Chandraker , Jong-Chyi Su
IPC: G06K9/00 , G06K9/62 , B60W30/095 , B60W30/09 , B60W10/18 , B60W10/20 , G08G1/16 , B60W50/00 , G06N3/08 , G06N3/04
Abstract: Systems and methods for obstacle detection are provided. The system aligns image level features between a source domain and a target domain based on an adversarial learning process while training a domain discriminator. The target domain includes one or more road scenes having obstacles. The system selects, using the domain discriminator, unlabeled samples from the target domain that are far away from existing annotated samples from the target domain. The system selects, based on a prediction score of each of the unlabeled samples, samples with lower prediction scores. The system annotates the samples with the lower prediction scores.
-
公开(公告)号:US20190354807A1
公开(公告)日:2019-11-21
申请号:US16400376
申请日:2019-05-01
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Samuel Schulter , Kihyuk Sohn , Manmohan Chandraker
Abstract: Systems and methods for domain adaptation for structured output via disentangled representations are provided. The system receives a ground truth of a source domain. The ground truth is used in a task loss function for a first convolutional neural network that predicts at least one output based on inputs from the source domain and a target domain. The system clusters the ground truth of the source domain into a predetermined number of clusters, and predicts, via a second convolutional neural network, a structure of label patches. The structure includes an assignment of each of the at least one output of the first convolutional neural network to the predetermined number of clusters. A cluster loss is computed for the predicted structure of label patches, and an adversarial loss function is applied to the predicted structure of label patches to align the source domain and the target domain on a structural level.
-
公开(公告)号:US10474929B2
公开(公告)日:2019-11-12
申请号:US15906710
申请日:2018-02-27
Applicant: NEC Laboratories America, Inc.
Inventor: Wongun Choi , Samuel Schulter , Kihyuk Sohn , Manmohan Chandraker
Abstract: A system is provided for unsupervised cross-domain image generation relative to a first and second image domain that each include real images. A first generator generates synthetic images similar to real images in the second domain while including a semantic content of real images in the first domain. A second generator generates synthetic images similar to real images in the first domain while including a semantic content of real images in the second domain. A first discriminator discriminates real images in the first domain against synthetic images generated by the second generator. A second discriminator discriminates real images in the second domain against synthetic images generated by the first generator. The discriminators and generators are deep neural networks and respectively form a generative network and a discriminative network in a cyclic GAN framework configured to increase an error rate of the discriminative network to improve synthetic image quality.
-
公开(公告)号:US10474880B2
公开(公告)日:2019-11-12
申请号:US15888629
申请日:2018-02-05
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Kihyuk Sohn , Manmohan Chandraker
Abstract: A face recognition system is provided. The system includes a device configured to capture an input image of a subject. The system further includes a processor. The processor estimates, using a 3D Morphable Model (3DMM) conditioned Generative Adversarial Network, 3DMM coefficients for the subject of the input image. The subject varies from an ideal front pose. The processor produces, using an image generator, a synthetic frontal face image of the subject of the input image based on the input image and the 3DMM coefficients. An area spanning the frontal face of the subject is made larger in the synthetic image than in the input image. The processor provides, using a discriminator, a decision indicative of whether the subject of the synthetic image is an actual person. The processor provides, using a face recognition engine, an identity of the subject in the input image based on the synthetic and input images.
-
公开(公告)号:US20190138855A1
公开(公告)日:2019-05-09
申请号:US16112080
申请日:2018-08-24
Applicant: NEC Laboratories America, Inc.
Inventor: Kihyuk Sohn , Manmohan Chandraker , Xiang Yu
Abstract: A computer-implemented method, system, and computer program product are provided for activity recognition in a surveillance system. The method includes receiving a plurality of unlabeled videos from one or more cameras. The method also includes classifying an activity in each of the plurality of unlabeled videos. The method additionally includes controlling an operation of a processor-based machine to react in accordance with the activity.
-
公开(公告)号:US20190095704A1
公开(公告)日:2019-03-28
申请号:US16145257
申请日:2018-09-28
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Xi Yin , Kihyuk Sohn , Manmohan Chandraker
Abstract: A computer-implemented method, system, and computer program product are provided for facial recognition. The method includes receiving, by a processor device, a plurality of images. The method also includes extracting, by the processor device with a feature extractor utilizing a convolutional neural network (CNN) with an enlarged intra-class variance of long-tail classes, feature vectors for each of the plurality of images. The method additionally includes generating, by the processor device with a feature generator, discriminative feature vectors for each of the feature vectors. The method further includes classifying, by the processor device utilizing a fully connected classifier, an identity from the discriminative feature vector. The method also includes control an operation of a processor-based machine to react in accordance with the identity.
-
公开(公告)号:US20190066493A1
公开(公告)日:2019-02-28
申请号:US16051924
申请日:2018-08-01
Applicant: NEC Laboratories America, Inc.
Inventor: Kihyuk Sohn , Luan Tran , Xiang Yu , Manmohan Chandraker
Abstract: Systems and methods for performing domain adaptation include collecting a labeled source image having a view of an object. Viewpoints of the object in the source image are synthesized to generate view augmented source images. Photometrics of each of the viewpoints of the object are adjusted to generate lighting and view augmented source images. Features are extracted from each of the lighting and view augmented source images with a first feature extractor and from captured images captured by an image capture device with a second feature extractor. The extracted features are classified using domain adaptation with domain adversarial learning between extracted features of the captured images and extracted features of the lighting and view augmented source images. Labeled target images are displayed corresponding to each of the captured images including labels corresponding to classifications of the extracted features of the captured images.
-
公开(公告)号:US20180307947A1
公开(公告)日:2018-10-25
申请号:US15906710
申请日:2018-02-27
Applicant: NEC Laboratories America, Inc.
Inventor: Wongun Choi , Samuel Schulter , Kihyuk Sohn , Manmohan Chandraker
CPC classification number: G06K9/6259 , G06K9/03 , G06K9/6201 , G06K9/66 , G06N3/0454 , G06N3/0472 , G06N3/08 , G06T11/001 , G06T11/60
Abstract: A system is provided for unsupervised cross-domain image generation relative to a first and second image domain that each include real images. A first generator generates synthetic images similar to real images in the second domain while including a semantic content of real images in the first domain. A second generator generates synthetic images similar to real images in the first domain while including a semantic content of real images in the second domain. A first discriminator discriminates real images in the first domain against synthetic images generated by the second generator. A second discriminator discriminates real images in the second domain against synthetic images generated by the first generator. The discriminators and generators are deep neural networks and respectively form a generative network and a discriminative network in a cyclic GAN framework configured to increase an error rate of the discriminative network to improve synthetic image quality.
-
39.
公开(公告)号:US20180268266A1
公开(公告)日:2018-09-20
申请号:US15889958
申请日:2018-02-06
Applicant: NEC Laboratories America, Inc.
Inventor: Kihyuk Sohn , Xiang Yu , Manmohan Chandraker
CPC classification number: G06K9/66 , G06K9/00268 , G06K9/00288 , G06K9/00718 , G06K9/00744 , G06K9/00771 , G06K9/4628 , G06K9/6201 , G06K9/6217 , G06K9/6262 , G06K9/6274 , G06K2009/00738 , G06N3/02 , G06N3/0454 , G06N3/08 , G06N3/088 , G06N20/00 , G06T7/70 , G06T9/002 , G06T2207/20081 , G08B13/196 , G08B13/19613
Abstract: A surveillance system is provided that includes a device configured to capture a video sequence, formed from a set of unlabeled testing video frames, of a target area. The surveillance system further includes a processor configured to pre-train a recognition engine formed from a reference set of CNNs on a still image domain that includes labeled training still image frames. The processor adapts the recognition engine to a video domain to form an adapted recognition engine, by applying a non-reference set of CNNs to domains including the still image and video domains and a degraded image domain. The degraded image domain includes labeled synthetically degraded versions of the frames included in the still image domain. The video domain includes random unlabeled training video frames. The processor recognizes, using the adapted engine, at least one object in the target area. A display device displays the recognized objects.
-
40.
公开(公告)号:US20180268222A1
公开(公告)日:2018-09-20
申请号:US15890005
申请日:2018-02-06
Applicant: NEC Laboratories America, Inc.
Inventor: Kihyuk Sohn , Xiang Yu , Manmohan Chandraker
CPC classification number: G06K9/66 , G06K9/00268 , G06K9/00288 , G06K9/00718 , G06K9/00744 , G06K9/00771 , G06K9/4628 , G06K9/6201 , G06K9/6217 , G06K9/6262 , G06K9/6274 , G06K2009/00738 , G06N3/02 , G06N3/0454 , G06N3/08 , G06N3/088 , G06N20/00 , G06T7/70 , G06T9/002 , G06T2207/20081 , G08B13/196 , G08B13/19613
Abstract: An action recognition system is provided that includes a device configured to capture a video sequence formed from a set of unlabeled testing video frames. The system further includes a processor configured to pre-train a recognition engine formed from a reference set of CNNs on a still image domain that includes labeled training still image frames. The processor adapts the recognition engine to a video domain to form an adapted engine, by applying non-reference CNNs to domains that include the still image and video domains and a degraded image domain that includes labeled synthetically degraded versions of the frames in the still image domain. The video domain includes random unlabeled training video frames. The processor recognizes, using the adapted engine, an action performed by at least one object in the sequence, and controls a device to perform a response action in response to an action type of the action.
-
-
-
-
-
-
-
-
-