-
公开(公告)号:US10474883B2
公开(公告)日:2019-11-12
申请号:US15803292
申请日:2017-11-03
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Kihyuk Sohn , Manmohan Chandraker , Xi Peng
IPC: G06K9/00 , G06N3/04 , G06N3/08 , G08B13/196 , G06K9/62
Abstract: A computer-implemented method, system, and computer program product is provided for pose-invariant facial recognition. The method includes generating, by a processor using a recognition neural network, a rich feature embedding for identity information and non-identity information for each of one or more images. The method also includes generating, by the processor using a Siamese reconstruction network, one or more pose-invariant features by employing the rich feature embedding for identity information and non-identity information. The method additionally includes identifying, by the processor, a user by employing the one or more pose-invariant features. The method further includes controlling an operation of a processor-based machine to change a state of the processor-based machine, responsive to the identified user in the one or more images.
-
公开(公告)号:US10402701B2
公开(公告)日:2019-09-03
申请号:US15889913
申请日:2018-02-06
Applicant: NEC Laboratories America, Inc.
Inventor: Kihyuk Sohn , Xiang Yu , Manmohan Chandraker
IPC: G06K9/00 , G06K9/66 , G06N3/08 , G06N20/00 , G06K9/62 , G06T7/70 , G06T9/00 , G06K9/46 , G06N3/02 , G08B13/196 , G06N3/04
Abstract: A face recognition system is provided that includes a device configured to capture a video sequence formed from a set of unlabeled testing video frames. The system includes a processor configured to pre-train a face recognition engine formed from reference CNNs on a still image domain that includes labeled training still image frames of faces. The processor adapts the face recognition engine to a video domain to form an adapted engine, by applying non-reference CNNs to domains including the still image and video domains and a degraded image domain. The degraded image domain includes labeled synthetically degraded versions of the frames included in the still image domain. The video domain includes random unlabeled training video frames. The processor recognizes, using the adapted engine, identities of persons corresponding to at least one face in the video sequence to obtain a set of identities. A display device displays the set of identities.
-
公开(公告)号:US20190095699A1
公开(公告)日:2019-03-28
申请号:US16145578
申请日:2018-09-28
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Xi Yin , Kihyuk Sohn , Manmohan Chandraker
Abstract: A computer-implemented method, system, and computer program product are provided for facial recognition. The method includes receiving, by a processor device, a plurality of images. The method also includes extracting, by the processor device with a feature extractor utilizing a convolutional neural network (CNN) with an enlarged intra-class variance of long-tail classes, feature vectors for each of the plurality of images. The method additionally includes generating, by the processor device with a feature generator, discriminative feature vectors for each of the feature vectors. The method further includes classifying, by the processor device utilizing a fully connected classifier, an identity from the discriminative feature vector. The method also includes control an operation of a processor-based machine to react in accordance with the identity.
-
公开(公告)号:US20180268201A1
公开(公告)日:2018-09-20
申请号:US15888629
申请日:2018-02-05
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Kihyuk Sohn , Manmohan Chandraker
CPC classification number: G06K9/00288 , G06F16/71 , G06F16/743 , G06F16/784 , G06K9/00201 , G06K9/00208 , G06K9/00214 , G06K9/00255 , G06K9/00275 , G06K9/00771 , G06K9/00899 , G06K9/4628 , G06K9/6256 , G06T19/20 , G06T2210/44
Abstract: A face recognition system is provided. The system includes a device configured to capture an input image of a subject. The system further includes a processor. The processor estimates, using a 3D Morphable Model (3DMM) conditioned Generative Adversarial Network, 3DMM coefficients for the subject of the input image. The subject varies from an ideal front pose. The processor produces, using an image generator, a synthetic frontal face image of the subject of the input image based on the input image and the 3DMM coefficients. An area spanning the frontal face of the subject is made larger in the synthetic image than in the input image. The processor provides, using a discriminator, a decision indicative of whether the subject of the synthetic image is an actual person. The processor provides, using a face recognition engine, an identity of the subject in the input image based on the synthetic and input images.
-
公开(公告)号:US20180129910A1
公开(公告)日:2018-05-10
申请号:US15709748
申请日:2017-09-20
Applicant: NEC Laboratories America, Inc.
Inventor: Muhammad Zeeshan Zia , Quoc-Huy Tran , Xiang Yu , Manmohan Chandraker , Chi Li
CPC classification number: G06K9/6256 , B60T2201/022 , B60W30/00 , G05D1/0221 , G06F17/5009 , G06K9/00201 , G06K9/00208 , G06K9/00624 , G06K9/00771 , G06K9/00805 , G06K9/4628 , G06K9/6255 , G06N3/02 , G06N3/084 , G06T7/55 , G06T7/74 , G06T11/60 , G06T15/10 , G06T15/40 , G06T2207/20101 , G06T2207/30261 , G06T2210/22 , G08G1/0962 , G08G1/166 , H04N7/00
Abstract: A system and method are provided. The system includes an image capture device configured to capture an actual image depicting an object. The system also includes a processor. The processor is configured to render, based on a set of 3D Computer Aided Design (CAD) models, a set of synthetic images with corresponding intermediate shape concept labels. The processor is also configured to form a multi-layer Convolutional Neural Network (CNN) which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is further configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN to the actual image to output an image pair including a 2D geometric structure and a 3D geometric structure of the object depicted in the actual image.
-
公开(公告)号:US20180025242A1
公开(公告)日:2018-01-25
申请号:US15637465
申请日:2017-06-29
Applicant: NEC Laboratories America, Inc. , NEC Hong Kong Limited
Inventor: Manmohan Chandraker , Xiang Yu , Eric Lau , Elsa Wong
CPC classification number: G06F21/32 , G06F21/6218 , G06F2221/2133 , G06K9/00221 , G06K9/00228 , G06K9/00255 , G06K9/00281 , G06K9/00288 , G06K9/00624 , G06K9/00791 , G06K9/00906 , G06K9/4652 , G06K9/66 , G06N99/005 , G07C9/00158 , G07C9/00166 , H04L63/0861 , H04L63/1483
Abstract: A facility access control system and corresponding method are provided. The facility access control system includes a camera configured to capture an input image of a subject attempting to enter or exit a restricted facility. The facility access control system further includes a memory storing a deep learning model configured to perform multi-task learning for a pair of tasks including a liveness detection task and a face recognition task. The facility access control system also includes a processor configured to apply the deep learning model to the input image to recognize an identity of the subject in the input image regarding being authorized for access to the facility and a liveness of the subject. The liveness detection task is configured to evaluate a plurality of different distracter modalities corresponding to different physical spoofing materials to prevent face spoofing for the face recognition task.
-
公开(公告)号:US20240355090A1
公开(公告)日:2024-10-24
申请号:US18639534
申请日:2024-04-18
Applicant: NEC Laboratories America, Inc.
IPC: G06V10/74 , G06F16/532 , G06F40/186 , G06V10/774 , G06V20/60 , G06V20/70
CPC classification number: G06V10/761 , G06F16/532 , G06V10/774 , G06V20/60 , G06V20/70 , G06F40/186
Abstract: Systems and methods are provided for matching one or more images using conditional similarity pseudo-labels, including analyzing an unlabeled dataset of images, accessing a foundational vision-language model trained on a plurality of image-text pairs, and defining a set of attributes each comprising multiple possible values for generating pseudo-labels based on notions of similarity (NoS). Text prompts are generated for each attribute value using a prompt template and encoding the text prompts using a text encoder of the foundational model. Each image in the dataset of images is processed through a vision encoder of the foundational model to obtain visual features, the visual features are compared against encoded text prompts to assign a pseudo-label for each attribute for each image, and a conditional similarity network (CSN) is trained with the pseudo-labeled images to generate a conditional similarity model.
-
公开(公告)号:US20240160927A1
公开(公告)日:2024-05-16
申请号:US18503313
申请日:2023-11-07
Applicant: NEC Laboratories America, Inc.
Inventor: Yumin Suh , Samuel Schulter , Xiang Yu , Abhishek Aich
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Systems and methods for performing multiple tasks with a single artificial intelligence model that can include training a supernet model for an application by splitting the application into tasks, and splitting the supernet model into subnets. The methods and systems can further assign the tasks computing budgets, and match the tasks to subnets by matching the computing budget of the tasks to the computing capacity of the subnets. Further, the methods and systems can perform the tasks with matching subnets to produce parameters that are used by the supernet to perform the application. The supernet combines all of the task to produce a model for the application and the supernet retains weights for the tasks to be used in subsequent applications.
-
公开(公告)号:US20240154784A1
公开(公告)日:2024-05-09
申请号:US18498677
申请日:2023-10-31
Applicant: NEC Laboratories America, Inc.
Inventor: Francesco Pittaluga , Xiang Yu , Salman Khan
CPC classification number: H04L9/002 , G06F21/602 , G06V40/172 , H04L9/0869 , H04N25/10
Abstract: An optical encryption camera includes a sensor array and a filter positioned over the sensor array to receive light prior to the sensor array. The filter includes a multiplexing mask and a scaling mask in sequence. The multiplexing mask and the scaling mask combine to provide an encryption key to encrypt image data prior to capture.
-
公开(公告)号:US20240037187A1
公开(公告)日:2024-02-01
申请号:US18484832
申请日:2023-10-11
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Xiang Yu , Bingbing Zhuang , Manmohan Chandraker , Donghyun Kim
IPC: G06F18/213 , G06N3/08 , G06V10/75 , G06F18/22 , G06F18/214
CPC classification number: G06F18/213 , G06N3/08 , G06V10/751 , G06F18/22 , G06F18/2155
Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.
-
-
-
-
-
-
-
-
-