-
公开(公告)号:US20200097757A1
公开(公告)日:2020-03-26
申请号:US16580199
申请日:2019-09-24
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Kai Li , Bing Bai , Hans Peter Graf
Abstract: A computer-implemented method and system are provided for training a model for New Class Categorization (NCC) of a test image. The method includes decoupling, by a hardware processor, a feature extraction part from a classifier part of a deep classification model by reparametrizing learnable weight variables of the classifier part as a combination of learnable variables of the feature extraction part and of a classification weight generator of the classifier part. The method further includes training, by the hardware processor, the deep classification model to obtain a trained deep classification model by (i) learning the feature extraction part as a multiclass classification task, and (ii) episodically training the classifier part by learning a classification weight generator which outputs classification weights given a training image.
-
公开(公告)号:US10474951B2
公开(公告)日:2019-11-12
申请号:US15271589
申请日:2016-09-21
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Huahua Wang , Asim Kadav
Abstract: Methods and systems for training a neural network include sampling multiple local sub-networks from a global neural network. The local sub-networks include a subset of neurons from each layer of the global neural network. The plurality of local sub-networks are trained at respective local processing devices to produce trained local parameters. The trained local parameters from each local sub-network are averaged to produce trained global parameters.
-
公开(公告)号:US10402658B2
公开(公告)日:2019-09-03
申请号:US15794802
申请日:2017-10-26
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Yunchen Pu
IPC: G06K9/00 , G06K9/46 , G06N3/04 , G06K9/66 , H04N5/278 , G06K9/62 , H04N21/218 , H04N21/234 , H04N21/488 , G06K9/72 , H04N7/18
Abstract: A video retrieval system is provided, that includes a set of servers, configured to retrieve a video sequence from a database and forward it to a requesting device responsive to a match between an input text and a caption for the video sequence. The servers are further configured to translate the video sequence into the caption by (A) applying a C3D to image frames of the video sequence to obtain therefor (i) intermediate feature representations across L convolutional layers and (ii) top-layer features, (B) producing a first word of the caption for the video sequence by applying the top-layer features to a LSTM, and (C) producing subsequent words of the caption by (i) dynamically performing spatiotemporal attention and layer attention using the representations to form a context vector, and (ii) applying the LSTM to the context vector, a previous word of the caption, and a hidden state of the LSTM.
-
公开(公告)号:US20190244513A1
公开(公告)日:2019-08-08
申请号:US16248897
申请日:2019-01-16
Applicant: NEC Laboratories America, Inc.
Inventor: Alexandru Niculescu-Mizil , Renqiang Min , Eric Cosatto , Farley Lai , Hans Peter Graf , Xavier Fontaine
CPC classification number: G08B29/186 , G06K9/4604 , G06K9/6256 , G06T3/403 , G06T7/0004 , G06T7/001 , G06T2207/20081 , G06T2207/20084
Abstract: A false alarm reduction system and method are provided for reducing false alarms in an automatic defect detection system. The false alarm reduction system includes a defect detection system, generating a list of image boxes marking detected potential defects in an input image. The false alarm reduction system further includes a feature extractor, transforming each of the image boxes in the list into a respective set of numerical features. The false alarm reduction system also includes a classifier, computing as a classification outcome for the each of the image boxes whether the detected potential defect is a true defect or a false alarm responsive to the respective set of numerical features for each of the image boxes.
-
35.
公开(公告)号:US20190244337A1
公开(公告)日:2019-08-08
申请号:US16248955
申请日:2019-01-16
Applicant: NEC Laboratories America, Inc.
Inventor: Alexandru Niculescu-Mizil , Renqiang Min , Eric Cosatto , Farley Lai , Hans Peter Graf , Xavier Fontaine
CPC classification number: G08B29/186 , G06K9/4604 , G06K9/6256 , G06T3/403 , G06T7/0004 , G06T7/001 , G06T2207/20081 , G06T2207/20084
Abstract: A false alarm reduction system is provided that includes a processor cropping each input image at randomly chosen positions to form cropped images of a same size at different scales in different contexts. The system further includes a CONDA-GMM, having a first and a second conditional deep autoencoder for respectively (i) taking each cropped image without a respective center block as input for measuring a discrepancy between a reconstructed and a target center block, and (ii) taking an entirety of cropped images with the target center block. The CONDA-GMM constructs density estimates based on reconstruction error features and low-dimensional embedding representations derived from image encodings. The processor determines an anomaly existence based on a prediction of a likelihood of the anomaly existing in a framework of a CGMM, given the context being a representation of the cropped image with the center block removed and having a discrepancy above a threshold.
-
公开(公告)号:US10296793B2
公开(公告)日:2019-05-21
申请号:US15479408
申请日:2017-04-05
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Yang Gao , Eric Cosatto
Abstract: A method, a computer program product, and a system are provided for video based action recognition. The system includes a processor. One or more frames from one or more video sequences are received. A feature vector for each patch of the one or more frames is generated using a deep convolutional neural network. An attention factor for the feature vectors is generated based on a within-frame attention and a between-frame attention. A target action is identified using a multi-layer deep long short-term memory process applied to the attention factor, said target action representing at least one of the one or more video sequences. An operation of a processor-based machine is controlled to change a state of the processor-based machine, responsive to the at least one of the one or more video sequences including the identified target action.
-
公开(公告)号:US20190122655A1
公开(公告)日:2019-04-25
申请号:US16163988
申请日:2018-10-18
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Dinghan Shen
Abstract: A computer-implemented method, computer program product, and computer processing system are provided for word embedding. The method includes receiving, by a processor device, a word embedding matrix. The method further includes generating, by a processor device, an average pooling vector and a max pooling vector, based on the word embedding matrix. The method also includes generating, by the processor device, a prediction by applying a Multi-Layer Perceptron (MLP) to the average pooling vector and the max pooling vector.
-
38.
公开(公告)号:US20180121734A1
公开(公告)日:2018-05-03
申请号:US15794758
申请日:2017-10-26
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Yunchen Pu
CPC classification number: G06K9/00751 , G06K9/00718 , G06K9/00758 , G06K9/00771 , G06K9/00973 , G06K9/4628 , G06K9/6257 , G06K9/6277 , G06K9/66 , G06K9/726 , G06K2009/00738 , G06N3/0445 , G06N3/0454 , G06N3/08 , H04N5/278 , H04N7/181 , H04N7/183 , H04N21/2181 , H04N21/23418 , H04N21/4884
Abstract: A system is provided for video captioning. The system includes a processor. The processor is configured to apply a three-dimensional Convolutional Neural Network (C3D) to image frames of a video sequence to obtain, for the video sequence, (i) intermediate feature representations across L convolutional layers and (ii) top-layer features. The processor is further configured to produce a first word of an output caption for the video sequence by applying the top-layer features to a Long Short Term Memory (LSTM). The processor is further configured to produce subsequent words of the output caption by (i) dynamically performing spatiotemporal attention and layer attention using the intermediate feature representations to form a context vector, and (ii) applying the LSTM to the context vector, a previous word of the output caption, and a hidden state of the LSTM. The system further includes a display device for displaying the output caption to a user.
-
公开(公告)号:US20170294091A1
公开(公告)日:2017-10-12
申请号:US15479430
申请日:2017-04-05
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Yang Gao , Eric Cosatto
CPC classification number: G06K9/00711 , G06K9/4628 , G06K9/6274 , G06K2009/00738 , G06N3/0427 , G06N3/0445 , G06N3/0454 , G06N3/08 , G06N5/045 , G08B13/19613 , G08B13/19695 , H04N5/77 , H04N7/183
Abstract: A video monitoring system and method are provided. The video monitoring system includes a camera. The camera is positioned to monitor an area and capture live video to provide a live video stream. The video monitoring system also includes a security processing system. The security processing system includes a processor and memory coupled to the processor. The security processing system is programmed to detect and identify a target action sequence in the live video stream using a multi-layer deep long short-term memory process on are attention factor that is based on an within-frame attention and an between-frame attention. The security processing system is further programmed to trigger an action to alert that a target action sequence has been detected.
-
公开(公告)号:US09183503B2
公开(公告)日:2015-11-10
申请号:US13908715
申请日:2013-06-03
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Yanjun Qi
IPC: G06N5/02
CPC classification number: G06N5/025
Abstract: Systems and methods are provided for identifying combinatorial feature interactions, including capturing statistical dependencies between categorical variables, with the statistical dependencies being stored in a computer readable storage medium. A model is selected based on the statistical dependencies using a neighborhood estimation strategy, with the neighborhood estimation strategy including generating sets of arbitrarily high-order feature interactions using at least one rule forest and optimizing one or more likelihood functions. A damped mean-field approach is applied to the model to obtain parameters of a Markov random field (MRF); a sparse high-order semi-restricted MRF is produced by adding a hidden layer to the MRF; indirect long-range dependencies between feature groups are modeled using the sparse high-order semi-restricted MRF; and a combinatorial dependency structure between variables is output.
Abstract translation: 提供了用于识别组合特征交互的系统和方法,包括捕获分类变量之间的统计依赖性,并将统计依赖性存储在计算机可读存储介质中。 基于使用邻域估计策略的统计依赖性来选择模型,邻域估计策略包括使用至少一个规则林生成任意高阶特征交互的集合并且优化一个或多个似然函数。 将阻尼平均场方法应用于模型以获得马尔可夫随机场(MRF)的参数; 通过向MRF添加隐藏层来产生稀疏高阶半限制MRF; 特征组之间的间接长程依赖关系使用稀疏高阶半限制MRF进行建模; 并输出变量之间的组合依赖结构。
-
-
-
-
-
-
-
-
-