Patent search ap:("NEC Laboratories America Page Inc.") AND inv:"Asim Kadav"

1.

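Granted invention patent
Keypoint based action localization [In force]

Publication (announcement) number: US12198397B2

Publication (announcement) date: 2025-01-14

Application number: US17586284

Application date: 2022-01-27

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Farley Lai , Hans Peter Graf , Yi Huang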

IPC: G06V10/26 , G06T7/246 , G06V10/44 , G06V10/82 , G06V20/58 , G06V40/10 , G08G1/16

Abstract: A computer-implemented method is provided for action localization. The method includes converting one or more video frames into person keypoints and object keypoints. The method further includes embedding position, timestamp, instance, and type information with the person keypoints and object keypoints to obtain keypoint embeddings. The method also includes predicting, by a hierarchical transformer encoder using the keypoint embeddings, human actions and bounding box information of when and where the human actions occur in the one or more video frames.

2.

Granted invention patent
Semi-automatic data collection and association for multi-camera tracking [In force]

Publication (announcement) number: US12131489B2

Publication (announcement) date: 2024-10-29

Application number: US17741735

Application date: 2022-05-11

Applicant: NEC Laboratories America, Inc.

Inventor: Farley Lai , Asim Kadav , Likitha Lakshminarayanan

IPC: G06T7/292 , G06T7/223 , G06T7/246 , G06V20/52

CPC classification number: G06T7/292 , G06T7/223 , G06T7/248 , G06V20/52 , G06T2200/24 , G06T2207/10016 , G06T2207/20081 , G06T2207/30196 , G06T2207/30232

Abstract: A surveillance system is provided. The surveillance system is configured for (i) detecting and tracking persons locally for each camera input video stream using common area anchor boxes and assigning each detected person a local track id, (ii) associating a same person in overlapping camera views to a global track id, and collecting associated track boxes as the same person moves in different camera views over time using a priority queue, the local track id, and the global track id, (iii) performing track data collection to derive a spatial transformation through matched track box spatial features of a same person over time for scene coverage, and (iv) learning a multi-camera tracker given visual features from matched track boxes of distinct people across cameras based on the derived spatial transformation.

3.

Invention publication
COMPOSITIONAL REASONING OF GROUP ACTIVITY IN VIDEOS WITH KEYPOINT-ONLY MODALITY [Under examination, published]

Publication (announcement) number: US20230148017A1

Publication (announcement) date: 2023-05-11

Application number: US17960370

Application date: 2022-10-05

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Farley Lai , Hans Peter Graf , Honglu Zhou

IPC: G06V20/40 , G06V40/10 , G06V10/77 , G06V10/774

CPC classification number: G06V20/41 , G06V10/774 , G06V10/7715 , G06V20/46 , G06V20/49 , G06V40/10

Abstract: A method for compositional reasoning of group activity in videos with keypoint-only modality is presented. The method includes obtaining video frames from a video stream received from a plurality of video image capturing devices, extracting keypoints of all persons detected in the video frames to define keypoint data, tokenizing the keypoint data with time and segment information, clustering groups of keypoint persons in the video frames and passing the clustering groups through multi-scale prediction, and performing a prediction to provide a group activity prediction of a scene in the video frames.

4.

Granted invention patent
Communication efficient sparse-reduce in distributed machine learning [In force]

Publication (announcement) number: US11356334B2

Publication (announcement) date: 2022-06-07

Application number: US15980243

Application date: 2018-05-15

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Erik Kruus

IPC: G06F17/00 , H04L41/16 , G06F9/52 , H04L41/12 , G06N20/00 , G06F15/76

Abstract: A method is provided for sparse communication in a parallel machine learning environment. The method includes determining a fixed communication cost for a sparse graph to be computed. The sparse graph is (i) determined from a communication graph that includes all the machines in a target cluster of the environment, and (ii) represents a communication network for the target cluster having (a) an overall spectral gap greater than or equal to a minimum threshold, and (b) certain information dispersal properties such that an intermediate output from a given node disperses to all other nodes of the sparse graph in the lowest number of time steps given other possible node connections. The method further includes computing the sparse graph, based on the communication graph and the fixed communication cost. The method also includes initiating a propagation of the intermediate output in the parallel machine learning environment using a topology of the sparse graph.
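
The dispersal property named in this abstract can be illustrated with a short sketch. It is not the patented method: it only measures, by breadth-first search, how many time steps an output at one node needs to reach every other node of a candidate topology. The helper names `dispersal_steps`, `ring`, and `ring_with_chords` are hypothetical, and the two topologies are arbitrary examples chosen for comparison.

```python
from collections import deque
from typing import Dict, List


def dispersal_steps(adj: Dict[int, List[int]], source: int) -> int:
    """Time steps for an output at `source` to reach every node (BFS depth)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in adj[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    if len(dist) != len(adj):
        raise ValueError("graph is not connected from the source")
    return max(dist.values())


def ring(n: int) -> Dict[int, List[int]]:
    """Hypothetical baseline: each node talks to its two ring neighbours."""
    return {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}


def ring_with_chords(n: int) -> Dict[int, List[int]]:
    """Hypothetical alternative: the ring plus one link to the opposite node."""
    adj = ring(n)
    for i in range(n):
        adj[i].append((i + n // 2) % n)
    return adj
```

For an 8-node cluster, `dispersal_steps(ring(8), 0)` is 4, while `dispersal_steps(ring_with_chords(8), 0)` is 2, at the cost of one extra link per node.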

5.

Granted invention patent
Memory efficient scalable deep learning with model parallelization [In force]

Publication (announcement) number: US10474951B2

Publication (announcement) date: 2019-11-12

Application number: US15271589

Application date: 2016-09-21

Applicant: NEC Laboratories America, Inc.

Inventor: Renqiang Min , Huahua Wang , Asim Kadav

IPC: G06N3/08 , G06F17/16 , G06N3/04

Abstract: Methods and systems for training a neural network include sampling multiple local sub-networks from a global neural network. The local sub-networks include a subset of neurons from each layer of the global neural network. The plurality of local sub-networks are trained at respective local processing devices to produce trained local parameters. The trained local parameters from each local sub-network are averaged to produce trained global parameters.
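
The sample-then-average flow in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed implementation: local training is omitted, parameters are reduced to one float per neuron, and each global parameter is averaged only over the sub-networks that contained it. The names `sample_subnetwork` and `average_local_parameters` are hypothetical.

```python
import random
from typing import Dict, List


def sample_subnetwork(layer_sizes: List[int], fraction: float,
                      rng: random.Random) -> List[List[int]]:
    """Pick a subset of neuron indices from every layer of the global network."""
    subnet = []
    for size in layer_sizes:
        k = max(1, int(size * fraction))  # keep at least one neuron per layer
        subnet.append(sorted(rng.sample(range(size), k)))
    return subnet


def average_local_parameters(local_params: List[Dict[int, float]],
                             global_size: int,
                             default: float = 0.0) -> List[float]:
    """Average each parameter over the local sub-networks that trained it.

    Assumption: parameters never sampled keep `default`.
    """
    totals = [0.0] * global_size
    counts = [0] * global_size
    for params in local_params:
        for idx, value in params.items():
            totals[idx] += value
            counts[idx] += 1
    return [totals[i] / counts[i] if counts[i] else default
            for i in range(global_size)]
```

For example, averaging the local results `{0: 1.0, 2: 3.0}` and `{0: 3.0, 1: 4.0}` into a 4-parameter global model yields `[2.0, 4.0, 3.0, 0.0]`.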

6.

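Invention application
FACE RECOGNITION USING STAGE-WISE MINI BATCHING TO IMPROVE CACHE UTILIZATION [Under examination, published]

Publication (announcement) number: US20180060240A1

Publication (announcement) date: 2018-03-01

Application number: US15678889

Application date: 2017-08-16

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Farley Lai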

IPC: G06F12/0875 , G06K9/00 , G06K9/66 , G06N99/00 , G06N3/02

CPC classification number: G06N3/084 , G06F12/0875 , G06F2212/455 , G06K9/00255 , G06K9/00288 , G06K9/00986 , G06K9/66 , G06N3/02 , G06N3/063 , G06N20/00 , G06T1/20 , H04L41/16 , H04L67/2842

Abstract: A face recognition system and method for face recognition are provided. The face recognition system includes a camera for capturing an input image of a face of a person to be recognized. The face recognition system further includes a cache. The face recognition system further includes a set of one or more processors configured to (i) improve a utilization of the cache by the one or more processors during multiple training stages of a neural network configured to perform face recognition, by performing a stage-wise mini-batch process on a set of samples used for the multiple training stages, and (ii) recognize the person by applying the neural network to the input image during a recognition stage. The stage-wise mini-batch process waits for each of the multiple training stages to complete using a system wait primitive to improve the utilization of the cache.

7.

Invention application
SECURITY SYSTEM USING A CONVOLUTIONAL NEURAL NETWORK WITH PRUNED FILTERS [Under examination, published]

Publication (announcement) number: US20170337467A1

Publication (announcement) date: 2017-11-23

Application number: US15590666

Application date: 2017-05-09

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Igor Durdanovic , Hans Peter Graf , Hao Li

IPC: G06N3/04 , G06K9/62 , G06F17/17 , G06K9/66 , G06F17/15

CPC classification number: G06N3/082 , G06F17/153 , G06F17/17 , G06K9/00771 , G06K9/4628 , G06K9/6228 , G06K9/627 , G06K9/6296 , G06K9/66 , G06K2009/00738 , G06N3/0427 , G06N3/0454 , G06N3/0481 , G06N5/045 , G08B13/00 , G08B29/186 , H03H2222/04

Abstract: Security systems and methods for detecting intrusion events include one or more sensors configured to monitor an environment. A pruned convolutional neural network (CNN) is configured to process information from the one or more sensors to classify events in the monitored environment. CNN filters having the smallest summed weights have been pruned from the pruned CNN. An alert module is configured to detect an intrusion event in the monitored environment based on event classifications. A control module is configured to perform a security action based on the detection of an intrusion event.
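
The pruning criterion in this abstract ("filters having the smallest summed weights") can be sketched as below. This is a minimal illustration, not the patented system: it assumes the sum is taken over absolute weight values, treats each filter as a single 2-D kernel, and the names `summed_abs_weight` and `prune_filters` are hypothetical.

```python
from typing import List, Sequence

Kernel = Sequence[Sequence[float]]  # one 2-D kernel, for illustration only


def summed_abs_weight(kernel: Kernel) -> float:
    """Sum of absolute weights of one filter (assumed ranking criterion)."""
    return sum(abs(w) for row in kernel for w in row)


def prune_filters(filters: List[Kernel], num_to_prune: int) -> List[int]:
    """Return the indices of the filters to keep, in their original order."""
    if not 0 <= num_to_prune <= len(filters):
        raise ValueError("num_to_prune must be between 0 and len(filters)")
    # Rank filter indices from smallest to largest summed absolute weight.
    ranked = sorted(range(len(filters)),
                    key=lambda i: summed_abs_weight(filters[i]))
    pruned = set(ranked[:num_to_prune])
    return [i for i in range(len(filters)) if i not in pruned]
```

With four filters whose summed absolute weights are 4, 0.2, 8, and 0.05, pruning two of them keeps the filters at indices 0 and 2.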

8.

Granted invention patent
Multi-hop transformer for spatio-temporal reasoning and localization [In force]

Publication (announcement) number: US11741712B2

Publication (announcement) date: 2023-08-29

Application number: US17463757

Application date: 2021-09-01

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Farley Lai , Hans Peter Graf , Alexandru Niculescu-Mizil , Renqiang Min , Honglu Zhou

IPC: G06V20/40 , G06T7/73 , G06T7/246 , G06F18/213 , G06N3/045

CPC classification number: G06V20/41 , G06F18/213 , G06N3/045 , G06T7/246 , G06T7/73 , G06V20/46 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06V2201/07

Abstract: A method for using a multi-hop reasoning framework to perform multi-step compositional long-term reasoning is presented. The method includes extracting feature maps and frame-level representations from a video stream by using a convolutional neural network (CNN), performing object representation learning and detection, linking objects through time via tracking to generate object tracks and image feature tracks, feeding the object tracks and the image feature tracks to a multi-hop transformer that hops over frames in the video stream while concurrently attending to one or more of the objects in the video stream until the multi-hop transformer arrives at a correct answer, and employing video representation learning and recognition from the objects and image context to locate a target object within the video stream.

9.

Invention application
SELF-SUPERVISED MULTIMODAL REPRESENTATION LEARNING WITH CASCADE POSITIVE EXAMPLE MINING [In force]

Publication (announcement) number: US20230086023A1

Publication (announcement) date: 2023-03-23

Application number: US17940599

Application date: 2022-09-08

Applicant: NEC Laboratories America, Inc.

Inventor: Farley Lai , Asim Kadav , Cheng-En Wu

IPC: G06V20/40 , G06V10/771

Abstract: A method for model training and deployment includes training, by a processor, a model to learn video representations with a self-supervised contrastive loss by performing progressive training in phases with an incremental number of positive instances from one or more video sequences, resetting the learning rate schedule in each of the phases, and inheriting model weights from a checkpoint from a previous training phase. The method further includes updating the trained model with the self-supervised contrastive loss given multiple positive instances obtained from Cascade K-Nearest Neighbor mining of the one or more video sequences by extracting features in different modalities to compute similarities between the one or more video sequences and selecting the top-k similar instances with features in different modalities. The method also includes fine-tuning the trained model for a downstream task. The method additionally includes deploying the trained model for a target application inference for the downstream task.
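
One way to read the cascade mining step in this abstract is as successive top-k filtering, one modality at a time. The sketch below takes that reading as an assumption; the abstract does not specify the similarity measure or the cascade order, so cosine similarity, the per-stage `ks` values, and the names `cosine`, `top_k`, and `cascade_positives` are all hypothetical.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def top_k(query: int, candidates: List[int], feats: List[Vector],
          k: int) -> List[int]:
    """Keep the k candidates most similar to the query in one modality."""
    ranked = sorted(candidates,
                    key=lambda i: cosine(feats[query], feats[i]),
                    reverse=True)
    return ranked[:k]


def cascade_positives(query: int, modality_feats: List[List[Vector]],
                      ks: List[int]) -> List[int]:
    """Narrow the candidate set modality by modality (assumed cascade)."""
    candidates = [i for i in range(len(modality_feats[0])) if i != query]
    for feats, k in zip(modality_feats, ks):
        candidates = top_k(query, candidates, feats, k)
    return candidates
```

Each stage only re-ranks the survivors of the previous one, so a clip is returned as a positive only if it is similar to the query in every modality.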

10.

Invention application
KEYPOINT BASED ACTION LOCALIZATION [In force]

Publication (announcement) number: US20220237884A1

Publication (announcement) date: 2022-07-28

Application number: US17586284

Application date: 2022-01-27

Applicant: NEC Laboratories America, Inc.

Inventor: Asim Kadav , Farley Lai , Hans Peter Graf , Yi Huang

IPC: G06V10/26 , G06T7/246 , G06V40/10 , G06V10/44 , G06V10/82 , G06V20/58 , G08G1/16

Abstract: A computer-implemented method is provided for action localization. The method includes converting one or more video frames into person keypoints and object keypoints. The method further includes embedding position, timestamp, instance, and type information with the person keypoints and object keypoints to obtain keypoint embeddings. The method also includes predicting, by a hierarchical transformer encoder using the keypoint embeddings, human actions and bounding box information of when and where the human actions occur in the one or more video frames.
