-
公开(公告)号:US20250148911A1
公开(公告)日:2025-05-08
申请号:US18933054
申请日:2024-10-31
Applicant: NEC Laboratories America, Inc.
Inventor: Manmohan Chandraker , Francesco Pittaluga , Bingbing Zhuang , Wei-Jer Chang
IPC: G08G1/0967 , G06N20/00 , G08G1/01 , G08G1/16
Abstract: Methods and systems include determining actions for agents in a driving scenario using a diffusion model, based on individual controllable behavior patterns for the agents. A state of the driving scenario is updated based on the determined actions for the plurality of agents. The determination of actions and the update of the state are repeated in a closed-loop fashion to generate simulated trajectories for the plurality of agents. A planner model is trained to select actions for an operating agent based on the simulated trajectories.
-
公开(公告)号:US20250148736A1
公开(公告)日:2025-05-08
申请号:US18924258
申请日:2024-10-23
Applicant: NEC Laboratories America, Inc.
Inventor: Bingbing Zhuang , Ziyu Jiang , Manmohan Chandraker , Shanlin Sun
Abstract: A computer-implemented method for synthesizing an image includes extracting agent neural radiance fields (NeRFs) from driving video logs and storing agent NeRFs in a database. For a driving video log to be edited, a scene NeRF and agent NeRFs are extracted from the driving video log to be edited. One or more agent NeRFs are selected from the database to insert into or replace existing agents in a traffic scene of the driving video log based on photorealism criteria. The traffic scene is edited by inserting a selected agent NeRF into the traffic scene, replacing existing agents in the traffic scene with the selected agent NeRF, or removing one or more existing agents from the traffic scene. An image of the edited traffic scene is synthesized by composing edited agent NeRFs with the scene NeRF and performing volume rendering.
-
公开(公告)号:US20250145176A1
公开(公告)日:2025-05-08
申请号:US18934817
申请日:2024-11-01
Applicant: NEC Laboratories America, Inc.
Inventor: Manmohan Chandraker , Francesco Pittaluga , Vijay Kumar Baikampady Gopalkrishna , Sharan Satish Prema
Abstract: Methods and systems for operating a vehicle include prompting a large language model LLM to generate parameters for a rule-based planner based on historical data for vehicles in a road scene. A trajectory is generated using the parameters. A driving action is performed to implement the trajectory.
-
公开(公告)号:US20240378454A1
公开(公告)日:2024-11-14
申请号:US18659738
申请日:2024-05-09
Applicant: NEC Laboratories America, Inc.
IPC: G06N3/096
Abstract: Systems and methods for optimizing models for open-vocabulary detection. Region proposals can be obtained by employing a pre-trained vision-language model and a pre-trained region proposal network. Object feature predictions can be obtained by employing a trained teacher neural network with the region proposals. Object feature predictions can be filtered above a threshold to obtain pseudo labels. A student neural network with a split-and-fusion detection head can be trained by utilizing the region proposals, base ground truth class labels and the pseudo labels. The pseudo labels can be optimized by reducing the noise from the pseudo labels by employing the trained split-and-fusion detection head of the trained student neural network to obtain optimized object detections. An action can be performed relative to a scene layout based on the optimized object detections.
-
公开(公告)号:US20240354583A1
公开(公告)日:2024-10-24
申请号:US18615535
申请日:2024-03-25
Applicant: NEC Laboratories America, Inc.
Inventor: Sparsh Garg , Samuel Schulter , Bingbing Zhuang , Manmohan Chandraker
IPC: G06N3/0895
CPC classification number: G06N3/0895
Abstract: Methods and systems for training a model include annotating a subset of an unlabeled training dataset, that includes images of road scenes, with labels. A road defect detection model is iteratively trained, including adding pseudo-labels to a remainder of examples from the unlabeled training dataset and training the road defect detection model based on the labels and the pseudo-labels.
-
公开(公告)号:US20240037188A1
公开(公告)日:2024-02-01
申请号:US18484839
申请日:2023-10-11
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Xiang Yu , Bingbing Zhuang , Manmohan Chandraker , Donghyun Kim
IPC: G06F18/213 , G06N3/08 , G06V10/75 , G06F18/22 , G06F18/214
CPC classification number: G06F18/213 , G06N3/08 , G06V10/751 , G06F18/22 , G06F18/2155
Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.
-
公开(公告)号:US11710346B2
公开(公告)日:2023-07-25
申请号:US17330832
申请日:2021-05-26
Applicant: NEC Laboratories America, Inc.
Inventor: Manmohan Chandraker , Ting Wang , Xiang Xu , Francesco Pittaluga , Gaurav Sharma , Yi-Hsuan Tsai , Masoud Faraki , Yuheng Chen , Yue Tian , Ming-Fang Huang , Jian Fang
IPC: G06V40/16 , G06T3/00 , G06V10/774
CPC classification number: G06V40/172 , G06T3/0006 , G06V10/774 , G06V40/171
Abstract: Methods and systems for training a neural network include generate an image of a mask. A copy of an image is generated from an original set of training data. The copy is altered to add the image of a mask to a face detected within the copy. An augmented set of training data is generated that includes the original set of training data and the altered copy. A neural network model is trained to recognize masked faces using the augmented set of training data.
-
公开(公告)号:US20230196122A1
公开(公告)日:2023-06-22
申请号:US17899913
申请日:2022-08-31
Applicant: NEC Laboratories America, Inc.
Inventor: Yumin Suh , Samuel Schulter , Xiang Yu , Masoud Faraki , Manmohan Chandraker , Dripta Raychaudhuri
IPC: G06N3/0985
CPC classification number: G06N3/0985
Abstract: Systems and methods for generating a hypernetwork configured to be trained for a plurality of tasks; receiving a task preference vector identifying a hierarchical priority for the plurality of tasks, and a resource constraint as a tuple; finding tree sub-structures and the corresponding modulation of features for every tuple within an N-stream anchor network; optimizing a branching regularized loss function to train an edge hypernet; and training a weight hypernet, keeping the anchor net and the edge hypernet fixed.
-
公开(公告)号:US11604945B2
公开(公告)日:2023-03-14
申请号:US17128535
申请日:2020-12-21
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Kihyuk Sohn , Buyu Liu , Manmohan Chandraker , Jong-Chyi Su
IPC: G06K9/00 , G06K9/62 , B60W30/095 , B60W30/09 , B60W10/20 , B60W10/18 , B60W50/00 , G08G1/16 , G06N3/08 , G06V10/25 , G06V20/58 , G06V20/56
Abstract: Systems and methods for lane marking and road sign recognition are provided. The system aligns image level features between a source domain and a target domain based on an adversarial learning process while training a domain discriminator. The target domain includes one or more road scenes having lane markings and road signs. The system selects, using the domain discriminator, unlabeled samples from the target domain that are far away from existing annotated samples from the target domain. The system selects, based on a prediction score of each of the unlabeled samples, samples with lower prediction scores. The system annotates the samples with the lower prediction scores.
-
公开(公告)号:US11604943B2
公开(公告)日:2023-03-14
申请号:US16400376
申请日:2019-05-01
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Samuel Schulter , Kihyuk Sohn , Manmohan Chandraker
Abstract: Systems and methods for domain adaptation for structured output via disentangled representations are provided. The system receives a ground truth of a source domain. The ground truth is used in a task loss function for a first convolutional neural network that predicts at least one output based on inputs from the source domain and a target domain. The system clusters the ground truth of the source domain into a predetermined number of clusters, and predicts, via a second convolutional neural network, a structure of label patches. The structure includes an assignment of each of the at least one output of the first convolutional neural network to the predetermined number of clusters. A cluster loss is computed for the predicted structure of label patches, and an adversarial loss function is applied to the predicted structure of label patches to align the source domain and the target domain on a structural level.
-
-
-
-
-
-
-
-
-