-
公开(公告)号:US12154188B2
公开(公告)日:2024-11-26
申请号:US17890849
申请日:2022-08-18
Applicant: NVIDIA Corporation
Inventor: Fnu Ratnesh Kumar , Farzin Aghdasi , Parthasarathy Sriram , Edwin Weill
IPC: G06T1/20 , G06F17/18 , G06N3/045 , G06N3/047 , G06N3/08 , G06V10/764 , G06V10/82 , G06V20/52 , G06V20/58
Abstract: In various examples, a neural network may be trained for use in vehicle re-identification tasks—e.g., matching appearances and classifications of vehicles across frames—in a camera network. The neural network may be trained to learn an embedding space such that embeddings corresponding to vehicles of the same identify are projected closer to one another within the embedding space, as compared to vehicles representing different identities. To accurately and efficiently learn the embedding space, the neural network may be trained using a contrastive loss function or a triplet loss function. In addition, to further improve accuracy and efficiency, a sampling technique—referred to herein as batch sample—may be used to identify embeddings, during training, that are most meaningful for updating parameters of the neural network.
-
公开(公告)号:US12087077B2
公开(公告)日:2024-09-10
申请号:US18347471
申请日:2023-07-05
Applicant: NVIDIA Corporation
Inventor: Parthasarathy Sriram , Fnu Ratnesh Kumar , Anil Ubale , Farzin Aghdasi , Yan Zhai , Subhashree Radhakrishnan
CPC classification number: G06V40/103 , G06N3/045 , G06N3/08 , G06T7/248 , G06V10/26 , G06V10/454 , G06V10/82 , G06V20/52 , G06V40/10 , G06T2207/10016 , G06T2207/20084 , G06T2207/30196 , G06T2207/30232 , G06T2207/30241
Abstract: In various examples, sensor data—such as masked sensor data—may be used as input to a machine learning model to determine a confidence for object to person associations. The masked sensor data may focus the machine learning model on particular regions of the image that correspond to persons, objects, or some combination thereof. In some embodiments, coordinates corresponding to persons, objects, or combinations thereof, in addition to area ratios between various regions of the image corresponding to the persons, objects, or combinations thereof, may be used to further aid the machine learning model in focusing on important regions of the image for determining the object to person associations.
-
23.
公开(公告)号:US20240233387A1
公开(公告)日:2024-07-11
申请号:US18612058
申请日:2024-03-21
Applicant: NVIDIA Corporation
Inventor: Parthasarathy Sriram , Ratnesh Kumar , Farzin Aghdasi , Arman Toorians , Milind Naphade , Sujit Biswas , Vinay Kolar , Bhanu Pisupati , Aaron Bartholomew
IPC: G06V20/54 , G06F18/231 , G06F18/2413 , G06T7/246 , G06T7/292 , G06T7/70 , G06T7/73 , G06V10/147 , G06V10/20 , G06V10/25 , G06V10/762 , G06V10/764 , G06V10/82 , G06V20/52 , G06V20/58 , G06V20/62 , G06V40/20 , H04N23/90
CPC classification number: G06V20/54 , G06F18/231 , G06F18/24143 , G06T7/246 , G06T7/292 , G06T7/70 , G06T7/73 , G06V10/147 , G06V10/25 , G06V10/255 , G06V10/7625 , G06V10/764 , G06V10/82 , G06V20/52 , G06V20/584 , G06V40/20 , H04N23/90 , G06T2207/10016 , G06T2207/20081 , G06T2207/30201 , G06T2207/30232 , G06T2207/30241 , G06T2207/30264 , G06V20/625 , G06V2201/08
Abstract: The present disclosure provides various approaches for smart area monitoring suitable for parking garages or other areas. These approaches may include ROI-based occupancy detection to determine whether particular parking spots are occupied by leveraging image data from image sensors, such as cameras. These approaches may also include multi-sensor object tracking using multiple sensors that are distributed across an area that leverage both image data and spatial information regarding the area, to provide precise object tracking across the sensors. Further approaches relate to various architectures and configurations for smart area monitoring systems, as well as visualization and processing techniques. For example, as opposed to presenting video of an area captured by cameras, 3D renderings may be generated and played from metadata extracted from sensors around the area.
-
24.
公开(公告)号:US11941887B2
公开(公告)日:2024-03-26
申请号:US17943489
申请日:2022-09-13
Applicant: NVIDIA Corporation
Inventor: Parthasarathy Sriram , Ratnesh Kumar , Farzin Aghdasi , Arman Toorians , Milind Naphade , Sujit Biswas , Vinay Kolar , Bhanu Pisupati , Aaron Bartholomew
IPC: G06V40/20 , G06F18/231 , G06F18/2413 , G06T7/246 , G06T7/292 , G06T7/70 , G06T7/73 , G06V10/147 , G06V10/20 , G06V10/25 , G06V10/762 , G06V10/764 , G06V10/82 , G06V20/52 , G06V20/54 , G06V20/58 , H04N5/247 , H04N23/90 , G06V20/62
CPC classification number: G06V20/54 , G06F18/231 , G06F18/24143 , G06T7/246 , G06T7/292 , G06T7/70 , G06T7/73 , G06V10/147 , G06V10/25 , G06V10/255 , G06V10/7625 , G06V10/764 , G06V10/82 , G06V20/52 , G06V20/584 , G06V40/20 , H04N23/90 , G06T2207/10016 , G06T2207/20081 , G06T2207/30201 , G06T2207/30232 , G06T2207/30241 , G06T2207/30264 , G06V20/625 , G06V2201/08
Abstract: The present disclosure provides various approaches for smart area monitoring suitable for parking garages or other areas. These approaches may include ROI-based occupancy detection to determine whether particular parking spots are occupied by leveraging image data from image sensors, such as cameras. These approaches may also include multi-sensor object tracking using multiple sensors that are distributed across an area that leverage both image data and spatial information regarding the area, to provide precise object tracking across the sensors. Further approaches relate to various architectures and configurations for smart area monitoring systems, as well as visualization and processing techniques. For example, as opposed to presenting video of an area captured by cameras, 3D renderings may be generated and played from metadata extracted from sensors around the area.
-
公开(公告)号:US20230351795A1
公开(公告)日:2023-11-02
申请号:US18347471
申请日:2023-07-05
Applicant: NVIDIA Corporation
Inventor: Parthasarathy Sriram , Fnu Ratnesh Kumar , Anil Ubale , Farzin Aghdasi , Yan Zhai , Subhashree Radhakrishnan
CPC classification number: G06V40/103 , G06N3/08 , G06T7/248 , G06V10/26 , G06V20/52 , G06V40/10 , G06N3/045 , G06V10/82 , G06V10/454 , G06T2207/30232 , G06T2207/10016 , G06T2207/20084 , G06T2207/30196 , G06T2207/30241
Abstract: In various examples, sensor data—such as masked sensor data—may be used as input to a machine learning model to determine a confidence for object to person associations. The masked sensor data may focus the machine learning model on particular regions of the image that correspond to persons, objects, or some combination thereof. In some embodiments, coordinates corresponding to persons, objects, or combinations thereof, in addition to area ratios between various regions of the image corresponding to the persons, objects, or combinations thereof, may be used to further aid the machine learning model in focusing on important regions of the image for determining the object to person associations.
-
26.
公开(公告)号:US20230342666A1
公开(公告)日:2023-10-26
申请号:US18139016
申请日:2023-04-25
Applicant: NVIDIA Corporation
Inventor: Steve Masson , Farzin Aghdasi , Parthasarathy Sriram , Arvind Sai Kumar , Varun Praveen
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: Devices, systems, and techniques for experiment-based training of machine learning models (MLMs) using early stopping. The techniques include starting training tracks (TTs) that train candidate MLMs using the same training data and respective sets of training settings, performing a first evaluation of a first candidate MLM prior to completion of a corresponding first TT, and responsive to the first evaluation, placing the first TT on an inactive status, inactive status indicating that further training of the first candidate MLM is to be ceased. The techniques further include continuing at least a second TT using the training data, and responsive to conclusion of the TTs, selecting, as one or more final MLMs, the first candidate MLM or a second candidate MLM.
-
27.
公开(公告)号:US20230342600A1
公开(公告)日:2023-10-26
申请号:US18139000
申请日:2023-04-25
Applicant: NVIDIA Corporation
Inventor: Steve Masson , Farzin Aghdasi , Parthasarathy Sriram , Arvind Sai Kumar , Varun Praveen
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Devices, systems, and techniques for provisioning of cloud-based machine learning training, optimization, and deployment services. The techniques include providing, to a remote client device, a list of available machine learning models (MLMs), receiving from the remote client device an indication of selected MLM(s) from the provided list, identifying training settings for selected MLM(s), identifying a training data for the selected MLM(s), configuring, using the identified training settings, execution of one or more processes to train the selected MLM(s) using the identified training data, and providing to the remote client device a representation of completed training of at least one MLM.
-
公开(公告)号:US20230153612A1
公开(公告)日:2023-05-18
申请号:US18056559
申请日:2022-11-17
Applicant: NVIDIA Corporation
Inventor: Yu Wang , Farzin Aghdasi , Parthasarathy Sriram
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: When visiting a child node in a graph corresponding to a deep learning model to analyze the child node for pruning in the deep learning model, data identifying pruning information corresponding to one or more parent nodes may be determined and used to access the pruning information. For example, a list of parent nodes of the parent node may be used to access the pruning information for the visit to the child node. The graph may be explored using recursion to iteratively visit nodes to determine portions of pruning information for pruning a node where a portion of the pruning information determined for prior visits to the nodes may be reused. A layer of the deep learning model including multiple dependent convolutions may be pruned by treating each convolution as a separate node and/or layer.
-
29.
公开(公告)号:US20230016568A1
公开(公告)日:2023-01-19
申请号:US17943489
申请日:2022-09-13
Applicant: NVIDIA Corporation
Inventor: Parthasarathy Sriram , Ratnesh Kumar , Farzin Aghdasi , Arman Toorians , Milind Naphade , Sujit Biswas , Vinay Kolar , Bhanu Pisupati , Aaron Bartholomew
IPC: G06V40/20 , G06T7/70 , H04N5/247 , G06T7/246 , G06V10/147 , G06V20/52 , G06V10/20 , G06T7/292 , G06T7/73 , G06K9/62 , G06V20/58 , G06V20/54 , G06V20/62
Abstract: The present disclosure provides various approaches for smart area monitoring suitable for parking garages or other areas. These approaches may include ROI-based occupancy detection to determine whether particular parking spots are occupied by leveraging image data from image sensors, such as cameras. These approaches may also include multi-sensor object tracking using multiple sensors that are distributed across an area that leverage both image data and spatial information regarding the area, to provide precise object tracking across the sensors. Further approaches relate to various architectures and configurations for smart area monitoring systems, as well as visualization and processing techniques. For example, as opposed to presenting video of an area captured by cameras, 3D renderings may be generated and played from metadata extracted from sensors around the area.
-
公开(公告)号:US20200160185A1
公开(公告)日:2020-05-21
申请号:US16197986
申请日:2018-11-21
Applicant: NVIDIA CORPORATION
Inventor: Varun Praveen , Anil Ubale , Parthasarathy Sriram , Greg Heinrich , Tayfun Gurel
Abstract: Input layers of an element-wise operation in a neural network can be pruned such that the shape (e.g., the height, the width, and the depth) of the pruned layers matches. A pruning engine identifies all of the input layers into the element-wise operation. For each set of corresponding neurons in the input layers, the pruning engine equalizes the metrics associated with the neurons to generate an equalized metric associated with the set. The pruning engine prunes the input layers based on the equalized metrics generated for each unique set of corresponding neurons.
-
-
-
-
-
-
-
-
-