-
公开(公告)号:US20220012536A1
公开(公告)日:2022-01-13
申请号:US17483688
申请日:2021-09-23
Applicant: NVIDIA Corporation
Inventor: Ting-Chun Wang , Ming-Yu Liu , Bryan Christopher Catanzaro , Jan Kautz , Andrew J. Tao
Abstract: A method, computer readable medium, and system are disclosed for creating an image utilizing a map representing different classes of specific pixels within a scene. One or more computing systems use the map to create a preliminary image. This preliminary image is then compared to an original image that was used to create the map. A determination is made whether the preliminary image matches the original image, and results of the determination are used to adjust the computing systems that created the preliminary image, which improves a performance of such computing systems. The adjusted computing systems are then used to create images based on different input maps representing various object classes of specific pixels within a scene.
-
公开(公告)号:US11132543B2
公开(公告)日:2021-09-28
申请号:US15855887
申请日:2017-12-27
Applicant: NVIDIA Corporation
Inventor: Rajeev Ranjan , Shalini De Mello , Jan Kautz
IPC: G06K9/00 , G06K9/62 , G06T7/11 , G06T7/70 , G06N3/04 , G06T7/73 , G06K9/32 , G06N3/08 , G06K9/46
Abstract: A method, computer readable medium, and system are disclosed for performing unconstrained appearance-based gaze estimation. The method includes the steps of identifying an image of an eye and a head orientation associated with the image of the eye, determining an orientation for the eye by analyzing, within a convolutional neural network (CNN), the image of the eye and the head orientation associated with the image of the eye, and returning the orientation of the eye.
-
公开(公告)号:US20210271977A1
公开(公告)日:2021-09-02
申请号:US17325024
申请日:2021-05-19
Applicant: NVIDIA Corporation
Inventor: Xiaodong Yang , Pavlo Molchanov , Jan Kautz
Abstract: A method, computer readable medium, and system are disclosed for visual sequence learning using neural networks. The method includes the steps of replacing a non-recurrent layer within a trained convolutional neural network model with a recurrent layer to produce a visual sequence learning neural network model and transforming feedforward weights for the non-recurrent layer into input-to-hidden weights of the recurrent layer to produce a transformed recurrent layer. The method also includes the steps of setting hidden-to-hidden weights of the recurrent layer to initial values and processing video image data by the visual sequence learning neural network model to generate classification or regression output data.
-
公开(公告)号:US11082720B2
公开(公告)日:2021-08-03
申请号:US16191174
申请日:2018-11-14
Applicant: NVIDIA Corporation
Inventor: Yi-Hsuan Tsai , Ming-Yu Liu , Deqing Sun , Ming-Hsuan Yang , Jan Kautz
IPC: H04N19/85 , H04N19/91 , H04N19/436 , H04N19/46
Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.
-
公开(公告)号:US11037051B2
公开(公告)日:2021-06-15
申请号:US16565885
申请日:2019-09-10
Applicant: NVIDIA Corporation
Inventor: Kihwan Kim , Jinwei Gu , Chen Liu , Jan Kautz
Abstract: Planar regions in three-dimensional scenes offer important geometric cues in a variety of three-dimensional perception tasks such as scene understanding, scene reconstruction, and robot navigation. Image analysis to detect planar regions can be performed by a deep learning architecture that includes a number of neural networks configured to estimate parameters for the planar regions. The neural networks process an image to detect an arbitrary number of plane objects in the image. Each plane object is associated with a number of estimated parameters including bounding box parameters, plane normal parameters, and a segmentation mask. Global parameters for the image, including a depth map, can also be estimated by one of the neural networks. Then, a segmentation refinement network jointly optimizes (i.e., refines) the segmentation masks for each instance of the plane objects and combines the refined segmentation masks to generate an aggregate segmentation mask for the image.
-
156.
公开(公告)号:US20210150757A1
公开(公告)日:2021-05-20
申请号:US16690015
申请日:2019-11-20
Applicant: NVIDIA Corporation
Inventor: Siva Karthik Mustikovela , Varun Jampani , Shalini De Mello , Sifei Liu , Umar Iqbal , Jan Kautz
Abstract: Apparatuses, systems, and techniques to identify orientations of objects within images. In at least one embodiment, one or more neural networks are trained to identify an orientations of one or more objects based, at least in part, on one or more characteristics of the object other than the object's orientation.
-
公开(公告)号:US20210133990A1
公开(公告)日:2021-05-06
申请号:US16675120
申请日:2019-11-05
Applicant: NVIDIA Corporation
Inventor: Benjamin David Eckart , Wentao Yuan , Varun Jampani , Kihwan Kim , Jan Kautz
Abstract: Apparatuses, systems, and techniques to generate a 3D model of an object. In at least one embodiment, a 3D model of an object is generated by one or more neural networks, based on a plurality of images of the object.
-
公开(公告)号:US10984286B2
公开(公告)日:2021-04-20
申请号:US16265725
申请日:2019-02-01
Applicant: NVIDIA Corporation
Inventor: Aysegul Dundar , Ming-Yu Liu , Ting-Chun Wang , John Zedlewski , Jan Kautz
IPC: G06K9/62 , G06K9/32 , G06K9/00 , G01N3/08 , G06N3/04 , G06T7/10 , G06T3/00 , G06T11/00 , G06T15/00 , G06N3/08
Abstract: A style transfer neural network may be used to generate stylized synthetic images, where real images provide the style (e.g., seasons, weather, lighting) for transfer to synthetic images. The stylized synthetic images may then be used to train a recognition neural network. In turn, the trained neural network may be used to predict semantic labels for the real images, providing recognition data for the real images. Finally, the real training dataset (real images and predicted recognition data) and the synthetic training dataset are used by the style transfer neural network to generate stylized synthetic images. The training of the neural network, prediction of recognition data for the real images, and stylizing of the synthetic images may be repeated for a number of iterations. The stylization operation more closely aligns a covariate of the synthetic images to the covariate of the real images, improving accuracy of the recognition neural network.
-
公开(公告)号:US20210073612A1
公开(公告)日:2021-03-11
申请号:US16566797
申请日:2019-09-10
Applicant: NVIDIA Corporation
Inventor: Arash Vahdat , Arun Mohanray Mallya , Ming-Yu Liu , Jan Kautz
Abstract: In at least one embodiment, differentiable neural architecture search and reinforcement learning are combined under one framework to discover network architectures with desired properties such as high accuracy, low latency, or both. In at least one embodiment, an objective function for search based on generalization error prevents the selection of architectures prone to overfitting.
-
公开(公告)号:US20210064907A1
公开(公告)日:2021-03-04
申请号:US16998890
申请日:2020-08-20
Applicant: NVIDIA Corporation
Inventor: Xiaodong Yang , Yang Zou , Zhiding Yu , Jan Kautz
Abstract: Object re-identification refers to a process by which images that contain an object of interest are retrieved from a set of images captured using disparate cameras or in disparate environments. Object re-identification has many useful applications, particularly as it is applied to people (e.g. person tracking). Current re-identification processes rely on convolutional neural networks (CNNs) that learn re-identification for a particular object class from labeled training data specific to a certain domain (e.g. environment), but that do not apply well in other domains. The present disclosure provides cross-domain disentanglement of id-related and id-unrelated factors. In particular, the disentanglement is performed using a labeled image set and an unlabeled image set, respectively captured from different domains but for a same object class. The identification-related features may then be used to train a neural network to perform re-identification of objects in that object class from images captured from the second domain.
-
-
-
-
-
-
-
-
-