-
公开(公告)号:US20250121843A1
公开(公告)日:2025-04-17
申请号:US18485035
申请日:2023-10-11
Applicant: Nvidia Corporation
Inventor: Niranjan Avadhanam , Yuzhuo Ren
Abstract: Approaches presented herein provide for the automated determination of a level of impairment of a person, as may be relevant to the performance of a task. A light and camera-based system can be used to determine factors such as gaze nystagmus that are indicative of inebriation or impairment. A test system can simulate motion of a light using a determined pattern, and capture image data of at least the eye region of a person attempting to follow the motion. The captured image data can be analyzed using a neural network to infer at least one behavior of the user, and the behavior determination(s) can be used to determine a capacity or level of impairment of a user. An appropriate action can be taken, such as to allow a person with full capacity to operate a vehicle or perform a task, or to block access to such operation or performance if the person is determined to be impaired beyond an allowable amount.
-
公开(公告)号:US20240112472A1
公开(公告)日:2024-04-04
申请号:US17959934
申请日:2022-10-04
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren , Dawid Stanislaw Pajak , Niranjan Avadhanam , Guangli DAI
CPC classification number: G06V20/56 , G06T7/90 , G06T15/205 , G06V10/16 , G06V10/25 , G06V10/56 , G06T2207/10024 , G06T2207/30252
Abstract: In various examples, color statistic(s) from ground projections are used to harmonize color between reference and target frames representing an environment. The reference and target frames may be projected onto a representation of the ground (e.g., a ground plane) of the environment, an overlapping region between the projections may be identified, and the portion of each projection that lands in the overlapping region may be taken as a corresponding ground projection. Color statistics (e.g., mean, variance, standard deviation, kurtosis, skew, correlation(s) between color channels) may be computed from the ground projections (or a portion thereof, such as a majority cluster) and used to modify the colors of the target frame to have updated color statistics that match those from the ground projection of the reference frame, thereby harmonizing color across the reference and target frames.
-
公开(公告)号:US20220026987A1
公开(公告)日:2022-01-27
申请号:US17498353
申请日:2021-10-11
Applicant: Nvidia Corporation
Inventor: Feng Hu , Niranjan Avadhanam , Yuzhuo Ren , Sujay Yadawadkar , Sakthivel Sivaraman , Hairong Jiang , Siyue Wu
Abstract: Apparatuses, systems, and techniques are described to determine locations of objects using images including digital representations of those objects. In at least one embodiment, a gaze of one or more occupants of a vehicle is determined independently of a location of one or more sensors used to detect those occupants.
-
公开(公告)号:US12236351B2
公开(公告)日:2025-02-25
申请号:US18497501
申请日:2023-10-30
Applicant: Nvidia Corporation
Inventor: Feng Hu , Niranjan Avadhanam , Yuzhuo Ren , Sujay Yadawadkar , Sakthivel Sivaraman , Hairong Jiang , Siyue Wu
IPC: G06N3/084 , G06F3/01 , G06F18/21 , G06N3/08 , G06V10/764 , G06V10/82 , G06V20/59 , G06V40/18 , G06V40/19
Abstract: Apparatuses, systems, and techniques are described to determine locations of objects using images including digital representations of those objects. In at least one embodiment, a gaze of one or more occupants of a vehicle is determined independently of a location of one or more sensors used to detect those occupants.
-
公开(公告)号:US20230351807A1
公开(公告)日:2023-11-02
申请号:US17661706
申请日:2022-05-02
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren , Weili Nie , Arash Vahdat , Animashree Anandkumar , Nishant Puri , Niranjan Avadhanam
IPC: G06V40/16 , G06V10/82 , G06V10/774 , G06V10/62
CPC classification number: G06V40/176 , G06V10/82 , G06V10/774 , G06V10/62 , G06V40/164
Abstract: A machine learning model (MLM) may be trained and evaluated. Attribute-based performance metrics may be analyzed to identify attributes for which the MLM is performing below a threshold when each are present in a sample. A generative neural network (GNN) may be used to generate samples including compositions of the attributes, and the samples may be used to augment the data used to train the MLM. This may be repeated until one or more criteria are satisfied. In various examples, a temporal sequence of data items, such as frames of a video, may be generated which may form samples of the data set. Sets of attribute values may be determined based on one or more temporal scenarios to be represented in the data set, and one or more GNNs may be used to generate the sequence to depict information corresponding to the attribute values.
-
6.
公开(公告)号:US20230064049A1
公开(公告)日:2023-03-02
申请号:US17462833
申请日:2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche
IPC: G06K9/00 , G06T7/73 , G06F3/01 , G06F16/632 , G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
公开(公告)号:US20220121867A1
公开(公告)日:2022-04-21
申请号:US17076690
申请日:2020-10-21
Applicant: NVIDIA Corporation
Inventor: Nuri Murat Arar , Niranjan Avadhanam , Yuzhuo Ren
IPC: G06K9/00 , B60W40/08 , B60W60/00 , B60W30/095 , B60W50/14
Abstract: In various examples, estimated field of view or gaze information of a user may be projected external to a vehicle and compared to vehicle perception information corresponding to an environment outside of the vehicle. As a result, interior monitoring of a driver or occupant of the vehicle may be used to determine whether the driver or occupant has processed or seen certain object types, environmental conditions, or other information exterior to the vehicle. For a more holistic understanding of the state of the user, attentiveness and/or cognitive load of the user may be monitored to determine whether one or more actions should be taken. As a result, notifications, AEB system activations, and/or other actions may be determined based on a more complete state of the user as determined based on cognitive load, attentiveness, and/or a comparison between external perception of the vehicle and estimated perception of the user.
-
公开(公告)号:US12230040B2
公开(公告)日:2025-02-18
申请号:US18516531
申请日:2023-11-21
Applicant: Nvidia Corporation
Inventor: Yuzhuo Ren , Niranjan Avadhanam
Abstract: State information can be determined for a subject that is robust to different inputs or conditions. For drowsiness, facial landmarks can be determined from captured image data and used to determine a set of blink parameters. These parameters can be used, such as with a temporal network, to estimate a state (e.g., drowsiness) of the subject. To improve robustness, an eye state determination network can determine eye state from the image data, without reliance on intermediate landmarks, that can be used, such as with another temporal network, to estimate the state of the subject. A weighted combination of these values can be used to determine an overall state of the subject. To improve accuracy, individual behavior patterns and context information can be utilized to account for variations in the data due to subject variation or current context rather than changes in state.
-
9.
公开(公告)号:US20240112376A1
公开(公告)日:2024-04-04
申请号:US17959940
申请日:2022-10-04
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren , Dawid Stanislaw Pajak , Niranjan Avadhanam
CPC classification number: G06T11/001 , G06T11/60
Abstract: In various examples, color harmonization is applied to images of an environment in a reference light space. For example, different cameras on an ego-object may use independent capturing algorithms to generate processed images of the environment representing a common time slice using different capture configuration parameters. The processed images may be transformed into deprocessed images by inverting one or more stages of image processing to transform the processed images into a reference light space of linear light, and color harmonization may be applied to the deprocessed images in the reference light space. After applying color harmonization, corresponding image processing may be reapplied to the harmonized images using corresponding capture configuration parameters, the resulting processed harmonized images may be stitched into a stitched image, and a visualization of the stitched image may be presented (e.g., on a monitor visible to an occupant or operator of the ego-object).
-
公开(公告)号:US11657535B2
公开(公告)日:2023-05-23
申请号:US16653959
申请日:2019-10-15
Applicant: NVIDIA Corporation
Inventor: Feng Hu , Yuzhuo Ren , Niranjan Avadhanam , Ankit Pashiney
CPC classification number: G06T7/80 , G06F3/14 , G06T7/70 , H04N17/002 , G06T2207/30268
Abstract: Systems and methods for automatic camera calibration without using a robotic actuator or similar hardware. An electronic display screen projects an image of a simulated three-dimensional calibration pattern, such as a checkerboard, oriented in a particular pose. The camera captures an image of the calibration pattern that is displayed on the screen, and this image together with the transform of the simulated three-dimensional calibration pattern are used to calibrate the camera. Multiple different pictures of different poses are employed to determine the optimal set of poses that produces the lowest reprojection error. To aid in selecting different poses, i.e., spatial positions and orientations of the simulated three-dimensional calibration pattern, poses may be selected from only that portion of the camera's field of view which is expected to be typically used in operation of the camera.
-
-
-
-
-
-
-
-
-