-
Publication No.: US20230065399A1
Publication date: 2023-03-02
Application No.: US17410580
Filing date: 2021-08-24
Applicant: Nvidia Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam
Abstract: State information can be determined for a subject that is robust to different inputs or conditions. For drowsiness, facial landmarks can be determined from captured image data and used to determine a set of blink parameters. These parameters can be used, such as with a temporal network, to estimate a state (e.g., drowsiness) of the subject. To improve robustness, an eye state determination network can determine eye state from the image data, without reliance on intermediate landmarks, that can be used, such as with another temporal network, to estimate the state of the subject. A weighted combination of these values can be used to determine an overall state of the subject. To improve accuracy, individual behavior patterns and context information can be utilized to account for variations in the data due to subject variation or current context rather than changes in state.
-
Publication No.: US20220207756A1
Publication date: 2022-06-30
Application No.: US17139587
Filing date: 2020-12-31
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam
Abstract: In various examples, two or more cameras in an automotive surround view system generate two or more input images to be stitched, or combined, into a single stitched image. In an embodiment, to improve the quality of a stitched image, a feedback module calculates two or more scores representing errors between the stitched image and one or more input images. If a computed score indicates structural errors in the stitched image, the feedback module calculates and applies one or more geometric transforms to apply to the one or more input images. If a computed score indicates color errors in the stitched image, the feedback module calculates and applies one or more photometric transforms to apply to the one or more input images.
-
Publication No.: US11144754B2
Publication date: 2021-10-12
Application No.: US16544442
Filing date: 2019-08-19
Applicant: Nvidia Corporation
Inventor: Feng Hu, Niranjan Avadhanam, Yuzhuo Ren, Sujay Yadawadkar, Sakthivel Sivaraman, Hairong Jiang, Siyue Wu
Abstract: Apparatuses, systems, and techniques are described to determine locations of objects using images including digital representations of those objects. In at least one embodiment, a gaze of one or more occupants of a vehicle is determined independently of a location of one or more sensors used to detect those occupants.
-
Publication No.: US20250124734A1
Publication date: 2025-04-17
Application No.: US18999826
Filing date: 2024-12-23
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
Publication No.: US20250042413A1
Publication date: 2025-02-06
Application No.: US18922003
Filing date: 2024-10-21
Applicant: Nvidia Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam
Abstract: State information can be determined for a subject that is robust to different inputs or conditions. For drowsiness, facial landmarks can be determined from captured image data and used to determine a set of blink parameters. These parameters can be used, such as with a temporal network, to estimate a state (e.g., drowsiness) of the subject. To improve robustness, an eye state determination network can determine eye state from the image data, without reliance on intermediate landmarks, that can be used, such as with another temporal network, to estimate the state of the subject. A weighted combination of these values can be used to determine an overall state of the subject. To improve accuracy, individual behavior patterns and context information can be utilized to account for variations in the data due to subject variation or current context rather than changes in state.
-
Publication No.: US12211308B2
Publication date: 2025-01-28
Application No.: US17462833
Filing date: 2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
Publication No.: US11978266B2
Publication date: 2024-05-07
Application No.: US17076690
Filing date: 2020-10-21
Applicant: NVIDIA Corporation
Inventor: Nuri Murat Arar, Niranjan Avadhanam, Yuzhuo Ren
CPC classification number: G06V20/597, B60W30/0956, B60W40/08, B60W50/14, B60W60/001, G06V20/56, B60W30/09, B60W2540/22, B60W2540/221, B60W2540/223, B60W2540/225
Abstract: In various examples, estimated field of view or gaze information of a user may be projected external to a vehicle and compared to vehicle perception information corresponding to an environment outside of the vehicle. As a result, interior monitoring of a driver or occupant of the vehicle may be used to determine whether the driver or occupant has processed or seen certain object types, environmental conditions, or other information exterior to the vehicle. For a more holistic understanding of the state of the user, attentiveness and/or cognitive load of the user may be monitored to determine whether one or more actions should be taken. As a result, notifications, AEB system activations, and/or other actions may be determined based on a more complete state of the user as determined based on cognitive load, attentiveness, and/or a comparison between external perception of the vehicle and estimated perception of the user.
-
Publication No.: US11954862B2
Publication date: 2024-04-09
Application No.: US17479648
Filing date: 2021-09-20
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam, Rajath Bellipady Shetty
CPC classification number: G06T7/10, A61B5/024, A61B5/087, G16H30/40, G06T2207/20081, G06T2207/20084, G06T2207/30088, G06T2207/30201
Abstract: A neural network system leverages dual attention, specifically both spatial attention and channel attention, to jointly estimate heart rate and respiratory rate of a subject by processing images of the subject. A motion neural network receives images of the subject and estimates heart and breath rates of the subject using both spatial and channel domain attention masks to focus processing on particular feature data. An appearance neural network computes a spatial attention mask from the images of the subject and may indicate that features associated with the subject's face (as opposed to the subject's hair or shoulders) should be used to accurately estimate the heart and/or breath rate. Channel-wise domain attention is learned during training and recalibrates channel-wise feature responses to select the most informative features for processing. The channel attention mask is learned during training and can be used for different subjects during deployment.
-
Publication No.: US11948315B2
Publication date: 2024-04-02
Application No.: US17139587
Filing date: 2020-12-31
Applicant: NVIDIA Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam
IPC: G06T3/00, B60R1/00, G06T3/14, G06T3/4038, G06T5/50, G06T7/00, G06T7/33, G06T7/60, G06T7/90, B25J9/16, H04N23/90
CPC classification number: G06T7/33, B60R1/00, G06T3/14, G06T3/4038, G06T5/50, G06T7/0002, G06T7/60, G06T7/90, B25J9/1689, B60R2300/105, B60R2300/20, B60R2300/303, B60R2300/804, B60R2300/806, B60R2300/8093, G06T2207/20084, G06T2207/20212, G06T2207/30168, G06T2207/30252, H04N23/90
Abstract: In various examples, two or more cameras in an automotive surround view system generate two or more input images to be stitched, or combined, into a single stitched image. In an embodiment, to improve the quality of a stitched image, a feedback module calculates two or more scores representing errors between the stitched image and one or more input images. If a computed score indicates structural errors in the stitched image, the feedback module calculates and applies one or more geometric transforms to apply to the one or more input images. If a computed score indicates color errors in the stitched image, the feedback module calculates and applies one or more photometric transforms to apply to the one or more input images.
-
Publication No.: US20230065491A1
Publication date: 2023-03-02
Application No.: US17410564
Filing date: 2021-08-24
Applicant: Nvidia Corporation
Inventor: Yuzhuo Ren, Niranjan Avadhanam
Abstract: State information can be determined for a subject that is robust to different inputs or conditions. For drowsiness, facial landmarks can be determined from captured image data and used to determine a set of blink parameters. These parameters can be used, such as with a temporal network, to estimate a state (e.g., drowsiness) of the subject. To improve robustness, an eye state determination network can determine eye state from the image data, without reliance on intermediate landmarks, that can be used, such as with another temporal network, to estimate the state of the subject. A weighted combination of these values can be used to determine an overall state of the subject. To improve accuracy, individual behavior patterns and context information can be utilized to account for variations in the data due to subject variation or current context rather than changes in state.