-
公开(公告)号:US20250117980A1
公开(公告)日:2025-04-10
申请号:US18484122
申请日:2023-10-10
Applicant: NVIDIA Corporation
Inventor: Yuzhuo REN , Niranjan AVADHANAM
IPC: G06T11/00 , G06V10/77 , G06V10/774 , G06V10/82 , G06V20/59
Abstract: In various examples, infrared image data (e.g., frames of an infrared video feed) may be colorized by applying the infrared image data and/or a corresponding edge map to a generator of a generative adversarial network (GAN). The GAN may be trained with or without paired ground truth RGB and infrared (and/or edge map) images. In an example of the latter scenario, a first generator G(IR)→RGB and a second generator G(RGB)→IR may be trained in a first chain, their positions may be swapped in a second chain, and the second chain may be trained. In some embodiments, edges may be emphasized by weighting edge pixels (e.g., determined from a corresponding edge map) higher than non-edge pixels when backpropagating loss. After training, G(IR)→RGB may be used to generate RGB image data from infrared image data (and/or a corresponding edge map).
-
公开(公告)号:US20250117981A1
公开(公告)日:2025-04-10
申请号:US18484306
申请日:2023-10-10
Applicant: NVIDIA Corporation
Inventor: Yuzhuo REN , Niranjan AVADHANAM
IPC: G06T11/00 , G06V10/77 , G06V10/774 , G06V10/82 , G06V20/59
Abstract: In various examples, infrared image data (e.g., frames of an infrared (IR) video feed) may be colorized by transferring color statistics from an RGB image with an overlapping field of view, by modifying one or more dimensions of an encoded representation of a generated RGB image, and/or otherwise. For example, segmentation may be applied to the IR and RGB image data, and the one or more colors or statistics may be transferred from a segmented region of the RGB image data to a corresponding segmented region of the IR image data. In some embodiments, synthesized RGB image data may be fined tuned by transferring color or color statistic(s) from corresponding real RGB image data, and/or by modifying one or more dimensions of an encoded representation of the synthesized RGB image data.
-
公开(公告)号:US20230316773A1
公开(公告)日:2023-10-05
申请号:US18173630
申请日:2023-02-23
Applicant: NVIDIA Corporation
Inventor: Niranjan AVADHANAM , Ratin KUMAR
CPC classification number: G06V20/58 , B60W30/06 , B60W2420/42 , B60W2420/52
Abstract: In various examples, sensor data may be captured by sensors of an ego-object, such as a vehicle traveling in a physical environment, and a representation of the sensor data may be streamed from the ego-object to a remote location to facilitate various remote experiences, such as streaming to a remote viewer (e.g., a friend or relative), streaming to a remote or fleet operator, streaming to a mobile app configured to self-park or summon an ego-object, rendering a 3D augmented reality (AR) or virtual reality (VR) representation of the physical environment, and/or others. In some embodiments, the stream includes one or more command channels used to control data collection, rendering, stream content, or even vehicle maneuvers, such as during an emergency, self-park, or summon scenario.
-
公开(公告)号:US20250022223A1
公开(公告)日:2025-01-16
申请号:US18221018
申请日:2023-07-12
Applicant: NVIDIA Corporation
Inventor: Yuzhuo REN , Niranjan AVADHANAM
Abstract: In various examples, a visualization of an environment may be generated using a Panini projection that is optimized based on detected scene content. For example, image data of an environment may be perspective projected (e.g., using a rectilinear projection) to generate a reference projection image, which may be analyzed to detect the presence of vanishing points and/or horizontal lines (e.g., in a central region). The image data of the environment may be projected using a Panini projection that is optimized based on distances to detected objects, the absence of a detected vanishing point, and/or the presence of a detected horizontal line to generate a Panini projection image. In some embodiments, vertical compression is applied to the Panini projection image to correct for distortion of horizontal lines (e.g., based on the presence of a detected horizontal line).
-
公开(公告)号:US20250022218A1
公开(公告)日:2025-01-16
申请号:US18353441
申请日:2023-07-17
Applicant: NVIDIA Corporation
Inventor: Nuri Murat ARAR , Niranjan AVADHANAM , Yuzhuo REN , Hairong JIANG
Abstract: In various examples, updates to a dynamic seam placement and/or fitted 3D bowl may be at least partially concealed using temporal masking. A future time in which a predicted change in dynamic seam placement and/or fitted 3D bowl exceeds some threshold may be determined. A predicted dynamic seam placement and/or fitted 3D bowl update may be temporally masked by triggering the update before arriving at the future time to compensate for the latency of the temporal filtering and/or by adjusting the temporal filter size (e.g., shortening a temporal window over which temporal filtering is applied) in anticipation of the predicted dynamic seam placement and/or fitted 3D bowl update, effectively maintaining some of the smoothing effects of temporal filtering, while reducing the latency.
-
6.
公开(公告)号:US20230316458A1
公开(公告)日:2023-10-05
申请号:US18173589
申请日:2023-02-23
Applicant: NVIDIA Corporation
Inventor: Yuzhuo REN , Kenneth TURKOWSKI , Nuri Murat ARAR , Orazio GALLO , Jan KAUTZ , Niranjan AVADHANAM , Hang SU
CPC classification number: G06T3/4038 , G06T7/74
Abstract: In various examples, dynamic seam placement is used to position seams in regions of overlapping image data to avoid crossing salient objects or regions. Objects may be detected from image frames representing overlapping views of an environment surrounding an ego-object such as a vehicle. The images may be aligned to create an aligned composite image or surface (e.g., a panorama, a 360° image, bowl shaped surface) with regions of overlapping image data, and a representation of the detected objects and/or salient regions (e.g., a saliency mask) may be generated and projected onto the aligned composite image or surface. Seams may be positioned in the overlapping regions to avoid or minimize crossing salient pixels represented in the projected masks, and the image data may be blended at the seams to create a stitched image or surface (e.g., a stitched panorama, stitched 360° image, stitched textured surface).
-
公开(公告)号:US20250142208A1
公开(公告)日:2025-05-01
申请号:US18494138
申请日:2023-10-25
Applicant: NVIDIA Corporation
Inventor: Sakthivel SIVARAMAN , Rajath SHETTY , Animesh KHEMKA , Niranjan AVADHANAM
IPC: H04N23/667 , B60R1/20 , H04N23/11 , H04N23/83
Abstract: In various examples, an image processing pipeline may switch between different operating or switching modes based on speed of ego-motion and/or the active gear (e.g., park vs. drive) of a vehicle or other ego-machine in which an RGB/IR camera is being used. For example, a first operating or switching mode that toggles between IR and RGB imaging modes at a fixed frame rate or interval may be used when the vehicle is in motion, in a particular gear (e.g., drive), and/or traveling above a threshold speed. In another example, a second operating or switching mode that toggles between IR and RGB imaging modes based on detected light intensity may be used when the vehicle is in stationary, in park (or out of gear), and/or traveling below a threshold speed.
-
公开(公告)号:US20250136130A1
公开(公告)日:2025-05-01
申请号:US18499913
申请日:2023-11-01
Applicant: NVIDIA Corporation
Inventor: Rajath SHETTY , Ratin KUMAR , Niral Lalit PATHAK , Niranjan AVADHANAM
IPC: B60W50/10 , B60W60/00 , G06V20/58 , G06V20/59 , G06V20/62 , G06V30/14 , G06V30/19 , G06V30/262 , G10L15/183
Abstract: Various embodiments of the present disclosure relate to operator assistance based on extracting natural language characters from one or more sensed objects. For instance, particular embodiments may generate a natural language utterance based on extracting natural language text in a nearby traffic sign. In an illustrative example, particular embodiments may detect, via object detection and within image data, one or more regions of the image data depicting the traffic sign. Particular embodiments can then extract one or more first natural language characters represented in the traffic sign based at least on performing optical character recognition within the one or more regions of the image data in response to detecting the one or more regions of the image data depicting the traffic sign.
-
9.
公开(公告)号:US20160284060A1
公开(公告)日:2016-09-29
申请号:US13956735
申请日:2013-08-01
Applicant: NVIDIA Corporation
Inventor: Varun Kumar ALLAGADAPA , Niranjan AVADHANAM , Thrinadh KOTTANA , Shalini GUPTA
CPC classification number: G06T1/20 , G06K9/4661 , G06T5/002 , G06T5/50 , G06T2207/10016 , G06T2207/20004 , G06T2207/20182
Abstract: A method of noise filter parameter adaptation, the method comprising receiving a current video frame comprising a plurality of pixels. A table lookup is performed, using current statistical values associated with the current video frame. Noise filter parameters are adapted, based on current lighting conditions as determined from the performed table lookup. The current lighting conditions correspond to the current statistical values. The current video frame is noise filtered as defined by the adapted noise filter parameters.
Abstract translation: 一种噪声滤波器参数自适应的方法,所述方法包括接收包括多个像素的当前视频帧。 使用与当前视频帧相关联的当前统计值来执行表查找。 基于从执行的表查找确定的当前照明条件来适应噪声滤波器参数。 当前的照明条件对应于当前的统计值。 当前视频帧由适应的噪声滤波器参数定义进行噪声滤波。
-
公开(公告)号:US20250157170A1
公开(公告)日:2025-05-15
申请号:US18507740
申请日:2023-11-13
Applicant: NVIDIA Corporation
Inventor: Yuzhuo REN , Yining DENG , Dawid Stanislaw PAJAK , Robin JENKIN , Niranjan AVADHANAM
Abstract: In various examples, metadata-based image harmonization for image stitching systems and applications are disclosed. Systems and methods are disclosed that preprocess images with respect to rendering parameters, with the effect of blending those parameters at a border between images to facilitate a smooth rendering when those images are stitched together. An image signal processing (ISP) parameter harmonization function may input metadata parameters associated with a set of images to match and blend one or more of the rendering parameters across an overlapping border between images prior to applying those images to a stitching algorithm. A scaling of the metadata parameter may be performed using a parameter gain function. Pixels in both images located along the border are adjusted to the same boundary metadata parameter value, and smoothed based on the parameter gain function. A discontinuity in rendering parameters is avoided, substantially avoiding corresponding artifacts in the resulting stitched image.
-
-
-
-
-
-
-
-
-