-
公开(公告)号:US20220092400A1
公开(公告)日:2022-03-24
申请号:US17543551
申请日:2021-12-06
Applicant: Intel Corporation
Inventor: Noam Elron , Ben Berlin , Dmitry Rudoy , Amir Goren , Noam Levy
Abstract: A method, system, and article of highly efficient neural network video image processing uses temporal correlations.
-
公开(公告)号:US20240249392A1
公开(公告)日:2024-07-25
申请号:US18583642
申请日:2024-02-21
Applicant: Intel Corporation
Inventor: Dmitry Rudoy , Rakefet Kol , Noam Elron , Noam Levy
CPC classification number: G06T5/50 , G06T3/40 , G06T5/20 , G06T7/10 , G06T2207/20084 , G06T2207/20221
Abstract: A high-level understanding of the scene captured by a camera allows for the use of scene-level understanding in the processing of the captured image. A downscaled image of a captured scene is generated and used as a basis for artificial intelligence analysis before the full image of the captured scene is processed. The downscaled image is generated concurrently with the capturing of the raw image at the image sensor and before full image signal processor (ISP) processing. Neural networks and other AI algorithms can be applied directly to the downscaled image to perform high-level understanding using minimal resources. The processing of the full scale captured image can be adapted to specific scenarios based on the understanding rather than undergoing all-purpose processing. The high-level understanding is provided to the full image processing pipe for enhancements in image quality, video conferencing, face detection, and other user experiences.
-
公开(公告)号:US20250005765A1
公开(公告)日:2025-01-02
申请号:US18342549
申请日:2023-06-27
Applicant: Intel Corporation
Inventor: Dmitry Rudoy , Rakefet Kol , Noam Elron , Noam Levy
Abstract: Systems, apparatus, articles of manufacture, and methods are disclosed to process images using segmentation. An example apparatus includes interface circuitry, machine readable instructions, and programmable circuitry to at least one of instantiate or execute the machine readable instructions to generate a scaled frame from an input video frame, segment, with a neural network, the scaled frame to generate a scaled segmentation map based on the scaled frame, the scaled segmentation map to associate pixels of the scaled frame with ones of a plurality of segments in the scaled frame, and generate an output video frame based on the input video frame and an upscaled version of the scaled segmentation map.
-
公开(公告)号:US20240192774A1
公开(公告)日:2024-06-13
申请号:US18584782
申请日:2024-02-22
Applicant: Intel Corporation
Inventor: Elad Sunray , Dmitry Rudoy , Noam Levy
CPC classification number: G06F3/013 , G06F3/012 , G06T3/60 , G06T7/73 , G06T7/80 , G06T2207/20084 , G06T2207/20132 , G06T2207/30201
Abstract: Systems and methods for real-time, efficient, monocular gaze position determination that can be performed in real-time on a consumer-grade laptop. Gaze tracking can be used for human-computer interactions, such as window selection, user attention on screen information, gaming, augmented reality, and virtual reality. Gaze position estimation from a monocular camera involves estimating the line-of-sight of a user and intersecting the line-of-sight with a two-dimensional (2D) screen. The system uses a neural network to determine gaze position within about four degrees of accuracy while maintaining very low computational complexity. The system can be used to determine gaze position across multiple screens, determining which screen a user is viewing as well as a gaze target area on the screen. There are many different scenarios in which a gaze position estimation system can be used, including different head poses, different facial expressions, different cameras, different screens, and various illumination scenarios.
-
公开(公告)号:US20220398780A1
公开(公告)日:2022-12-15
申请号:US17845652
申请日:2022-06-21
Applicant: Intel Corporation
Inventor: Oskar Pelc , Dmitry Rudoy , Noam Levy
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to calibrate a stereo camera. An example apparatus includes means for determining a motion grid between a first image and a second image captured by the stereo camera; means for determining a calibration value to calibrate the stereo camera based on a prior calibration value, a relative orientation between the first image and the second image based on the motion grid, and a metric indicative of calibration improvement; and means for estimating a depth based on the calibration value.
-
公开(公告)号:US09826149B2
公开(公告)日:2017-11-21
申请号:US14670642
申请日:2015-03-27
Applicant: Intel Corporation
Inventor: Edmond Chalom , Dmitry Rudoy , Noam Levy , Harish Essaky Sankaran , Jarno Nikkanen , Joni-Matti Maatta , German Voronov
CPC classification number: H04N5/23229 , G06K9/00684 , G06T7/11 , G06T2207/10004 , G06T2207/20084 , H04N5/23222
Abstract: Methods, apparatuses and systems may provide for operating a machine learning device by obtaining training image data, conducting an offline prediction analysis of the training image data with respect to one or more real-time parameters of an image capture device, and generating one or more parameter detection models based on the offline prediction analysis. Additionally, methods, apparatuses and systems may provide for operating the image capture device by obtaining a candidate image associated with the image capture device, determining that the candidate image corresponds to a particular type of scene represented in a parameter prediction model, and adjusting one or more real-time parameters of the image capture device based at least in part on one or more parameter values associated with the particular type of scene.
-
-
-
-
-