MINIMAL IMAGE SIGNAL PROCESSING PIPELINE FOR AN EARLY SCENE UNDERSTANDING

    Publication number: US20240249392A1

    Publication date: 2024-07-25

    Application number: US18583642

    Application date: 2024-02-21

    Abstract: A high-level understanding of the scene captured by a camera allows for the use of scene-level understanding in the processing of the captured image. A downscaled image of a captured scene is generated and used as a basis for artificial intelligence analysis before the full image of the captured scene is processed. The downscaled image is generated concurrently with the capturing of the raw image at the image sensor and before full image signal processor (ISP) processing. Neural networks and other AI algorithms can be applied directly to the downscaled image to perform high-level understanding using minimal resources. The processing of the full scale captured image can be adapted to specific scenarios based on the understanding rather than undergoing all-purpose processing. The high-level understanding is provided to the full image processing pipe for enhancements in image quality, video conferencing, face detection, and other user experiences.
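The flow in this abstract (downscale first, analyze the small image, then adapt full processing) can be sketched as follows. This is a minimal illustration, not the patented method: the box-average downscaler, the brightness-threshold `classify_scene` (a stand-in for the neural network), and the `ISP_PROFILES` settings are all assumptions introduced here.

```python
from typing import Dict, List

Image = List[List[float]]  # grayscale rows, pixel values in [0, 1]


def downscale(raw: Image, factor: int) -> Image:
    """Box-average non-overlapping factor x factor blocks (assumed method)."""
    out_h, out_w = len(raw) // factor, len(raw[0]) // factor
    small = []
    for by in range(out_h):
        row = []
        for bx in range(out_w):
            block = [
                raw[by * factor + dy][bx * factor + dx]
                for dy in range(factor)
                for dx in range(factor)
            ]
            row.append(sum(block) / len(block))
        small.append(row)
    return small


def classify_scene(thumb: Image) -> str:
    """Hypothetical stand-in for the AI analysis: label by mean brightness."""
    pixels = [p for row in thumb for p in row]
    return "low_light" if sum(pixels) / len(pixels) < 0.25 else "daylight"


# Hypothetical per-scene settings for the full-resolution pipeline.
ISP_PROFILES: Dict[str, Dict[str, float]] = {
    "low_light": {"denoise_strength": 0.8, "gain": 4.0},
    "daylight": {"denoise_strength": 0.2, "gain": 1.0},
}


def plan_full_processing(raw: Image, factor: int = 4) -> Dict[str, object]:
    """Analyze the thumbnail, then pick settings for full-image processing."""
    scene = classify_scene(downscale(raw, factor))
    return {"scene": scene, **ISP_PROFILES[scene]}
```

The point of the design is that only the small image is inspected before the full-resolution pipeline is configured, so the analysis cost scales with the thumbnail size rather than the sensor resolution.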

    METHODS AND APPARATUS TO PROCESS IMAGES USING SEGMENTATION

    Publication number: US20250005765A1

    Publication date: 2025-01-02

    Application number: US18342549

    Application date: 2023-06-27

    Abstract: Systems, apparatus, articles of manufacture, and methods are disclosed to process images using segmentation. An example apparatus includes interface circuitry, machine readable instructions, and programmable circuitry to at least one of instantiate or execute the machine readable instructions to generate a scaled frame from an input video frame, segment, with a neural network, the scaled frame to generate a scaled segmentation map based on the scaled frame, the scaled segmentation map to associate pixels of the scaled frame with ones of a plurality of segments in the scaled frame, and generate an output video frame based on the input video frame and an upscaled version of the scaled segmentation map.
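The sequence described here (scale the frame, segment the scaled frame, upscale the map, compose the output) might look like the sketch below. Everything concrete is an assumption for illustration: nearest-neighbor scaling, a threshold-based `segment` standing in for the neural network, and an output step that blanks background pixels.

```python
from typing import List

Frame = List[List[float]]  # grayscale rows, pixel values in [0, 1]
Mask = List[List[int]]     # per-pixel segment labels


def scale_down(frame: Frame, factor: int) -> Frame:
    """Keep every factor-th pixel in both directions (nearest neighbor)."""
    return [row[::factor] for row in frame[::factor]]


def segment(scaled: Frame) -> Mask:
    """Hypothetical stand-in for the neural network: 1 = foreground."""
    return [[1 if p >= 0.5 else 0 for p in row] for row in scaled]


def upscale_map(seg: Mask, factor: int, height: int, width: int) -> Mask:
    """Nearest-neighbor upscale of the segmentation map to frame size."""
    max_y, max_x = len(seg) - 1, len(seg[0]) - 1
    return [
        [seg[min(y // factor, max_y)][min(x // factor, max_x)] for x in range(width)]
        for y in range(height)
    ]


def render(frame: Frame, factor: int = 2) -> Frame:
    """Build the output frame from the input frame and the upscaled map."""
    seg = segment(scale_down(frame, factor))
    full_map = upscale_map(seg, factor, len(frame), len(frame[0]))
    # Assumed effect: keep foreground pixels, set background pixels to 0.0.
    return [
        [p if m == 1 else 0.0 for p, m in zip(row, mask_row)]
        for row, mask_row in zip(frame, full_map)
    ]
```

Running the network on the scaled frame keeps inference cost low, while the final composition still operates on the full-resolution input.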

    DETERMINATION OF GAZE POSITION ON MULTIPLE SCREENS USING A MONOCULAR CAMERA

    Publication number: US20240192774A1

    Publication date: 2024-06-13

    Application number: US18584782

    Application date: 2024-02-22

    Abstract: Systems and methods are disclosed for efficient monocular gaze position determination that can be performed in real time on a consumer-grade laptop. Gaze tracking can be used for human-computer interactions, such as window selection, user attention on screen information, gaming, augmented reality, and virtual reality. Gaze position estimation from a monocular camera involves estimating the line-of-sight of a user and intersecting the line-of-sight with a two-dimensional (2D) screen. The system uses a neural network to determine gaze position to an accuracy of about four degrees while maintaining very low computational complexity. The system can be used to determine gaze position across multiple screens, determining which screen a user is viewing as well as a gaze target area on the screen. There are many different scenarios in which a gaze position estimation system can be used, including different head poses, different facial expressions, different cameras, different screens, and various illumination scenarios.
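The geometric step mentioned in the abstract, intersecting a line of sight with a 2D screen and deciding which of several screens is hit, can be sketched with a plain ray-plane intersection. The `Screen` layout, the coordinate frame, and the function names are assumptions made for this example; the neural network that would estimate the eye position and gaze direction is not modeled.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

Vec = Tuple[float, float, float]


def dot(a: Vec, b: Vec) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def sub(a: Vec, b: Vec) -> Vec:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def cross(a: Vec, b: Vec) -> Vec:
    return (a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0])


@dataclass
class Screen:
    """Hypothetical screen model: a rectangle in camera coordinates (metres)."""
    name: str
    origin: Vec   # top-left corner
    u: Vec        # unit vector along the width
    v: Vec        # unit vector along the height
    width: float
    height: float


def intersect(eye: Vec, gaze: Vec, screen: Screen) -> Optional[Tuple[float, float]]:
    """Return the (x, y) hit point on the screen, or None if the ray misses."""
    normal = cross(screen.u, screen.v)
    denom = dot(gaze, normal)
    if abs(denom) < 1e-9:          # line of sight parallel to the screen plane
        return None
    t = dot(sub(screen.origin, eye), normal) / denom
    if t <= 0:                     # screen plane is behind the viewer
        return None
    hit = (eye[0] + t * gaze[0], eye[1] + t * gaze[1], eye[2] + t * gaze[2])
    rel = sub(hit, screen.origin)
    x, y = dot(rel, screen.u), dot(rel, screen.v)
    if 0.0 <= x <= screen.width and 0.0 <= y <= screen.height:
        return (x, y)
    return None


def locate_gaze(eye: Vec, gaze: Vec, screens: Sequence[Screen]) -> Optional[Tuple[str, Tuple[float, float]]]:
    """Report the first screen the line of sight lands on and the hit point."""
    for screen in screens:
        point = intersect(eye, gaze, screen)
        if point is not None:
            return screen.name, point
    return None
```

The gaze vector does not need to be normalized, because the ray parameter `t` absorbs its length.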

    AUTONOMOUS STEREO CAMERA CALIBRATION

    Publication number: US20220398780A1

    Publication date: 2022-12-15

    Application number: US17845652

    Application date: 2022-06-21

    Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to calibrate a stereo camera. An example apparatus includes means for determining a motion grid between a first image and a second image captured by the stereo camera; means for determining a calibration value to calibrate the stereo camera based on a prior calibration value, a relative orientation between the first image and the second image based on the motion grid, and a metric indicative of calibration improvement; and means for estimating a depth based on the calibration value.
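A rough sketch of the last two steps named in the abstract (deriving a calibration value from a prior value and an improvement metric, then estimating depth from it) is given below. The abstract does not state the update rule or what the calibration value represents, so both are assumptions here: the calibration value is treated as a scalar disparity offset in pixels, the update is a simple gated blend, and the motion-grid estimation that would produce the candidate value is left out. Only the depth relation Z = f * B / d is the standard pinhole stereo formula.

```python
def update_calibration(prior: float, candidate: float, improvement: float, rate: float = 0.5) -> float:
    """Hypothetical update rule: move toward the candidate only if it helps.

    `improvement` is an assumed metric where positive values mean the
    candidate calibration is better than the prior one.
    """
    if improvement <= 0.0:
        return prior
    return prior + rate * (candidate - prior)


def depth_from_disparity(focal_px: float, baseline_m: float, disparity_px: float, offset_px: float = 0.0) -> float:
    """Standard stereo depth Z = f * B / d, with d corrected by the offset.

    Treating the calibration value as a disparity offset is an assumption
    made for this example.
    """
    corrected = disparity_px - offset_px
    if corrected <= 0.0:
        raise ValueError("corrected disparity must be positive")
    return focal_px * baseline_m / corrected
```

For example, with a focal length of 800 px, a 0.1 m baseline, a measured disparity of 41.5 px, and a calibration offset of 1.5 px, the corrected disparity is 40 px and the estimated depth is 2 m.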
