Method, apparatus, and system for providing real-world distance information from a monocular image

    公开(公告)号:US10997740B2

    公开(公告)日:2021-05-04

    申请号:US16511892

    申请日:2019-07-15

    Abstract: An approach is provided for estimating a real-world depth information from a monocular image. The approach, for example, involves determining a vanishing point of the monocular image captured by a camera. The approach also involves generating a vanishing point ray from an optical center of the camera through the vanishing point on an image plane of the monocular image to infinity. The approach further involves generating a center line ray from the optical center through a geometric center of the image plane to a feature line that is parallel to the vanishing point ray at a lateral distance. The approach further involves generating a feature ray from the optical center through a location of the feature on the image plane to the feature line. The approach further involves computing the real-world distances of the feature based on image coordinates of the rays, lines, angles derived therefrom, and a known pixel-wise distance of the monocular image.

    Deep neural network architecture for image segmentation

    公开(公告)号:US11600006B2

    公开(公告)日:2023-03-07

    申请号:US16171814

    申请日:2018-10-26

    Abstract: An apparatus and method for encoding objects in a camera-captured image with a deep neural network pipeline including multiple convolutional neural networks or convolutional layers. After identifying at least a portion of the camera-capture image, a first convolutional layer is applied to the at least the portion of the camera-captured image and multiple subregion representations are pooled from the output of the first convolutional layer. One or more additional convolutions are performed. At least one deconvolution is performed and concatenated with the output of one or more convolutions. One or more final convolutions are performed. The at least the portion of the camera-captured image is classified as an object category in response to an output of the one or more final convolutions.

Patent Agency Ranking