Image sequence processing using neural networks

Invention Grant

US12106554B2 Image sequence processing using neural networks 有权

Please log in to see more content

Patent Title: Image sequence processing using neural networks
Application No.: US17619968

Application Date: 2019-06-18
Publication No.: US12106554B2

Publication Date: 2024-10-01
Inventor: Nicolas Livet
Applicant: XZIMG LIMITED
Applicant Address: CN Hong Kong
Assignee: XZIMG LIMITED
Current Assignee: XZIMG LIMITED
Current Assignee Address: CN Hong Kong
Agency: NIXON & VANDERHYE
International Application: PCT/EP2019/066031 2019.06.18
International Announcement: WO2020/253947A 2020.12.24
Date entered country: 2021-12-16
Main IPC: G06V10/82
IPC: G06V10/82 ; G06N3/044 ; G06N3/084 ; G06T7/20 ; G06V10/26 ; G06V10/764 ; G06V10/77 ; G06V10/772 ; G06V10/774 ; G06V10/80

Image sequence processing using neural networks

Abstract:

A recurrent multi-task CNN with an encoder and multiple decoders infers single value output and dense (image) outputs such as heatmaps and segmentation masks. Recurrence is obtained by reinjecting (with mere concatenation) heatmaps or masks (or intermediate feature maps) to a next input image (or to next intermediate feature maps) for a next CNN inference. The inference outputs may be refined using cascaded refiner blocks specifically trained. Virtual annotation for training video sequences can be obtained using computer analysis. Benefits of these approaches allows the depth of the CNN, i.e. the number of layers, to be reduced. They also avoid parallel independent inferences to be run for different tasks, while keeping similar prediction quality. Multiple task inferences are useful for Augmented Reality applications.

Public/Granted literature

US20220301295A1 RECURRENT MULTI-TASK CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE Public/Granted day:2022-09-22

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V10/00	图像或视频识别或理解的安排（图像或视频中的字符识别 G06V30/10）
G06V10/70	.使用模式识别或机器学习（光学模式识别或电子计算 G06V10/88）
G06V10/82	..使用神经网络