Invention Grant
- Patent Title: Image sequence processing using neural networks
-
Application No.: US17619968Application Date: 2019-06-18
-
Publication No.: US12106554B2Publication Date: 2024-10-01
- Inventor: Nicolas Livet
- Applicant: XZIMG LIMITED
- Applicant Address: CN Hong Kong
- Assignee: XZIMG LIMITED
- Current Assignee: XZIMG LIMITED
- Current Assignee Address: CN Hong Kong
- Agency: NIXON & VANDERHYE
- International Application: PCT/EP2019/066031 2019.06.18
- International Announcement: WO2020/253947A 2020.12.24
- Date entered country: 2021-12-16
- Main IPC: G06V10/82
- IPC: G06V10/82 ; G06N3/044 ; G06N3/084 ; G06T7/20 ; G06V10/26 ; G06V10/764 ; G06V10/77 ; G06V10/772 ; G06V10/774 ; G06V10/80

Abstract:
A recurrent multi-task CNN with an encoder and multiple decoders infers single value output and dense (image) outputs such as heatmaps and segmentation masks. Recurrence is obtained by reinjecting (with mere concatenation) heatmaps or masks (or intermediate feature maps) to a next input image (or to next intermediate feature maps) for a next CNN inference. The inference outputs may be refined using cascaded refiner blocks specifically trained. Virtual annotation for training video sequences can be obtained using computer analysis. Benefits of these approaches allows the depth of the CNN, i.e. the number of layers, to be reduced. They also avoid parallel independent inferences to be run for different tasks, while keeping similar prediction quality. Multiple task inferences are useful for Augmented Reality applications.
Public/Granted literature
- US20220301295A1 RECURRENT MULTI-TASK CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE Public/Granted day:2022-09-22
Information query