Self-supervised cross-video temporal difference learning for unsupervised domain adaptation

Invention Grant

US11676370B2 Self-supervised cross-video temporal difference learning for unsupervised domain adaptation (In force)

Patent Title: Self-supervised cross-video temporal difference learning for unsupervised domain adaptation
Application No.: US17317202

Application Date: 2021-05-11
Publication No.: US11676370B2

Publication Date: 2023-06-13
Inventor: Gaurav Sharma, Jinwoo Choi
Applicant: NEC Laboratories America, Inc.
Applicant Address: US NJ Princeton
Assignee: NEC Corporation
Current Assignee: NEC Corporation
Current Assignee Address: JP Tokyo
Agent: Joseph Kolodka
Main IPC: G06V10/764
IPC: G06V10/764 ; G06N3/08 ; G06N3/04 ; G06V20/40 ; G06V10/774 ; G06F18/21 ; G06F18/24 ; G06F18/211 ; G06F18/25 ; G06F18/214

Abstract:

A method is provided for Cross Video Temporal Difference (CVTD) learning. The method adapts a source domain video to a target domain video using a CVTD loss. The source domain video is annotated, and the target domain video is unannotated. The CVTD loss is computed by quantizing clips derived from the source and target domain videos by dividing the source domain video into source domain clips and the target domain video into target domain clips. The CVTD loss is further computed by sampling two clips from each of the source domain clips and the target domain clips to obtain four sampled clips including a first source domain clip, a second source domain clip, a first target domain clip, and a second target domain clip. The CVTD loss is computed as |(second source domain clip−first source domain clip)−(second target domain clip−first target domain clip)|.
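
The computation described in the abstract can be sketched in a few lines of Python. This is only an illustrative reading, not the patented implementation: it assumes each clip is represented by its integer position (index) within its video, so that the differences in the formula are temporal offsets. The abstract itself does not say how a clip is represented. The function names `split_into_clips`, `sample_two_clip_indices`, and `cvtd_value`, and the fixed `clip_len` parameter, are hypothetical.

```python
import random


def split_into_clips(num_frames, clip_len):
    """Quantize a video into consecutive, non-overlapping clips.

    Returns a list of (start_frame, end_frame) pairs; trailing frames
    that do not fill a whole clip are dropped (an assumption).
    """
    return [(start, start + clip_len)
            for start in range(0, num_frames - clip_len + 1, clip_len)]


def sample_two_clip_indices(num_clips, rng):
    """Sample two distinct clip indices (first, second) from one video."""
    first, second = rng.sample(range(num_clips), 2)
    return first, second


def cvtd_value(src_first, src_second, tgt_first, tgt_second):
    """|(second source - first source) - (second target - first target)|,
    with each clip stood in for by its index (an assumption)."""
    return abs((src_second - src_first) - (tgt_second - tgt_first))


if __name__ == "__main__":
    rng = random.Random(0)
    source_clips = split_into_clips(num_frames=64, clip_len=16)  # annotated video
    target_clips = split_into_clips(num_frames=80, clip_len=16)  # unannotated video
    s1, s2 = sample_two_clip_indices(len(source_clips), rng)
    t1, t2 = sample_two_clip_indices(len(target_clips), rng)
    print("sampled clip indices:", (s1, s2), (t1, t2))
    print("CVTD value:", cvtd_value(s1, s2, t1, t2))
```

Under this reading the quantity needs no annotation from either video, which is consistent with the target domain video being unannotated.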

Public/Granted literature

US20210374481A1 SELF-SUPERVISED CROSS-VIDEO TEMPORAL DIFFERENCE LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION Public/Granted day: 2021-12-02

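IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V10/00	Arrangements for image or video recognition or understanding (character recognition in images or video G06V30/10)
G06V10/70	.using pattern recognition or machine learning (optical pattern recognition or electronic computing G06V10/88)
G06V10/764	..using classification, e.g. of video objects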