Invention Grant
- Patent Title: Weakly supervised natural language localization networks for video proposal prediction based on a text query
-
Application No.: US16531343Application Date: 2019-08-05
-
Publication No.: US11687588B2Publication Date: 2023-06-27
- Inventor: Mingfei Gao , Richard Socher , Caiming Xiong
- Applicant: salesforce.com, inc.
- Applicant Address: US CA San Francisco
- Assignee: Salesforce.com, Inc.
- Current Assignee: Salesforce.com, Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Haynes and Boone, LLP
- Main IPC: G06F16/735
- IPC: G06F16/735 ; G06F16/73 ; G06V10/82 ; G06F16/74 ; G06V20/40 ; G06F17/10 ; G06N3/08 ; G06F40/47 ; G06F18/21 ; G06V10/44

Abstract:
Systems and methods are provided for weakly supervised natural language localization (WSNLL), for example, as implemented in a neural network or model. The WSNLL network is trained with long, untrimmed videos, i.e., videos that have not been temporally segmented or annotated. The WSNLL network or model defines or generates a video-sentence pair, which corresponds to a pairing of an untrimmed video with an input text sentence. According to some embodiments, the WSNLL network or model is implemented with a two-branch architecture, where one branch performs segment sentence alignment and the other one conducts segment selection. These methods and systems are specifically used to predict how a video proposal matches a text query using respective visual and text features.
Public/Granted literature
- US20200372116A1 Weakly Supervised Natural Language Localization Networks Public/Granted day:2020-11-26
Information query