Iterative spatio-temporal action detection in video
Abstract:
Iterative prediction systems and methods for action detection process an input sequence of video frames to generate action tubes and their respective action labels, wherein each action tube comprises a sequence of bounding boxes over the video frames. An iterative predictor handles large offsets between the bounding boxes and the ground truth.
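The abstract does not say how the iterative predictor is built, so the following Python sketch only illustrates the general idea of closing a large box offset over several passes. The names `ActionTube`, `refine_tube`, and `halfway_predictor` are assumptions made for illustration. The toy predictor, which moves each box halfway toward a known target, is a stand-in for a learned model and is not the claimed method.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# A box is (x1, y1, x2, y2) in pixel coordinates (assumed convention).
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container: one action label plus one box per frame."""
    label: str
    boxes: List[Box]


def refine_tube(
    tube: ActionTube,
    predict_offsets: Callable[[List[Box]], List[Box]],
    num_iters: int = 3,
) -> ActionTube:
    """Apply predicted per-frame offsets repeatedly, so that a large
    initial offset is closed in several smaller steps."""
    boxes = list(tube.boxes)
    for _ in range(num_iters):
        offsets = predict_offsets(boxes)
        boxes = [
            tuple(c + d for c, d in zip(box, off))
            for box, off in zip(boxes, offsets)
        ]
    return ActionTube(tube.label, boxes)


def halfway_predictor(targets: List[Box]) -> Callable[[List[Box]], List[Box]]:
    """Toy stand-in for a learned predictor: returns offsets that move
    each box halfway toward its target box."""
    def predict(boxes: List[Box]) -> List[Box]:
        return [
            tuple(0.5 * (t - c) for c, t in zip(box, target))
            for box, target in zip(boxes, targets)
        ]
    return predict


# Two-frame example with a deliberately poor initial tube.
initial = ActionTube("running", [(0.0, 0.0, 10.0, 10.0)] * 2)
targets = [(40.0, 40.0, 60.0, 60.0)] * 2
refined = refine_tube(initial, halfway_predictor(targets), num_iters=3)
```

In this toy setup each pass halves the remaining offset, so three passes leave one eighth of the initial error. A real system would replace `halfway_predictor` with a model that estimates offsets from the video frames.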