Recognizing minutes-long activities in videos
Abstract:
A method for classifying subject activities in videos includes learning latent (previously generated) concepts that are analogous to nodes of a graph to be generated for an activity in a video. The method also includes receiving video segments of the video. A similarity between the video segments and the previously generated concepts is measured to obtain segment representations as a weighted set of latent concepts. The method further includes determining a relationship between the segment representations and their transitioning pattern over time to determine a reduced set of nodes and/or edges for the graph. The graph of the activity in the video represented by the video segments is generated based on the reduced set of nodes and/or edges. The nodes of the graph are represented by the latent concepts. Subject activities in the video are classified based on the graph.
Public/Granted literature
Information query
Patent Agency Ranking
0/0