APPARATUS FOR VIDEO SEARCHING USING MULTI-MODAL CRITERIA AND METHOD THEREOF

    公开(公告)号:US20210193187A1

    公开(公告)日:2021-06-24

    申请号:US16725609

    申请日:2019-12-23

    Abstract: An apparatus for video searching, includes a memory storing instructions, and a processor configured to execute the instructions to split a video into scenes, obtain, from the scenes into which the video is split, one or more textual descriptors describing each of the scenes, encode the obtained one or more textual descriptors describing each of the scenes into a video scene vector of each of the scenes, encode a user query into a query vector having a same semantic representation as that of the video scene vector of each of the scenes into which the one or more textual descriptors describing each of the scenes are encoded, and identify whether the video scene vector of at least one among the scenes corresponds to the query vector into which the user query is encoded.

Patent Agency Ranking