NATURAL LANGUAGE OBJECT TRACKING
    1.
    Invention application
    Status: Under examination (published)

    Publication No.: WO2018089158A1

    Publication date: 2018-05-17

    Application No.: PCT/US2017/056195

    Filing date: 2017-10-11

    Abstract: A method of tracking an object across a sequence of video frames using a natural language query includes receiving the natural language query and identifying an initial target in an initial frame of the sequence of video frames based on the natural language query. The method also includes adjusting the natural language query, for a subsequent frame, based on content of the subsequent frame and/or a likelihood of a semantic property of the initial target appearing in the subsequent frame. The method further includes identifying a text driven target and a visual driven target in the subsequent frame. The method still further includes combining the visual driven target with the text driven target to obtain a final target in the subsequent frame.
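    The query adjustment and target combination steps in this abstract can be illustrated with a minimal sketch. The abstract does not say how the query is adjusted or how the two targets are combined, so everything here is an assumption made for illustration: the `Target` box representation, the `adjust_query` word filter with its `threshold` parameter, and the confidence-weighted averaging in `combine_targets` are hypothetical and are not taken from the application.

```python
from dataclasses import dataclass


@dataclass
class Target:
    """A candidate box (x, y, w, h) with a confidence score in [0, 1]."""
    x: float
    y: float
    w: float
    h: float
    score: float


def adjust_query(query_words, likelihoods, threshold=0.5):
    """Drop query words unlikely to be visible in the subsequent frame.

    `likelihoods` maps a word to an assumed per-frame likelihood that the
    corresponding semantic property appears; words without an entry are kept.
    """
    return [w for w in query_words if likelihoods.get(w, 1.0) >= threshold]


def combine_targets(text_target, visual_target):
    """Blend the two boxes by confidence-weighted averaging (assumed rule)."""
    total = text_target.score + visual_target.score
    if total == 0:
        # Neither cue carries any confidence; keep the visual box unchanged.
        return visual_target
    a = text_target.score / total
    b = visual_target.score / total
    return Target(
        x=a * text_target.x + b * visual_target.x,
        y=a * text_target.y + b * visual_target.y,
        w=a * text_target.w + b * visual_target.w,
        h=a * text_target.h + b * visual_target.h,
        score=max(text_target.score, visual_target.score),
    )


if __name__ == "__main__":
    words = adjust_query(["woman", "red", "dress"], {"red": 0.2})
    print(words)
    final = combine_targets(Target(0, 0, 10, 10, 0.5), Target(10, 10, 20, 20, 0.5))
    print(final)
```

    A learned fusion model could replace the weighted average; the averaging rule is used here only because it makes the idea of merging a text-driven and a visual-driven estimate easy to follow.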


    ENHANCED SIAMESE TRACKERS
    3.
    Invention application
    Status: Under examination (published)

    Publication No.: WO2018084948A1

    Publication date: 2018-05-11

    Application No.: PCT/US2017/052545

    Filing date: 2017-09-20

    Abstract: In one configuration, a visual object tracking apparatus is provided that receives a position of an object in a first frame of a video and determines a current position of the object in subsequent frames of the video using a Siamese neural network. To facilitate determining the current position of the object, the apparatus may adjust a spatial resolution of an image, adjust a size of a probe region, and/or adjust a scale of a plurality of sampled images. In another configuration, a visual object tracking apparatus using a Siamese neural network is provided. The apparatus feeds outputs from a plurality of subnetworks of the Siamese neural network to a comparison layer. In addition, the apparatus compares, at the comparison layer, inputs from the plurality of subnetworks to generate a comparison result. Further, the apparatus combines comparison results based on weights to obtain a final comparison result.
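    The comparison and weighted combination described in this abstract can be sketched as follows. The abstract names neither the comparison function nor the weighting scheme, so cosine similarity, the normalised weighted sum, and the helper names `combined_score` and `best_candidate` are assumptions made for illustration. Feature vectors are plain Python lists standing in for subnetwork outputs.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def combined_score(exemplar_feats, candidate_feats, weights):
    """Compare features per subnetwork output, then merge the results by weight.

    Each of `exemplar_feats` and `candidate_feats` holds one feature vector
    per subnetwork output; `weights` holds one non-negative weight per output.
    """
    if not (len(exemplar_feats) == len(candidate_feats) == len(weights)):
        raise ValueError("need one weight and one feature pair per output")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    # One comparison result per subnetwork output (the assumed comparison layer).
    results = [cosine_similarity(e, c)
               for e, c in zip(exemplar_feats, candidate_feats)]
    # Normalised weighted sum of the per-output comparison results.
    return sum(w * r for w, r in zip(weights, results)) / total


def best_candidate(exemplar_feats, candidates, weights):
    """Return the index of the candidate with the highest combined score."""
    scores = [combined_score(exemplar_feats, c, weights) for c in candidates]
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    exemplar = [[1.0, 0.0], [0.0, 1.0]]
    candidates = [
        [[0.0, 1.0], [1.0, 0.0]],  # mismatched at both outputs
        [[1.0, 0.0], [0.0, 1.0]],  # identical to the exemplar
    ]
    print(best_candidate(exemplar, candidates, [0.5, 0.5]))
```

    In a tracker, the candidates would be image patches sampled around the previous position, possibly at several scales, and the selected index would give the new position estimate.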

