-
公开(公告)号:US20190332935A1
公开(公告)日:2019-10-31
申请号:US16389897
申请日:2019-04-19
Applicant: QUALCOMM Incorporated
Inventor: Yoel SANCHEZ BERMUDEZ , Efstratios GAVVES , Ran TAO
Abstract: An apparatus may be configured to obtain, for a Siamese neural network having a recurrent neural network (RNN), an initial representation associated with a target object at a first time step and a set of candidate regions at a current time step. The apparatus may determine an updated representation associated with the target object based on the initial representation at the first time step and observed information associated with the target object at a set of previous time steps, and the observed information associated with the target object may be represented by a hidden state of the RNN. The apparatus may output the updated representation associated with the target object for matching with the set of candidate regions at the current time step by the Siamese neural network. The apparatus may determine the updated representation further based on a hidden state at a previous time step.
-
公开(公告)号:US20200012865A1
公开(公告)日:2020-01-09
申请号:US16577515
申请日:2019-09-20
Applicant: QUALCOMM Incorporated
Inventor: Ran TAO , Efstratios GAVVES , Arnold Wilhelmus Maria SMEULDERS
Abstract: A method of tracking a position of a target object in a video sequence includes identifying the target object in a reference frame. A generic mapping is applied to the target object being tracked. The generic mapping is generated by learning possible appearance variations of a generic object. The method also includes tracking the position of the target object in subsequent frames of the video sequence by determining whether an output of the generic mapping of the target object matches an output of the generic mapping of a candidate object.
-
公开(公告)号:US20190354865A1
公开(公告)日:2019-11-21
申请号:US16417430
申请日:2019-05-20
Applicant: QUALCOMM Incorporated
Inventor: Matthias REISSER , Max WELLING , Efstratios GAVVES , Christos LOUIZOS
Abstract: A neural network may be configured to receive, during a training phase of the neural network, a first input at an input layer of the neural network. The neural network may determine, during the training phase, a first classification at an output layer of the neural network based on the first input. The neural network may adjust, during the training phase and based on a comparison between the determined first classification and an expected classification of the first input, weights for artificial neurons of the neural network based on a loss function. The neural network may output, during an operational phase of the neural network, a second classification determined based on a second input, the second classification being determined by processing the second input through the artificial neurons using the adjusted weights.
-
公开(公告)号:US20180129742A1
公开(公告)日:2018-05-10
申请号:US15587196
申请日:2017-05-04
Applicant: QUALCOMM Incorporated
Inventor: Zhenyang LI , Ran TAO , Efstratios GAVVES , Cornelis Gerardus Maria SNOEK , Arnold Wilhelmus Maria SMEULDERS
CPC classification number: G06F16/7844 , G06F16/3334 , G06F16/338 , G06F16/7834 , G06F16/7837 , G06K9/00744 , G06K9/00771 , G06K9/66 , G06N3/0445 , G06N3/0454 , G06N3/084
Abstract: A method of tracking an object across a sequence of video frames using a natural language query includes receiving the natural language query and identifying an initial target in an initial frame of the sequence of video frames based on the natural language query. The method also includes adjusting the natural language query, for a subsequent frame, based on content of the subsequent frame and/or a likelihood of a semantic property of the initial target appearing in the subsequent frame. The method further includes identifying a text driven target and a visual driven target in the subsequent frame. The method still further includes combining the visual driven target with the text driven target to obtain a final target in the subsequent frame.
-
公开(公告)号:US20170262705A1
公开(公告)日:2017-09-14
申请号:US15267621
申请日:2016-09-16
Applicant: QUALCOMM Incorporated
Inventor: Zhenyang LI , Efstratios GAVVES , Mihir JAIN , Cornelis Gerardus Maria SNOEK
CPC classification number: G06K9/00718 , G06K9/00342 , G06K9/6269 , G06N3/0445 , G06N3/0454
Abstract: A method of predicting action labels for a video stream includes receiving the video stream and calculating an optical flow of consecutive frames of the video stream. An attention map is generated from the current frame of the video stream and the calculated optical flow. An action label is predicted for the current frame based on the optical flow, a previous hidden state and the attention map.
-
公开(公告)号:US20170132472A1
公开(公告)日:2017-05-11
申请号:US15192935
申请日:2016-06-24
Applicant: QUALCOMM Incorporated
Inventor: Ran TAO , Efstratios GAVVES , Arnold Wilhelmus Maria SMEULDERS
CPC classification number: G06K9/00758 , G06K9/3241 , G06K9/4628 , G06K9/6234 , G06K2009/3291 , G06N3/04 , G06T7/2033 , G06T7/248 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: A method of tracking a position of a target object in a video sequence includes identifying the target object in a reference frame. A generic mapping is applied to the target object being tracked. The generic mapping is generated by learning possible appearance variations of a generic object. The method also includes tracking the position of the target object in subsequent frames of the video sequence by determining whether an output of the generic mapping of the target object matches an output of the generic mapping of a candidate object.
-
公开(公告)号:US20180314896A1
公开(公告)日:2018-11-01
申请号:US16030685
申请日:2018-07-09
Applicant: QUALCOMM Incorporated
Inventor: Ran TAO , Efstratios GAVVES , Arnold Wilhelmus Maria SMEULDERS
CPC classification number: G06K9/00758 , G06K9/3241 , G06K9/4628 , G06K9/6234 , G06K2009/3291 , G06N3/04 , G06N3/0454 , G06N3/08 , G06T7/246 , G06T7/248 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: A method of tracking a position of a target object in a video sequence includes identifying the target object in a reference frame. A generic mapping is applied to the target object being tracked. The generic mapping is generated by learning possible appearance variations of a generic object. The method also includes tracking the position of the target object in subsequent frames of the video sequence by determining whether an output of the generic mapping of the target object matches an output of the generic mapping of a candidate object.
-
公开(公告)号:US20180137360A1
公开(公告)日:2018-05-17
申请号:US15629663
申请日:2017-06-21
Applicant: QUALCOMM Incorporated
CPC classification number: G06K9/00718 , G06F16/3331 , G06K9/00751 , G06K9/627 , G06K2009/00738 , G06N3/0454 , G06N3/08
Abstract: A method, a computer-readable medium, and an apparatus for zero-exemplar event detection are provided. The apparatus may receive a plurality of text blocks, each of which may describe one of a plurality of pre-defined events. The apparatus may receive a plurality of training videos, each of which may be associated with one of the plurality of text blocks. The apparatus may propagate each text block through a neural network to obtain a textual representation in a joint space of textual and video representations. The apparatus may propagate each training video through the neural network to obtain a visual representation in the joint space. The apparatus may adjust parameters of the neural network to reduce, for each pair of associated text block and training video, the distance in the joint space between the textual representation of the associated text block and the visual representation of the associated training video.
-
公开(公告)号:US20190026917A1
公开(公告)日:2019-01-24
申请号:US16039133
申请日:2018-07-18
Applicant: QUALCOMM Incorporated
Inventor: Shuai LIAO , Efstratios GAVVES , Cornelis Gerardus Maria SNOEK
Abstract: A method aligns, with an artificial neural network, a three-dimensional (3D) model to an object in a 2D image. The method includes detecting, with an object detector, the object from the 2D image. The method also includes estimating a geodesic distance value between the object and multiple discretized poses of the 3D model. The method further includes selecting a discretized pose of the multiple discretized poses corresponding to a smallest geodesic distance value. The method still further includes propagating pose parameters of the selected discretized pose of the 3D model to the object.
-
公开(公告)号:US20180129934A1
公开(公告)日:2018-05-10
申请号:US15621741
申请日:2017-06-13
Applicant: QUALCOMM Incorporated
Inventor: Ran TAO , Efstratios GAVVES , Arnold Wilhelmus Maria SMEULDERS
CPC classification number: G06N3/0454 , G06K9/00624 , G06K9/3241 , G06K2009/3291 , G06N3/084 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: In one configuration, a visual object tracking apparatus is provided that receives a position of an object in a first frame of a video, and determines a current position of the object in subsequent frames of the video using a Siamese neural network To facilitate determining the current position of the object, the apparatus may adjust a spatial resolution of an image, adjust a size of a probe region, and/or adjust a scale of a plurality of sampled images. In one configuration, a visual object tracking using a Siamese neural network is provided. The apparatus feeds outputs from a plurality of subnetworks of the Siamese neural network to a comparison layer. In addition, the apparatus compares, at the comparison layer, inputs from the plurality of subnetworks to generate a comparison result. Further, the apparatus combines comparison results based on weights to obtain a final comparison result.
-
-
-
-
-
-
-
-
-