-
公开(公告)号:US20220405322A1
公开(公告)日:2022-12-22
申请号:US17354786
申请日:2021-06-22
申请人: Varshanth RAO , Md Ibrahim KHALIL , Peng DAI , Juwei LU
发明人: Varshanth RAO , Md Ibrahim KHALIL , Peng DAI , Juwei LU
IPC分类号: G06F16/55 , G06F16/53 , G06F16/51 , G06F16/583 , G06K9/62
摘要: Methods, systems, and media for image searching are described. Images comprising one query image and a plurality of candidate images are received. For each candidate image, a first model similarity measure from an output of a first model configured for scene classification to perceive scenes in the images is determined. Further, for each candidate image of the plurality of candidate images, a second model similarity measure from the output of a second model configured for attribute classification to perceive attributes in the images is determined. For each candidate image of the plurality of candidate images, a similarity agglomerate index of a weighted aggregate of the first model similarity measure and the second model similarity measure is computed. The plurality of candidate images based on the respective similarity agglomerate index of each candidate image are ranked and a first ranked candidate images corresponding to the searched images are generated.
-
公开(公告)号:US20220222469A1
公开(公告)日:2022-07-14
申请号:US17145219
申请日:2021-01-08
申请人: Varshanth RAO , Peng DAI , Hanwen LIANG , Md Ibrahim KHALIL , Juwei LI
发明人: Varshanth RAO , Peng DAI , Hanwen LIANG , Md Ibrahim KHALIL , Juwei LI
摘要: System and method of analyzing a video, comprising dividing the video into a set of successive basic units; generating semantic tags for the basic units using a set of hierarchical classifier nodes that comprise a parent classifier node and a plurality of child classifier nodes, wherein the basic units are each routed through selected child classifier nodes based on classification of the basic units by the parent classifier node; and generating a semantic topic for the video based on the semantic tags generated for the basic units.
-
公开(公告)号:US20210142106A1
公开(公告)日:2021-05-13
申请号:US17095257
申请日:2020-11-11
申请人: Niamul QUADER , Md Ibrahim KHALIL , Juwei LU , Peng DAI , Wei LI
发明人: Niamul QUADER , Md Ibrahim KHALIL , Juwei LU , Peng DAI , Wei LI
摘要: Methods and systems for updating the weights of a set of convolution kernels of a convolutional layer of a neural network are described. A set of convolution kernels having attention-infused weights is generated by using an attention mechanism based on characteristics of the weights. For example, a set of location-based attention multipliers is applied to weights in the set of convolution kernels, a magnitude-based attention function is applied to the weights in the set of convolution kernels, or both. An output activation map is generated using the set of convolution kernels with attention-infused weights. A loss for the neural network is computed, and the gradient is back propagated to update the attention-infused weights of the convolution kernels.
-
4.
公开(公告)号:US20220114424A1
公开(公告)日:2022-04-14
申请号:US17066220
申请日:2020-10-08
申请人: Niamul QUADER , Md Ibrahim KHALIL , Juwei LU , Peng DAI , Wei LI
发明人: Niamul QUADER , Md Ibrahim KHALIL , Juwei LU , Peng DAI , Wei LI
摘要: Methods, processing units and media for multi-bandwidth separated feature extraction convolution in a neural network are described. A convolution block splits input channels of an activation map into multiple branches, each branch undergoing convolution at a different bandwidth by using down-sampling of the inputs. The outputs are concatenated by up-sampling the outputs of the low-bandwidth branches using pixel shuffling. The concatenation operation may be a shuffled concatenation operation that preserves separated multi-bandwidth feature information for use by subsequent layers of the neural network. Embodiments are described which apply frequency-based and magnitude-based attention to the weights of the convolution kernels based on the frequency band locations of the weights.
-
-
-