-
1.
公开(公告)号:US20230394079A1
公开(公告)日:2023-12-07
申请号:US18453838
申请日:2023-08-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Haotian ZHANG , Allan JEPSON , Iqbal Ismail MOHOMED , Konstantinos DERPANIS , Ran ZHANG , Afsaneh FAZLY
IPC: G06F16/535 , G06N20/00
CPC classification number: G06F16/535 , G06N20/00
Abstract: A method of personalized image retrieval includes obtaining a natural language query including a name; replacing the name in the natural language query with a generic term to provide an anonymized query and named entity information; obtaining a plurality of initial ranking scores and a plurality of attention weights corresponding to a plurality of images using a trained scoring model that inputs the anonymized query and the plurality of images; obtaining a plurality of delta scores corresponding to the plurality of images using a re-scoring model that inputs the plurality of attention weights and the named entity information; and obtaining a plurality of final ranking scores by modifying the plurality of initial ranking scores based on the plurality of delta scores. The trained scoring model performs semantic based searching and the re-scoring model determines a probability that faces detected in the plurality of images correspond to the name.
-
公开(公告)号:US20240370491A1
公开(公告)日:2024-11-07
申请号:US18428626
申请日:2024-01-31
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Zhiming HU , Ning YE , Iqbal Ismail MOHOMED , Salar HOSSEINI KHORASGANI
IPC: G06F16/74 , G06F16/71 , G06F16/738 , G06N3/0499 , G06V10/776 , G06V10/82 , G06V10/94 , G06V20/40
Abstract: Provided are system, method, and device for performing multimodal video retrieval. According to embodiments, the method may include: obtaining a first plurality of frames of a video; selecting a second plurality of frames from among the first plurality of frames using a frame selection module, wherein a number of the second plurality of frames may be less than a number of the first plurality of frames; determining a representation of the video based on the selected second plurality of frames using a neural network model; and storing the representation of the video in a memory.
-
公开(公告)号:US20220138489A1
公开(公告)日:2022-05-05
申请号:US17402877
申请日:2021-08-16
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ning YE , Zhiming HU , Caleb Ryan PHILLIPS , Iqbal Ismail MOHOMED
IPC: G06K9/62 , G06K9/00 , G06F16/732 , G06N20/00
Abstract: A method of real-time video event detection includes: obtaining, based on a natural language query, a query vector; performing multimodal feature extraction on a video stream to obtain a video vector, obtaining a similarity score by comparing the query vector to the video vector; comparing the similarity score to a predetermined threshold; and activating, based on the similarity score being above the predetermined threshold, an action trigger. The multimodal feature extraction is performed using a plurality of overlapping windows that include sequential frames of the video stream.
-
-