-
Publication No.: US12051227B2
Publication date: 2024-07-30
Application No.: US17948981
Filing date: 2022-09-20
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ashima Jain, Ruchika Saxena, Maneesh Jain, Sachin Dev Sharma
CPC classification number: G06V10/25, G06T19/006, G06V20/20, G06V20/63
Abstract: A method for selecting a marker in an augmented reality (AR) environment is provided. The method includes capturing a scene in the AR environment; extracting a set of regions of interest (ROIs) from the captured scene; identifying text in an ROI or in a document associated with the ROI; determining a set of phrase-action pairs from the text; generating a representation of the set of ROIs and a representation of the set of phrase-action pairs; calculating inter-modal similarity between the set of ROIs and the set of phrase-action pairs in a common embedding space; computing intra-modal similarity by comparing each extracted ROI with a generated ROI and each extracted phrase-action pair with a generated phrase-action pair; and selecting the phrase-action-ROI tuple having the highest intra-modal similarity as the marker.
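Illustrative sketch (not taken from the patent text): the selection step described in the abstract could look roughly like the Python below. Everything here is an assumption made for illustration: the names Candidate, cosine, and select_marker are hypothetical, cosine similarity stands in for whatever similarity measure the claims actually use, the inter-modal score is used only as a shortlist threshold, and each item carries a single "generated" embedding rather than one per tuple. The abstract does not say how the generated ROI or generated phrase-action is produced, so those embeddings are simply taken as inputs.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class Candidate:
    """Hypothetical container for an ROI or a phrase-action pair."""
    name: str
    extracted: Vector  # embedding of the extracted item (common embedding space)
    generated: Vector  # embedding of its generated counterpart (assumed given)


def select_marker(
    rois: List[Candidate],
    pairs: List[Candidate],
    inter_threshold: float = 0.5,  # assumed cutoff, not from the source
) -> Tuple[Optional[Tuple[str, str]], float]:
    """Return ((phrase_action, roi), score) for the best tuple, or (None, -inf)."""
    best: Optional[Tuple[str, str]] = None
    best_score = -math.inf
    for roi in rois:
        for pair in pairs:
            # Inter-modal similarity: ROI vs. phrase-action pair in the shared space.
            inter = cosine(roi.extracted, pair.extracted)
            if inter < inter_threshold:
                continue  # assumption: weakly related tuples are not considered
            # Intra-modal similarity: extracted vs. generated within each modality,
            # averaged over the two modalities (the averaging is an assumption).
            intra = 0.5 * (
                cosine(roi.extracted, roi.generated)
                + cosine(pair.extracted, pair.generated)
            )
            if intra > best_score:
                best, best_score = (pair.name, roi.name), intra
    return best, best_score
```

Called with a list of ROI candidates and a list of phrase-action candidates, select_marker returns the names of the winning tuple together with its intra-modal score, or (None, -inf) when no tuple passes the inter-modal threshold.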
-
Publication No.: US20230111356A1
Publication date: 2023-04-13
Application No.: US17948981
Filing date: 2022-09-20
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ashima Jain, Ruchika Saxena, Maneesh Jain, Sachin Dev Sharma
Abstract: A method for selecting a marker in an augmented reality (AR) environment is provided. The method includes capturing a scene in the AR environment; extracting a set of regions of interest (ROIs) from the captured scene; identifying text in an ROI or in a document associated with the ROI; determining a set of phrase-action pairs from the text; generating a representation of the set of ROIs and a representation of the set of phrase-action pairs; calculating inter-modal similarity between the set of ROIs and the set of phrase-action pairs in a common embedding space; computing intra-modal similarity by comparing each extracted ROI with a generated ROI and each extracted phrase-action pair with a generated phrase-action pair; and selecting the phrase-action-ROI tuple having the highest intra-modal similarity as the marker.
-