-
Publication No.: US20230206665A1
Publication Date: 2023-06-29
Application No.: US18076787
Application Date: 2022-12-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Younguk KIM, Kyungsu KIM, Ohjoon KWON, Yehoon KIM, Hyunhan KIM, Hyosang KIM, Hyungmin LEE
IPC: G06V30/148, G06F40/253, G06F40/232, G06T7/70, G06V30/19
CPC classification number: G06V30/148, G06F40/253, G06F40/232, G06T7/70, G06V30/19, G06T2207/20132
Abstract: A method and an electronic device for recognizing text are provided. The method includes detecting positions of pieces of text included in the text in the image, generating cropped images by cropping areas corresponding to the pieces of text in the image, recognizing characters of the pieces of text based on the cropped images, generating a sentence by inputting the positions of the pieces of text and the characters of the pieces of text to a multimodal language model, wherein the multimodal language model is an artificial intelligence (AI) model for inferring an original sentence of the text, and displaying the sentence.
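The sequence of steps in this abstract (detect positions, crop, recognize characters, infer a sentence) can be sketched roughly as follows. This is only an illustrative outline, not the patented implementation: the names `TextPiece`, `crop`, and `recognize_text` are hypothetical, the image is assumed to be a plain list of pixel rows, and the detector, character recognizer, and multimodal language model are passed in as caller-supplied functions because the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Assumed conventions (not from the patent): a box is (left, top, right, bottom)
# with right/bottom exclusive, and an image is a sequence of pixel rows.
Box = Tuple[int, int, int, int]
Image = Sequence[Sequence[int]]


@dataclass
class TextPiece:
    """One detected piece of text: where it is and what it says."""
    box: Box
    characters: str


def crop(image: Image, box: Box) -> List[List[int]]:
    """Return the sub-image covered by the box."""
    left, top, right, bottom = box
    return [list(row[left:right]) for row in image[top:bottom]]


def recognize_text(
    image: Image,
    detect: Callable[[Image], List[Box]],
    recognize: Callable[[List[List[int]]], str],
    infer_sentence: Callable[[List[TextPiece]], str],
) -> str:
    """Run the four steps in order; all three callables are hypothetical stand-ins."""
    boxes = detect(image)                       # 1. positions of the pieces of text
    crops = [crop(image, box) for box in boxes]  # 2. one cropped image per piece
    pieces = [
        TextPiece(box, recognize(cropped))       # 3. characters of each piece
        for box, cropped in zip(boxes, crops)
    ]
    return infer_sentence(pieces)                # 4. positions + characters -> sentence
```

Passing both the box and the recognized characters to `infer_sentence` mirrors the abstract's point that the model receives positions as well as characters, so it can use layout when reconstructing the original sentence.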
-
Publication No.: US20230206298A1
Publication Date: 2023-06-29
Application No.: US18148629
Application Date: 2022-12-30
Applicant: Samsung Electronics Co., Ltd.
Inventor: Kyungsu KIM, Ohjoon KWON, Younguk KIM, Hyunsoo CHOI, Yehoon KIM, Hyunhan KIM, Hyosang KIM, Hyungmin LEE
IPC: G06Q30/0601, G06V10/80, G06V30/14
CPC classification number: G06Q30/0627, G06V10/806, G06V30/1444
Abstract: A method and electronic device for recognizing a product are provided. The method includes obtaining first feature information and second feature information from an image related to a product, obtaining fusion feature information based on the first feature information and the second feature information by using a main encoder model that reflects a correlation between feature information of different modalities, matching the fusion feature information against a database of the product, and providing information about the product, based on a result of the matching.
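The fuse-then-match flow in this abstract can be illustrated with a small sketch. It rests on loud assumptions: the abstract's main encoder model is not described here, so `fuse` below substitutes plain concatenation with L2 normalization, which does not model cross-modal correlation; cosine similarity is likewise an assumed matching rule; and the names `fuse`, `cosine`, and `match_product` are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def fuse(first: Sequence[float], second: Sequence[float]) -> List[float]:
    """Stand-in for the main encoder: concatenate both feature vectors and
    L2-normalize. A real encoder would learn cross-modal correlations."""
    joined = [float(v) for v in first] + [float(v) for v in second]
    norm = math.sqrt(sum(v * v for v in joined))
    return [v / norm for v in joined] if norm else joined


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def match_product(
    fusion: Sequence[float], database: Dict[str, Sequence[float]]
) -> Tuple[str, float]:
    """Return the best-matching product name and its similarity score.
    Raises ValueError if the database is empty."""
    scored = ((name, cosine(fusion, vector)) for name, vector in database.items())
    return max(scored, key=lambda pair: pair[1])
```

A caller would build the database by applying `fuse` to the stored features of each known product, then look up the returned name to provide the product information.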
-