-
公开(公告)号:US20240394284A1
公开(公告)日:2024-11-28
申请号:US18386803
申请日:2023-11-03
Applicant: Google LLC
Inventor: Daniel Vlasic , Yiming Gu , Daniel Hernandez Diaz , Ilaï Deutel , Xi Xiong , Tianli Yu , Joseph Pagadora , Mingyang Ling , Jill Daley , Guolong Su
IPC: G06F16/332 , G06F40/40 , G06T7/11 , G06V30/414
Abstract: An aspect of the disclosed technology is a system and process that are able to answer a document query as text and also provide the location in an image where the answer text is detected. In one aspect of the disclosed technology, a machine learning model combines vision and language features for joint learning.